Appl. Sci. 2020, 10, 8; doi:10.3390/app10010008 www.mdpi.com/journal/applsci 

Article 

Perspective Morphometric Criteria for Facial Beauty 
and Proportion Assessment 
Luca Ulrich 1,*, Jean-Luc Dugelay 2, Enrico Vezzetti 1, Sandro Moos 1 and Federica Marcolin 1 

1 DIGEP, Politecnico di Torino, 10124 Torino, Italy; enrico.vezzetti@polito.it (E.V.); sandro.moos@polito.it 
(S.M.); federica.marcolin@polito.it (F.M.) 

2 Digital Security, Campus SophiaTech, EURECOM, CS 50193 Biot Sophia Antipolis CEDEX, France; jean-
luc.dugelay@eurecom.fr 

* Correspondence: luca.ulrich@polito.it; Tel.: +39-011-090-7205 

Received: 31 October 2019; Accepted: 16 December 2019; Published: 18 December 2019 

Abstract: Common sense usually considers the assessment of female human attractiveness to be 
subjective. Nevertheless, in the past decades, several studies and experiments showed that an 
objective component in beauty assessment exists and can be strictly related, even if it does not 
match, with proportions of features. Proportions can be studied through analysis of the face, which 
relies on landmarks, i.e., specific points on the facial surface, which are shared by everyone, and 
measurements between them. In this work, several measures have been gathered from studies in 
the literature considering datasets of beautiful women to build a set of measures that can be defined 
as suggestive of female attractiveness. The resulting set consists of 29 measures applied to a public 
dataset, the Bosphorus database, whose faces have been both analyzed by the developed 
methodology based on the expanded set of measures and judged by human observers. Results show 
that the set of chosen measures is significant in terms of attractiveness evaluation, confirming the 
key role of proportions in beauty assessment; furthermore, the sorting of identified measures has 
been performed to identify the most significant canons involved in the evaluation. 

Keywords: face analysis; face proportions; attractiveness; 3D landmarks; features extraction 
 

1. Introduction 

The beauty and proportion of the human face have always been an object of interest through the 
years, as evidenced by Greek sculptures, ancient Egyptian paintings, and even in prehistory [1]. Some 
studies have also been carried on during the Renaissance period by well-known artists, such as 
Leonardo da Vinci [2] and Michelangelo Buonarroti [3], but it is from the 20th century that systematic 
studies focused on objective assessments began. 

Several studies of facial aesthetics have focused on specific aspects, such as the role of the divine 
proportion legacy of ancient Greeks, [4–6], the average intended as an instrument to delete 
imperfections [7,8], the average plus exaggerated sex-specific traits to discriminate between beauty 
and attractiveness [9], and the relative importance of some traits of the face [10]. The presence of a 
strong objective component in beauty assessment has been clearly shown by several experiments, 
including Iliffe’s [11], Udry’s [12] and Cunningham’s [13]. 

Cultural influence on beauty assessment is very strong, as testified by different canons that have 
been adopted through the years to assess female beauty. During the Paleolithic period, statuettes of 
“Venus”, representing women, were sculpted in such a way that they looked full-figured to 
symbolize fecundation, fertility and regeneration [14]. Ancient Egyptians considered a large forehead 
and well-defined mandibles attractive [15]. On the contrary, Greeks preferred an oval facial shape for 
both men and women, and a forehead as small as possible to highlight the hair [16]. In the middle 


Appl. Sci. 2020, 10, 8 2 of 18 

age, there is evidence of a preference for larger foreheads and absence of wrinkles, even if the sign at 
the time was not considered ugly, as testified by contemporary positive reflections on grey hair [17]. 

Cultural differences can also be found within the same period. In recent times, debates over 
femininity depicted by the media have been widely discussed [18], suggesting that beauty relies on 
the eye of the beholder, but there are several experiments, including the above-mentioned, suggesting 
that beauty is assessed through quantitative tips, even if hidden, especially regarding the face. 

As criteria for beauty assessment began to be presented to the scientific community [19], several 
studies focusing on analyzing databases of attractive and common individuals have been conducted, 
from the identification of facial esthetic canons in Italian children in the deciduous and early mixed 
dentition [20] to the soft-tissue analysis of adolescent boys’ and girls’ faces [21]. There have also been 
studies on people of non-Caucasian ethnicity, such as Southern Chinese faces [22]; nevertheless, 
women have been the most widely studied subset of the human population. 

Several works show comparisons between normal and attractive women. The normal term has 
been used in the literature to identify common, non-selected women, whereas attractive is typically 
used to identify good-looking women, typically chosen among actresses [23] or beauty contest 
participants [24,25]. Results appearing in these studies confirm the presence of objective elements 
defining beauty related to the concept of proportions between different parts of the face in terms of 
Euclidean distances and angular measurements. In those works, as in the current research, 
attractiveness is understood as the harmony between facial forms rather than the sexual impact on 
the observer. Objective beauty would be the expression that fits best for defining the kind of 
attractiveness evaluated by human judges in beauty contests, which is the same faced by the current 
work. Nonetheless, it would be superficial to claim that beauty can be totally assessed from a 
quantitative point of view, thus it has been chosen to preserve the attractiveness acceptation intended 
in previously cited works in this field. 

The medical field is the most interesting discipline in studying this topic, since some branches 
of surgery must intervene directly on face by modifying the shape, both for merely aesthetic and 
pathological reasons [26]; therefore, it is not surprising that the study of two orthodontists, Peck and 
Peck [27], was one of the first works aimed at discovering and gathering facial features. 

Over the last two decades, there has been an incremental increase of 3D imaging, such as MRI, 
CT [28] and stereophotogrammetry [29–31], and 3D modelling [32]. In this context, geometrical 
descriptors [33] and landmarks [34] proved to be highly effective tools, since they allow the observer 
to gather common traits that everybody shares to perform facial analysis. This evolution has provided 
the possibility of analyzing point clouds reproducing patients’ faces for diagnosis [35,36] and surgical 
intervention planning, for which handbooks are nowadays widely used [37], but it has also allowed 
us to build virtual faces from scratch [38]. Before the advent of 3D tools, face operation planning 
evaluations relied on two-dimensional images acquired on sagittal, coronal, and axial planes [39,40]. 

The idea for this research project was conceived to make a socially-responsible contribution to 
this field. In past months, a partnership with the maxillofacial department of Molinette hospital in 
Turin allowed the authors of this paper to review all findings linked to facial reconstruction due to 
pathological problems. Very general guidelines for tissue reconstruction after a surgical intervention 
exist, but they are not a valuable auxilium, and physicians need more accurate indications. Obviously, 
the reconstruction target is to obtain a good-looking face, and the present work has been directed on 
that need. Nonetheless, surgeons’ experience is irreplaceable. 

Facial proportions and beauty are strongly interconnected concepts. However, while 
proportions can be evaluated mainly from a 3D point of view, beauty assessment is more complex; 
indeed, facial expression [41] and all elements perceivable with color information cannot be ignored 
[42]. Also, 2D information has been used to manipulate textures [43], figuring out that not only 
attractiveness, but also other individual characteristics, such as age and health, are differently 
perceived because of colors cues [44,45]. Skin is the most studied element in terms of color and 
texture, because of the wide percentage of facial surface which recovers [46–48], and it has been 
proved that factors such as makeup and photo quality have a strong influence on attractiveness [49]. 


Appl. Sci. 2020, 10, 8 3 of 18 

Some databases of human faces were built to be analyzed and to provide new suggestions for 
further feature extraction and proportion studies, but also to validate results already obtained. Two 
available databases are the 3D Facial Norm (3DFN) [50] and the Bosphorus sets, employed in the 
development of the current research work, which provides faces belonging to more than one hundred 
subjects in various poses, expressions and different types of occlusions [51]. 

The present work gathers facial measures that have been identified in previous studies as 
representative of female face attractiveness with the purpose of classifying female faces. After that, 
the Bosphorus, meaning a database of normal women, has been used to test whether the final set of 
canons is suitable and sufficient for women’s attractiveness evaluation. The novelties in this work are 
the development of a methodology capable of putting together data identified by different sources in 
the literature in order to obtain an expanded set of measures comprehensive of all the works 
published up-to-now, and to provide a ranking of the measures included in the expanded set to 
understand which are the most significant in attractiveness assessment. Results confirm that the 
evaluation of selected measures corresponds closely to human assessments, providing the 
opportunity to quantitatively analyze women’s beauty; moreover, a ranking showing the influence 
of measures in women’s facial beauty and proportion assessment has been drawn up. The importance 
of ratios between measures and the higher relevance of the vertical measures compared to the 
horizontal have been highlighted. 

2. Materials and Methods 

Face analysis is the discipline that studies human faces based on the identification of landmarks, 
specific points common to everyone that can be identified on the face. Landmarks can be recognized 
on the hard tissue through palpation or on the soft tissue through observation, even if some of the 
landmarks positioned on the soft tissue depend on landmarks positioned on the hard tissue. In order 
to evaluate women’s attractiveness from a quantitative point of view, an expanded set of measures, 
relying on landmark positions, has been defined. 

The most difficult step in evaluating female beauty is to identify a ground truth that allows us 
to make considerations on measures of women’s faces. Significant experiments proving the presence 
of a strong objective component have been conducted; Iliffe’s [11] and Udry’s [12] works showed that 
almost identical classifications are obtained when people are asked to judge and rank the beauty in 
specific photo galleries, as well as Cunningham’s [13], which stated that beauty cannot be defined as 
a quality resting only in the eye of the beholder. 

In the past decades, several works aiming at establishing which human facial traits influence an 
observer’s assessment of beauty have been conducted. Works considered as the most incisive in this 
field, thus taken into greater account in the present work, have been carried out by Farkas et al. [52], 
Ferrario et al. [23], Sarver et al. [37], Sforza et al. [24] and Galantucci et al. [25].  

Each of those works has been carried out via live-positioning landmarks on female subjects 
before the acquisition of the point cloud required to compute measures. Manual allocation directly 
on subjects has been chosen in order to achieve the best accuracy possible. Acquisitions have been 
made using an RGB-D camera, namely cameras which allow to acquire both color and depth images. 
Subjects were actresses, participants to beauty contests, and common women; the latter have been 
chosen to validate the results. Going deeper into detail, the purpose of computing and comparing 
facial measures has been to highlight the correlation among some measures in the set of attractive 
women and the differences with the same measures in the set of normal women. Whenever the 
difference has been evaluated as significant, the measure related to that difference has been classified 
as influential for women’s facial beauty and proportion assessment. As it can be seen in Table 1, the 
vast majority of measures belonging to the expanded set show a statistically significant difference 
between Galantucci’s attractive and normal women datasets. 

Table 1. Since values of single measures related to the attractive set of women are available only for 
the Galantucci et al. database [25], a t-test has been performed between measures in this dataset and 
those ones in Bosphorus. The significance level has been fixed to 95%. This means that differences 
between measures are significant if the p-value is lower than 0.05. 


Appl. Sci. 2020, 10, 8 4 of 18 

Measure p-Value 
n-pg 0.0000 
n-sn 0.0000 

ch_r-ch_l 0.0268 
ex_r-ex_l 0.0000 

sn-pg 0.9761 
t_r-t_l 0.0008 

ls-(prn-pg) 0.2480 
li-(prn-pg) 0.0128 

ls-li 0.0000 
en_r-en_l 0.0087 
ac_r-ac_l 0.0000 

n-gn 0.0000 
sn-gn 0.4234 
ls-sto 0.0001 
li-sto 0.0000 

n-sn-pg 0.9931 
sl-n-sn 0.1045 

prn-sn-ls 0.1240 
n-prn-pg 0.3796 

ex_l-n-ex_r 0.0105 
mf(pg-n-ls) 0.0219 
en_l-n-en_r 0.0105 

(t_r-t_l)/(n-pg) 0.0728 
(n-sn)/(n-pg) 0.0000 

(sn-pg)/(n-pg) 0.0000 
(t_r-n)/(t_r-sn) 0.0000 
(sn-pg)/(n-sn) 0.0000 
(sn-gn)/(n-gn) 0.0000 
(sto-gn)/(sn-

gn) 
0.0006 

All the information found by those studies were gathered here to obtain an expanded set of 
measures able to evaluate women’s attractiveness; then, the Bosphorus database was used to validate 
the expanded set of measures, comparing it with qualitative evaluations issued by human observers. 
There are 110 subjects in Bosphorus database, but only the 44 selected women have been analyzed in 
this work. 

The landmark framework considered in this work is reported in Table 2 and shown in Figure 1. 

Table 2. The landmark framework. The third column reports a description for each landmark [53]. 

Landmark 
Abbreviatio

n 
Description 

Alar curvature 
point 

ac Point located at the facial insertion of each alar base. 

Cheilion ch Point located at each labial commissure. 

Endocanthion en 
Soft tissue point located at the inner commissure of each eye 

fissure. 

Exocanthion ex 
Soft tissue point located at the outer commissure of each eye 

fissure. 
Gonion (or 

Menton) 
gn (or me) Most inferior midpoint on the soft tissue contour of the chin. 

Labiale inferius li Midpoint of the vermilion line of the lower lip. 


Appl. Sci. 2020, 10, 8 5 of 18 

Labiale superius ls Midpoint of the vermilion line of the lower lip. 

Nasion n 
Midpoint on the soft tissue contour of the base of the nasal 

root at the level of the frontonasal suture. 
Pogonion pg Most anterior midpoint of the chin. 
Pronasale prn Most anterior midpoint of the nasal tip. 
Stomion sto Midpoint of the horizontal labial fissure. 

Sublabiale sl 
Most posterior point on the labiomental soft tissue contour 
that defines the border between the lower lip and the chin. 

Subnasale sn 
Midpoint on the nasolabial soft tissue contour between the 

columella crest and the upper lip. 
Tragion t Point located at the upper margin of each tragus. 

 
Figure 1. Figure showing landmarks studied in the current work. Landmarks 20 (zy_l), 21 (zy_r), 22 
(midpoint of Tragi), 25 (g), 26 (al_l) and 27 (al_r) have been discarded due to a lack of available 
measures involving them. Suffixes _l and _r state that the landmark considered is respectively the left 
or the right one. 

All the measures considered in this analysis have been acquired employing the above-
mentioned landmarks and can be subdivided into three categories: Linear, angular, and ratios. 

Linear measures described in Table 3 and shown in Figure 2 are Euclidean distances between 
two landmarks or between a landmark and another specific point. More specifically, one of those 
specific points is the point on the E-line that minimizes the distance with Labiale superius (or Labiale 
inferius), where the E-line is the line passing through the Pronasal and the Pogonion. 

Table 3. Euclidean linear distances. The third column reports the work from which the measure has 
been taken. 

Measure Description References 
n-pg Facial line Ferrario et al. [23], Galantucci et al. [25] 

n-sn 
Anterior upper facial 2° third 

height 
Ferrario et al. [23], Galantucci et al. [25] 


Appl. Sci. 2020, 10, 8 6 of 18 

ch_r-ch_l Oral length 
Farkas et al. [52], Sarver et al. [37], Galantucci et 

al. [25] 

ex_r-ex_l Upper facial width 
Ferrario et al. [23], Sforza et al. [24], Galantucci et 

al. [25] 
sn-pg Anterior lower facial height Galantucci et al. [25] 
t_r-t_l Middle facial width Galantucci et al. [25] 

ls-(prn-
pg) 

Upper lip to E-line distance Galantucci et al. [25] 

li-(prn-
pg) 

Lower lip to E-line distance Sforza et al. [24], Galantucci et al. [25] 

ls-li Vermilion height Galantucci et al. [25] 
en_r-en_l Intercantal distance Farkas et al. [52], Sarver et al. [37] 
ac_r-ac_l Width nose base Farkas et al. [52], Sarver et al. [37] 

n-gn Facial height Farkas et al. [52], Sarver et al. [37] 
sn-gn Lower third facial height Farkas et al. [52], Sarver et al. [37] 
ls-sto Upper vermilion Farkas et al. [52], Sarver et al. [37] 
li-sto Lower vermilion Farkas et al. [52], Sarver et al. [37] 

 
Figure 2. Euclidean distances. 

Angular measures described in Table 4 and shown in Figure 3 are angles subtended by a vertex 
identified by three landmarks or, exceptionally for the Interlabial distance, by two lines lying on the 
same plane and identified by four landmarks, two for each line. 


Appl. Sci. 2020, 10, 8 7 of 18 

Table 4. The angular measures. The third column reports the work from which the measure has been taken. 

Measure Description References 
n-sn-pg Facial convexity excluding the nose Ferrario et al. [23], Galantucci et al. [25] 
sl-n-sn Maxillary prominence Ferrario et al. [23], Galantucci et al [25] 

prn-sn-ls Nasolabial 
Farkas et al. [52], Sarver et al. [37], 

Galantucci et al. [25] 
n-prn-pg Nasion – Pronasal - Pogonion Galantucci et al. [25] 
ex_l-n-

ex_r 
Left Exocanthion – Nasion - Right 

Exocanthion 
Sforza et al. [24] 

pg-n-ls Maxillo-facial angle (mf) Galantucci et al [25] 
en_l-n-

en_r 
Left Endocanthion-Nasion-Right 

Endocanthion 
Ferrario et al. [23] 

 
Figure 3. Angular measures. 

Ratios between linear distances (Table 5) allow us to perform quantitative evaluations of 
proportions. The face analysis moves the focus from the local to the global point of view, since not 
only the absolute value of one single measure is considered, but rather the overall effect of two 
measures. Intuitively, ratios are the quantitative way to represent the big picture. 

Table 5. Ratios of Euclidean distances. The third column reports the work from which the measure 
has been taken. 


Appl. Sci. 2020, 10, 8 8 of 18 

Measure Description References 
(t_r-t_l)/(n-pg) Middle facial width to facial height Galantucci et. al [25] 

(n-sn)/(n-pg) Nasion - Subnasale/Nasion - Pogonion 
Ferrario et al. [23], Galantucci et. al 

[25] 

(sn-pg)/(n-pg) 
Subnasale - Pogonion/Nasion - 

Pogonion 
Ferrario et al. [23], Galantucci et. al 

[25] 

(t_r-n)/(t_r-sn) 
Right Tragi-Nasion/Right Tragi-

Subnasale 
Galantucci et. al [25] 

(sn-pg)/(n-sn) Lower to upper facial height Galantucci et. al [25] 
(sn-gn)/(n-gn) Lower third/facial height Farkas et al. [52], Sarver et al. [37] 
(sto-gn)/(sn-

gn) 
Mandibula/lower third Farkas et al. [52], Sarver et al. [37] 

As a result of the literature review, twenty-nine measures have been identified. The present 
work employs data from the Bosphorus database, namely women’s pictures, 3D models, and relative 
landmark coordinates. Because all the studies previously cited were carried out by research groups 
with different expertise, slightly different sets of landmarks have been adopted. A landmarking 
expert identified some missing landmarks on 2D pictures and on 3D models on the Bosphorus 
database in order to complete the landmark framework. Nonetheless, some landmarks had to be 
discarded because they relied on the hard tissue, and the only way to identify them was through 
palpation. Thus, it has not been possible to include some measures into the expanded set. An example 
is the zygion (zy), which is the most lateral point on the soft tissue contour of each zygomatic arch. 

The works of Galantucci et al. [25], Farkas et al. [52], Sarver et al. [37], Ferrario et al. [23] and 
Sforza et al. [24] provided mean value and standard deviation for every measure. Some measures are 
common to different sources, even if the related mean value and standard deviation are slightly 
different depending on the study. The small sample size of attractive sets of women is due to the 
complexity of finding available databases providing data, i.e., not only images, but also 3D models 
and information like landmarks position. Indeed, one of the purposes of the present study is to merge 
the information coming from these sources (Figure 4) and build an overall measuring methodology. 

 
Figure 4. Definition of the expanded set of measures. 

A score, the result of the sum of penalties, has been computed for each woman present in the 
Bosphorus database. For each measure, if the value of the considered woman was within the range 


Appl. Sci. 2020, 10, 8 9 of 18 

mean value ± standard deviation, no penalty has been added. For the sake of clarity, it has to be 
highlighted that the mean value taken into consideration is the mean value of the measures belonging 
to the attractive sets of women and not the mean value referred to a generic non-selected group of 
women (normal women). Conversely, if the value was out of range, a penalty has been added, and 
the amount of penalty (1) has been computed as the ratio between the distance of the measure from 
the mean value normalized with the mean value. 𝑝𝑒𝑛𝑎𝑙𝑡𝑦 = 𝑤𝑒𝑖𝑔ℎ𝑡 ∗ | 𝑐𝑢𝑟𝑟𝑒𝑛𝑡 𝑤𝑜𝑚𝑎𝑛 𝑠 𝑚𝑒𝑎𝑠𝑢𝑟𝑒 –  𝑚𝑒𝑎𝑛 𝑣𝑎𝑙𝑢𝑒 |𝑚𝑒𝑎𝑛 𝑣𝑎𝑙𝑢𝑒  (1) 

In the case of more than one mean value, since there is more than one source in the literature 
that refers to the same measure, the computed penalty value has been then multiplied for weight (2). 𝑤𝑒𝑖𝑔ℎ𝑡 = # 𝑤𝑜𝑚𝑒𝑛 𝑖𝑛 𝑐𝑢𝑟𝑟𝑒𝑛𝑡 𝑠𝑡𝑢𝑑𝑦#𝑤𝑜𝑚𝑒𝑛 𝑖𝑛 𝑎𝑙𝑙 𝑡ℎ𝑒 𝑠𝑡𝑢𝑑𝑖𝑒𝑠 𝑟𝑒𝑓𝑒𝑟𝑟𝑖𝑛𝑔 𝑡𝑜 𝑡ℎ𝑒 𝑠𝑎𝑚𝑒 𝑚𝑒𝑎𝑠𝑢𝑟𝑒 (2) 

This weight has been introduced to consider the different degree of confidence assigned to 
different studies found in the literature. Remembering that in those studies canons have been 
extracted from sets of attractive women, it was considered essential to evaluate more robustly the 
analysis using a greater number of subjects. Thus, the weights have been computed as the ratio 
between the number of women involved in a single study and the sum of all the women involved in 
all the studies related to the same measure. For instance, consider a hypothetical measure identified 
by two studies, A and B, which involves x and y women, respectively. The weight related to the study 
A will be x/(x+y), while the weight related to the study B will be y/(x+y). 

After all the scores were obtained, a cluster analysis was performed through the usage of k-means 
methodology, subdividing the datasets into five classes. The purpose of this step was to identify 
which women were closer to the well-proportioned standard face in terms of compliance with the 
measures in the expanded set; in other words, considering the descending order adopted, faces 
belonging to class 5 are closer to the well-proportioned standard face than the faces belonging to class 
1. The number of classes has been chosen to have a correspondence with the Likert scale. Indeed, in 
parallel, a qualitative evaluation of the women present in Bosphorus has been made by a focus group 
using precisely the Likert scale. A comparison between the results obtained using the developed 
methodology and the qualitative evaluation performed by human observers has been performed 
(Figure 5). 

Finally, the results of the developed methodology in terms of cluster subdivision have been 
analyzed to rank the influence of each measure of the expanded set in women’s facial beauty and 
proportion assessment. 


Appl. Sci. 2020, 10, 8 10 of 18 

Figure 5. The methodology used to compare quantitative and qualitative evaluation. 

3. Results 

The measures selected from literature and included in the expanded set are reported in Table 6. 
For every measure, it is possible to have more than one mean value and one standard deviation due 
to the possibility of retrieving the same measure from different sources in literature; the last column 
reports the reference number of the considered source. 

Table 6. List of the final set of 29 measures. Reference [17] analyzes two different beauty contests 
(Miss Italia 2006 and Miss Italia 2007) that provide slightly different values. The year markers of 
“2006” and “2007” have been added in some references to distinguish between those two contests. 

Measure 
Mean 
Value 

Standard 
Deviation 

Reference 

n-pg 
97.34 4.03 [25] 
99.06 5.54 [23] 

n-sn 
50.29 2.43 [25] 
52.38 2.76 [23] 

ch_r-ch_l 
45.7 2.77 [25] 
50 3.2 [23] 

ex_r-ex_l 

84.01 2.98 [25] 
101.59 4.97 [23] 

95.2 3.3 
[24]—
2006 

92.5 3.5 
[24]—
2007 

sn-pg 48.14 2.65 [25] 
t_r-t_l 131.52 4.19 [25] 

ls-(prn-pg) 3.95 1.8 [25] 

li-(prn-pg) 
2.32 1.31 [25] 

2.4 1.4 
[24]—
2006 


Appl. Sci. 2020, 10, 8 11 of 18 

1.7 1.3 
[24]—
2007 

ls-li 17.97 2.29 [25] 
en_r-en_l 32 2.4 [37,52] 
ac_r-ac_l 31 1.9 [37,52] 

n-gn 112 5.2 [37,52] 
sn-gn 66 4.5 [37,52] 
ls-sto 8.4 1.3 [37,52] 
li-sto 9.7 1.6 [37,52] 

n-sn-pg 
163.55 4.37 [25] 
164.02 3.71 [23] 

sl-n-sn 
9.33 1.98 [25] 
8.48 1.72 [23] 

prn-sn-ls 123.12 9.53 [25] 
n-prn-pg 131.12 4.07 [25] 

ex_l-n-ex_r 
124.9 3.5 

[24]—
2006 

120.8 4.2 
[24]—
2007 

pg-n-ls 7.93 2.1 [25] 
en_l-n-en_r 120.29 5.24 [23] 

(t_r-t_l)/(n-pg) 1.35 0.06 [25] 

(n-sn)/(n-pg) 
0.52 0.02 [25] 

0.5341 0.017 [23] 

(sn-pg)/(n-pg) 
0.49 0.02 [25] 

0.4659 0.017 [23] 
(t_r-n)/(t_r-sn) 0.96 0.02 [25] 
(sn-pg)/(n-sn) 0.9588 0.0600 [25] 
(sn-gn)/(n-gn) 0.586 0.029 [37,52] 
(sto-gn)/(sn-

gn) 
0.691 0.028 [37,52] 

After the expanded set definition, each of the forty-four women in Bosphorus database were 
measured, and the scores were computed summing all the penalties. Scores were subdivided into 
five clusters so that it is easy to identify women’s faces closer to the well-proportioned standard face, 
remembering that clusters are arranged in descending order; namely, faces belonging to cluster 5 are 
the closest to the well-proportioned standard face, and conversely, faces belonging to cluster 1 are 
the furthest. 

Hereafter, a focus group composed by 10 people, 6 men and 4 women aged between 19 and 39 
years old met up to evaluate Bosphorus women attractiveness. For every face, a discussion between 
focus group members led to a final score. In other words, the whole group provided a single score 
for each face analyzed. The aim of that was to compare the developed methodology with human 
judgment. The results showed that the cluster label and the Likert’s scale label are the same; namely, 
the difference between those two values equals 0 for 15/44 women (very high correspondence); the 
difference is 1 for 16/44 women (high correspondence); the difference is 2 for 9/44 women (moderate 
correspondence); the difference is 3 for 3/44 women (low correspondence); and the difference is 4 for 
1/44 women (very low correspondence).  

Very high correspondence means that the result of the method matches the focus group’s 
outcome (a woman with very high correspondence label has been judged in the same way both by 
the method and by the focus group). 

All those results are reported in Table 7. 


Appl. Sci. 2020, 10, 8 12 of 18 

Table 7. Scores, cluster labels, qualitative evaluation labels and correspondence between quantitative 
and qualitative evaluations for each woman. 

Subjec
t 

Score 
Cluster 
Label 

Qualitative Evaluation 
Label 

Correspondenc
e 

1 0.7608 5 4 High 
2 0.7868 5 3 Moderate 
3 5.9095 1 2 High 
4 2.2549 3 3 Very high 
5 3.3508 2 2 Very high 
6 0.8030 5 4 High 
7 2.8187 2 2 Very high 
8 1.3236 4 3 High 
9 0.7395 5 3 Moderate 

10 1.0387 4 4 Very high 
11 2.9316 2 1 High 
12 1.5750 4 3 High 
13 2.6975 2 2 Very high 
14 2.0539 3 3 Very high 
15 1.4707 4 2 Moderate 
16 2.6951 2 1 High 
17 1.5603 4 1 Low 
18 0.4124 5 4 High 
19 0.7826 5 4 High 
20 2.2078 3 2 High 
21 4.1747 1 1 Very high 
22 3.4563 2 2 Very high 
23 3.2831 2 2 Very high 
24 2.5096 3 1 Moderate 
25 1.8256 3 2 High 
26 4.5707 1 1 Very high 
27 2.3485 3 2 High 
28 4.3690 1 3 Moderate 
29 1.3455 4 1 Low 
30 5.4693 1 1 Very high 
31 3.4337 2 1 High 
32 4.2277 1 2 High 
33 2.9829 2 2 Very high 
34 1.4366 4 2 Moderate 
35 1.9479 3 1 Moderate 
36 2.7541 2 2 Very high 
37 0.7544 5 1 Very low 
38 1.9969 3 1 Moderate 
39 1.7917 3 2 High 
40 1.1677 4 1 Low 
41 1.4717 4 2 Moderate 
42 1.4425 4 3 High 
43 3.3575 2 2 Very high 
44 2.1768 3 3 Very high 

Finally, results obtained through cluster analysis were more deeply analyzed. The purpose of 
this step was to rank the measures to understand which are the most important in women’s facial 
beauty and proportion assessment. In particular, the focus was moved on cluster 4 and cluster 5, 


Appl. Sci. 2020, 10, 8 13 of 18 

which contain attractive and very attractive women, respectively. Knowing the cardinality of each 
cluster, that is 17 if clusters 4 and 5 are taken together, values out of range have been computed for 
each measure; a measure must be considered influential if the smallest possible number of women 
has that measure out of range. For instance, referring to Table 8, it is possible to notice that the 
Euclidean distance ls-sto is the most influential measure, because only two women (12%) belonging 
to cluster 4 and cluster 5 are out of range; conversely, ch_r-ch_l is one of the least influential measures 
because only 5 out of 17 women are within range. 

Table 8. The measures rankings. The cardinality of each cluster is reported in the header of the table. 
The first column lists the measures, and the second column reports the number of women belonging 
to cluster 4 or 5 that are out of range. For the sake of completeness, in the other columns, the number 
of women belonging to each cluster has been reported. 

Measures 
Cluster 4 + 5 

(#17) 
Cluster 5 

(#7) 
Cluster 4 

(#10) 
Cluster 3 

(#10) 
Cluster 2 

(#11) 
Cluster 1 

(#6) 
ls-sto 2 (12%) 0 (0%) 2 (20%) 4 (40%) 11 (100%) 5 (83%) 

(sn-pg)/(n-
sn) 

2 (12%) 1 (14%) 1 (10%) 6 (60%) 1 (9%) 5 (83%) 

n-sn 3 (18%) 2 (29%) 1 (10%) 6 (60%) 3 (27%) 5 (83%) 
sn-gn 3 (18%) 0 (0%) 3 (30%) 4 (40%) 11 (100%) 5 (83%) 

(t_r-n)/(t_r-
sn) 

3 (18%) 0 (0%) 3 (30%) 4 (40%) 4 (36%) 3 (50%) 

ex_r-ex_l 4 (23%) 1 (14%) 3 (30%) 5 (50%) 6 (54%) 5 (83%) 
sl-n-sn 4 (23%) 2 (29%) 2 (20%) 5 (50%) 6 (54%) 2 (33%) 

(t_r-t_l)/(n-
pg) 

4 (23%) 0 (0%) 4 (40%) 4 (40%) 6 (54%) 3 (50%) 

n-gn 5 (29%) 0 (0%) 5 (50%) 6 (60%) 6 (54%) 5 (83%) 
li-sto 5 (29%) 2 (29%) 3 (30%) 3 (30%) 7 (64%) 5 (83%) 

n-prn-pg 5 (29%) 1 (14%) 4 (40%) 5 (50%) 3 (27%) 2 (33%) 
(sn-gn)/(n-

gn) 
5 (29%) 1 (14%) 4 (40%) 7 (70%) 3 (27%) 2 (33%) 

en_l-n-en_r 6 (35%) 3 (43%) 3 (30%) 6 (60%) 6 (54%) 4 (67%) 
(sn-pg)/(n-

pg) 
6 (35%) 2 (29%) 4 (40%) 3 (30%) 4 (36%) 2 (33%) 

n-pg 7 (41%) 3 (43%) 4 (40%) 7 (70%) 6 (54%) 5 (83%) 
pg-n-ls 7 (41%) 0 (0%) 7 (70%) 3 (30%) 6 (54%) 4 (67%) 

(n-sn)/(n-pg) 7 (41%) 3 (43%) 4 (40%) 4 (40%) 7 (64%) 3 (50%) 
ls-li 8 (47%) 2 (29%) 6 (60%) 6 (60%) 5 (45%) 4 (67%) 

en_r-en_l 8 (47%) 4 (57%) 4 (40%) 7 (70%) 4 (36%) 4 (67%) 
n-sn-pg 8 (47%) 5 (71%) 3 (30%) 8 (80%) 8 (73%) 5 (83%) 
ac_r-ac_l 9 (53%) 3 (43%) 6 (60%) 6 (60%) 9 (82%) 4 (67%) 

(sto-gn)/(sn-
gn) 

9 (53%) 3 (43%) 6 (60%) 8 (80%) 6 (54%) 2 (33%) 

sn-pg 10 (59%) 3 (43%) 7 (70%) 3 (30%) 6 (54%) 3 (50%) 
ex_l-n-ex_r 10 (59%) 5 (71%) 5 (50%) 6 (60%) 6 (54%) 5 (83%) 
prn-sn-ls 11 (65%) 6 (86%) 5 (50%) 9 (90%) 10 (91%) 6 (100%) 
ch_r-ch_l 12 (71%) 4 (57%) 8 (80%) 8 (80%) 9 (82%) 6 (100%) 

li-(prn-pg) 13 (76%) 4 (57%) 9 (90%) 7 (70%) 5 (45%) 5 (83%) 
ls-(prn-pg) 15 (88%) 7 (100%) 8 (80%) 8 (80%) 9 (82%) 3 (50%) 

t_r-t_l 16 (94%) 6 (86%) 10 (100%) 7 (70%) 10 (91%) 6 (100%) 

4. Discussion 


Appl. Sci. 2020, 10, 8 14 of 18 

Results provided by literature analysis have led to the first outcome of this research, namely the 
expanded set of measures, each of which is characterized by a mean value and standard deviation. 
In the literature, a measure is considered relevant for women’s facial attractiveness if a statistically 
significant number of attractive women possess similar values of the same measure and, conversely, 
that measure assumes different values in subjects belonging to the normal woman set. The critical 
point is to define a ground truth, i.e., a set of measures distinctive for attractive women. All the studies 
carried out in this field agree upon considering attractive those women’s faces that are commonly 
positively evaluated in terms of facial beauty and proportion, and thus famous actresses or beauty 
contest participants, especially those that move on to the final stage of national competitions. 

The present work has focused on gathering measures validated in past studies, building the 
expanded set of measures defined on facial landmarks and analyzing a public database of normal 
women, the Bosphorus. Unfortunately, some of the landmarks used in the literature were not present 
in the set of data of the Bosphorus database. For this reason, an expert has manually added those 
missing landmarks lying on soft tissue, but some of the hard tissue landmarks have not been 
considered due to the impossibility of identifying them without live palpation. Consequently, a 
limited set of measures that could have been included in the expanded set has been discarded. 

In order to analyze the Bosphorus, a methodology able to integrate information provided by 
different sources was required. Thus, the algorithm based on the penalty mechanism has been 
developed, and the results have been clustered to provide the possibility of classifying faces into 5 
different levels. The cluster numbering is from 5–1, that means it ranges from the most compliance 
with the well-proportioned standard face to the least. The cluster numbering is chosen this way so 
that the opportunity of comparing quantitative outcome and qualitative assessment is guaranteed. 
Likert’s 5-level scale has allowed us to analyze the correspondence between the developed 
methodology and people’s assessments. 

Obtained results displayed in Figure 6 show the levels of correspondence in women’s evaluation 
from very high to very low, namely from a 4-level difference to a 0-level difference between 
quantitative and qualitative evaluation. The 31/44 means slightly more than 70%, obtained the 
summing of high and very high correspondence, justifying the theories mentioned in the introduction 
stating the presence of objective elements that are unconsciously but incontrovertibly considered in 
evaluating women’s attractiveness. 

 
Figure 6. Between quantitative and qualitative evaluation. 

Cluster analysis has led this study to discover part of those elements in terms of proportions 
between significant measures, ranking them from the most to the least influential. Some interesting 
observations have arisen from the ranking analysis. Firstly, vertical measures are typically more 
meaningful than the horizontal: Some examples of influential vertical measures are the thickness of 
the upper lip (ls-sto), the height of the central part of the face (n-sn), and the height of the lower part 
of the face (sn-gn). Going deeper into the detail, the lower part of the face in attractive women resulted 
as higher than the central part, coherently with the fact that if they have identical values, faces appear 
rounded, a characteristic not considered attractive. The upper lip was more meaningful than lower 


Appl. Sci. 2020, 10, 8 15 of 18 

lip, but this does not mean it should be greater; rather, it means that it is more ordinary to have a full 
lower lip, while to have also a full upper lip is more uncommon, thus it is a peculiar feature of 
attractiveness. The angle between the two exterior corners of the eyes and the nasion, i.e., the point 
which separates the upper third of the face and the middle third of the face, (ex_l-n-ex_r), and the 
mouth width (ch_r-ch_l) at the end of the table are some examples of less meaningful horizontal 
measures. This does not retract the importance of mouth width in women’s faces assessment; simply, 
a not-so-relevant difference in mouth width between attractive and normal women has been shown 
by the analysis. Secondly, beauty turned out to be strictly connected to proportions; 5 out of 7 ratios 
present in the expanded set of measures are on the top half of the ranking, confirming that women’s 
beauty and well-proportioned faces are also given by relationships between measures, thus they 
involve a holistic process. Another indication is the relatively poor importance of the facial width 
absolute value, t_r-t_l, compared with the ratio between facial width and facial height, (t_r-t_l)/(n-
pg), ranked in the top 10 most influential measures. 

In total, 70% of high and very high correspondence between quantitative and qualitative 
evaluation is a not neglectable result, but the 30% non-similarity of the results needs to be investigated 
in future research. A larger set of attractive and normal women would allow us to enlarge the 
expanded set of measures, as well as the opportunity of live-acquiring all the needed landmarks, 
which would allow us to have uniform data in term of colors, poses, and expressions that could be 
more properly evaluated by humans. Moreover, texture analysis in terms of eye color and skin 
imperfections could further thin that percentage of non-correspondence and bridge the gap between 
perceived beauty and facial proportions.  

5. Conclusions 

The present study has been carried on considering experimental evidence proving that beauty 
assessment is not only subjective, but also relies on objective elements. 

Several previous studies have identified relevant measures to assess women’s facial beauty and 
proportion, comparing sets of attractive and normal women by measuring Euclidean distances, and 
angular and ratio values; measures have been computed relying on landmarks live-identified on 
subject faces. All these works have considered different sets of measures to find differences in the 
normal and attractive populations. 

The importance of the attractive women dataset size is evidenced by the development of the 
methodology, which is penalty-based. That methodology has been developed to consider data 
provided by different sets of attractive women, in order to consider a larger number of measures. 
Future work should focus exactly on that aspect. In the literature there is a lack of attractive women 
datasets, and establishing a larger database will allow us to deepen the analysis related to measures 
peculiar of attractiveness. 

The current work has defined an expanded set of measures gathering all information presented 
in the literature and a methodology to merge the results and to classify faces. For each woman, the 
methodology provides a score, computed as a sum of penalties, and each penalty is given when a 
woman’s measure is not in the range identified by mean value and standard deviation. That 
procedure has been then used on the set of normal women of the Bosphorus public database, and after 
a cluster analysis, it has been possible to classify women’s faces considering their distance from the 
well-proportioned standard face, i.e., the expanded set of measures. 

After that, a deeper investigation on results provided by the cluster analysis has permitted us to 
rank the measures from the most to the least influential, to understand which the most considered 
measures in women’s’ beauty assessment is. Vertical Euclidean distances were shown to be very 
impactful, as well as ratio measures; indeed, the direct comparison between measures resulted in 
being even more significant than the single measure value, as shown by the facial width. Upper lip 
thickness proved to be the most meaningful measure in the mouth area. 

The present work could make a significant contribution to female beauty and proportion 
assessment and proposes some research hints for future works on female attractiveness, but also for 


Appl. Sci. 2020, 10, 8 16 of 18 

other studies that have been discouraged by the misconception of considering beauty something 
merely, or mainly, subjective. 

Author Contributions: Conceptualization, J.-L.D. and F.M.; methodology, L.U. and F.M.; software, L.U. and 
S.M.; validation, L.U.; formal analysis, L.U., J.-L.D. and F.M.; investigation, L.U.; resources, J.-L.D. and E.V.; data 
curation, L.U. and S.M.; writing—original draft preparation, L.U.; writing—review and editing, J.-L.D., E.V., 
S.M. and F.M.; visualization, L.U.; supervision, J.-L.D. and F.M.; project administration, J.-L.D. and E.V.; funding 
acquisition, J.-L.D. and E.V. All authors have read and agreed to the published version of the manuscript. 

Funding: This research received no external funding. 

Conflicts of Interest: The authors declare no conflict of interest. 

References 

1. Nelson, S.M. Diversity of the Upper Paleolithic Venus Figurines and Archeological Mythology. Archeol. 
Pap. Am. Anthropol. Assoc. 2008, 2, 11–22. 

2. Naini, F.B.; Cobourne, M.T.; McDonald, F.; Donaldson, A.N.A. The influence of craniofacial to standing 
height proportion on perceived attractiveness. Int. J. Oral Maxillofac. Surg. 2008, 37, 877–885. 

3. De Campos, D.; Malysz, T.; Bonatto-Costa, J.A.; Pereira Jotz, G.; Pinto De Oliveira Junior, L.; Oxley Da 
Rocha, A. More than a neuroanatomical representation in the Creation of Adam by Michelangelo 
Buonarroti, a representation of the Golden Ratio. Clin. Anat. 2015, 28, 702–705. 

4. Baker, B.W.; Woods, M.G. The role of the divine proportion in the esthetic improvement of patients 
undergoing combined orthodontic/orthognathic surgical treatment. Int. J. Adult Orthodon. Orthognath. Surg. 
2001, 16, 108–120. 

5. Jefferson, Y. Facial beauty—Establishing a universal standard. Int. J. Orthod. Milwaukee 2004, 15, 9–22. 
6. Holland, E. Marquardt’s phi mask: Pitfalls of relying on fashion models and the golden ratio to describe a 

beautiful face. Aesthet. Plast. Surg. 2008, 32, 200–208. 
7. Edler, R.J. Background considerations to facial aesthetics. J. Orthod. 2001, 28, 159–168. 
8. Zhang, D.; Zhao, Q.; Chen, F. Quantitative analysis of human facial beauty using geometric features. Pattern 

Recognit. 2011, 44, 940–950. 
9. Valenzano, D.R.; Mennucci, A.; Tartarelli, G.; Cellerino, A. Shape analysis of female facial attractiveness. 

Vision Res. 2006, 46, 1282–1291. 
10. Varlik, S.K.; Demirbaş, E.; Orhan, M. Influence of lower facial height changes on frontal facial attractiveness 

and perception of treatment need by lay people. Angle Orthod. 2010, 80, 1159–1164. 
11. Iliffe, A.H. A study of preferences in feminine beauty. Br. J. Psychol. 1960, 51, 267–273. 
12. Udry, J.R. Structural correlates of feminine beauty preferences in Britain and the United States: A 

comparison. Sociol. Soc. 1965, 49, 330–342. 
13. Cunningham, M.R. Measuring the Physical in Physical Attractiveness: Quasi-Experiments on the 

Sociobiology of Female Facial Beauty. J. Pers. Soc. Psychol. 1986, 50, 925–935. 
14. Dimitriadis, G. From Palaeolithic “Venus” up to the anthropomorphic statue-menhir. The ideological 

evolution of the human body in prehistoric art. Int. Congr. Ser. 2006, 1286, 7–12. 
15. Janson, H.; Janson, A. History of Art; Harry N. Abrams, Inc.: New York, NY, USA, 1991. 
16. Bax, C. The Beauty of Women; Frederick Muller Ltd.: London, UK, 1946. 
17. Romm, S. The changing face of beauty. Aesthet. Plast. Surg. 1989, 13, 91–98. 
18. Cheney, A.M. Most Girls Want to be Skinny. Qual. Health Res. 2011, 21, 1347–1359. 
19. Abdullah, N.; Naing, L.; Ismail, N.M.; Ismail, A.R. A cross-sectional study of soft tissue facial morphometry 

in children and adolescents. Malays. J. Med. Sci. 2006, 13, 25–29. 
20. Sforza, C.; Laino, A.; D’Alessio, R.; Dellavia, C.; Grandi, G.; Ferrario, V.F. Three-dimensional facial 

morphometry of attractive children and normal children in the deciduous and early mixed dentition. Angle 
Orthod. 2007, 77, 1025–1033. 

21. Sforza, C.; Laino, A.; D’Alessio, R.; Grandi, G.; Tartaglia, G.M.; Ferrario, V.F. Soft-tissue facial 
characteristics of attractive and normal adolescent boys and girls. Angle Orthod. 2008, 78, 799–807. 

22. Jayaratne, Y.S.N.; Deutsch, C.K.; McGrath, C.P.J.; Zwahlen, R.A. Are Neoclassical Canons Valid for 
Southern Chinese Faces? PLoS ONE 2012, 7, e52593. 

23. Ferrario, V.F.; Sforza, C.; Poggio, C.E.; Tartaglia, G.M. Facial morphometry of television actresses compared 
with normal women. J. Oral Maxillofac. Surg. 1995, 53, 1008–1014. 


Appl. Sci. 2020, 10, 8 17 of 18 

24. Sforza, C.; Laino, A.; D’Alessio, R.; Grandi, G.; Binelli, M.; Ferrario, V.F. Soft-tissue facial characteristics of 
attractive Italian women as compared to normal women. Angle Orthod. 2009, 79, 17–23. 

25. Galantucci, L.M.; Deli, R.; Laino, A.; Di Gioia, E.; D’Alessio, R.; Lavecchia, F.; Percoco, G.; Savastano, C. 
Three-Dimensional Anthropometric Database of Attractive Caucasian Women. J. Craniofac. Surg. 2016, 27, 
1884–1895. 

26. Olivetti, E.C.; Nicotera, S.; Marcolin, F.; Vezzetti, E.; Sotong, J.P.A.; Zavattero, E.; Ramieri, G. 3D Soft-Tissue 
Prediction Methodologies for Orthognathic Surgery—A Literature Review. Appl. Sci. 2019, 9, 4550. 

27. Peck Harvey, P.S. A Concept Of Facial Esthetics.pdf. Angle Orthod. 1970, 40, 284–317. 
28. Karatas, O.H.; Toy, E. Three-dimensional imaging techniques: A literature review. Eur. J. Dent. 2014, 8, 132–

140. 
29. Plooij, J.M.; Swennen, G.R.J.; Rangel, F.A.; Maal, T.J.J.; Schutyser, F.A.C.; Bronkhorst, E.M.; Kuijpers-

Jagtman, A.M.; Bergé, S.J. Evaluation of reproducibility and reliability of 3D soft tissue analysis using 3D 
stereophotogrammetry. Int. J. Oral Maxillofac. Surg. 2009, 38, 267–273. 

30. Deli, R.; Galantucci, L.M.; Laino, A.; D’Alessio, R.; Di Gioia, E.; Savastano, C.; Lavecchia, F.; Percoco, G. 
Three-dimensional methodology for photogrammetric acquisition of the soft tissues of the face: A new 
clinical-instrumental protocol. Prog. Orthod. 2013, 14, 32. 

31. Sforza, C.; de Menezes, M.; Ferrario, V.F. Soft- and hard-tissue facial anthropometry in three dimensions: 
What’s new. J. Anthropol. Sci. 2013, 91, 159–184. 

32. Kau, C.H. Creation of the Virtual Patient for the Study of Facial Morphology. Facial Plast. Surg. Clin. N. Am. 
2011, 19, 615–622. 

33. Vezzetti, E.; Marcolin, F. Geometrical descriptors for human face morphological analysis and recognition. 
Rob. Auton. Syst. 2012, 60, 928–939. 

34. Sforza, C.; Ferrario, V.F. Soft-tissue facial anthropometry in three dimensions: From anatomical landmarks 
to digital morphology in research, clinics and forensic anthropology. J. Anthropol. Sci. 2006, 84, 97–124. 

35. Hammond, P.; Hutton, T.J.; Allanson, J.E.; Campbell, L.E.; Hennekam, R.C.M.; Holden, S.; Patton, M.A.; 
Shaw, A.; Temple, I.K.; Trotter, M.; et al. 3D analysis of facial morphology. Am. J. Med. Genet. 2004, 126, 
339–348. 

36. Nanda, V.; Gutman, B.; Bar, E.; Alghamdi, S.; Tetradis, S.; Lusis, A.J.; Eskin, E.; Moon, W. Quantitative 
analysis of 3-dimensional facial soft tissue photographic images: Technical methods and clinical 
application. Prog. Orthod. 2015, 16, 21. 

37. Proffit, W.R.; Raymond, P.W.J.; Sarver, D.M. Contemporary Treatment of Dentofacial Deformity; Mosby: 
Maryland Heights, MO, USA, 2002. 

38. Fan, J.; Chau, K.P.; Wan, X.; Zhai, L.; Lau, E. Prediction of facial attractiveness from facial proportions. 
Pattern Recognit. 2012, 45, 2326–2334. 

39. Berlin, N.F.; Berssenbrügge, P.; Runte, C.; Wermker, K.; Jung, S.; Kleinheinz, J.; Dirksen, D. Quantification 
of facial asymmetry by 2D analysis - A comparison of recent approaches. J. Cranio-Maxillofac. Surg. 2014, 
42, 265–271. 

40. Borelli, C.; Berneburg, M. “Beauty lies in the eye of the beholder”? Aspects of beauty and attractiveness. J. 
Ger. Soc. Dermatol. 2010, 8, 326–331. 

41. Nonis, F.; Dagnes, N.; Marcolin, F.; Vezzetti, E. 3D Approaches and Challenges in Facial Expression 
Recognition Algorithms—A Literature Review. Appl. Sci. 2019, 9, 3904. 

42. O’Toole, A.J.; Price, T.; Vetter, T.; Bartlett, J.C.; Blanz, V. 3D shape and 2D surface textures of human faces: 
The role of “averages” in attractiveness and age. Image Vis. Comput. 1999, 18, 9–19. 

43. Tiddeman, B.; Burt, M.; Perrett, D. Prototyping and transforming facial textures for perception research. 
IEEE Comput. Graph. Appl. 2001, 21, 42–50. 

44. Fink, B.; Grammer, K.; Matts, P.J. Visible skin color distribution plays a role in the perception of age, 
attractiveness, and health in female faces. Evol. Hum. Behav. 2006, 27, 433–442. 

45. Fink, B.; Matts, P.J.; D’Emiliano, D.; Bunse, L.; Weege, B.; Röder, S. Colour homogeneity and visual 
perception of age, health and attractiveness of male facial skin. J. Eur. Acad. Dermatol. Venereol. 2012, 26, 
1486–1492. 

46. Fink, B.; Grammer, K.; Thornhill, R. Human (Homo sapiens) facial attractiveness in relation to skin texture 
and color.pdf. J. Comp. Psychol. 2001, 115, 92–99. 

47. Jones, B.C.; Little, A.C.; Burt, D.M.; Perrett, D.I. When facial attractiveness is only skin deep. Perception 2004, 
33, 569–576. 


Appl. Sci. 2020, 10, 8 18 of 18 

48. Donofrio, L.; Carruthers, A.; Hardas, B.; Murphy, D.K.; Carruthers, J.; Jones, D.; Sykes, J.M.; Creutz, L.; 
Marx, A.; Dill, S. Development and validation of a photonumeric scale for evaluation of facial skin texture. 
Dermatol. Surg. 2016, 42, S219–S226. 

49. Dantcheva, A.; Dugelay, J.-L. Assessment of female facial beauty based on anthropometric, non-permanent 
and acquisition characteristics. Multimed. Tools Appl. 2015, 74, 11331–11355. 

50. Weinberg, S.M.; Raffensperger, Z.D.; Kesterke, M.J.; Heike, C.L.; Cunningham, M.L.; Hecht, J.T.; Kau, C.H.; 
Murray, J.C.; Wehby, G.L.; Moreno, L.M.; et al. The 3D Facial Norms Database: Part 1. A Web-Based 
Craniofacial Anthropometric and Image Repository for the Clinical and Research Community. Cleft Pala. 
Craniofac. J. 2016, 53, e185–e197. 

51. Savran, A.; Alyüz, N.; Dibeklioǧlu, H.; Çeliktutan, O.; Gökberk, B.; Sankur, B.; Akarun, L. Bosphorus 
database for 3D face analysis. Lect. Notes Comput. Sci. 2008, 5372, 47–56. 

52. Farkas, L.G.; Schendel, S.A. Anthropometry of the Head and Face. Am. J. Orthod. Dentofac. Orthop. 1995, 
170, 112–112. 

53. Swennen, G.R.J.; Schutyser, F.; Hausamen, J.-E. Three-Dimensional Cephalometry A Color Atlas and Manual; 
Springer: Berlin/Heidelberg, Germany, 2006; ISBN 9783540254409. 

 
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access 
article distributed under the terms and conditions of the Creative Commons Attribution 
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).