Berber and Arab DNA in Europe from the Moorish occupation

North African Berber and Arab Influences in the Western Mediterranean Revealed by Y-Chromosome DNA Haplotypes

We have analyzed Y-chromosome diversity in the western Mediterranean area, examining p49a,f TaqI haplotype V and subhaplotypes Vb (Berber) and Va (Arab). A total of 2,196 unrelated DNA samples, belonging to 22 populations from North Africa and the southern Mediterranean coast of occidental Europe, have been typed. Subhaplotype Vb, predominant in a Berber population of Morocco (63.5%), was also found at high frequencies in southern Portugal (35.9%) and Andalusia (25.4%). The Arab subhaplotype Va, predominant in Algeria (53.9%) and Tunisia (50.6%), was also found at a relatively high frequency in Sicily (23.1%) and Naples (16.4%); its highest frequency in Iberia was in northern Portugal (22.8%) and Andalusia (15.5%). In Iberia there is a gradient of decreasing frequencies in latitude for both subhaplotypes Va and Vb, related to eight centuries of Muslim domination (8th to 15th centuries) in southern Iberia.

During the 7th century A.D., Muslim people coming from the Arabian peninsula and the Middle East invaded North Africa. The most important population movement relating both sides of the Mediterranean Sea was the conquest of the Iberian peninsula by North African populations (with recruited Berbers), soon after the first Muslim invasion. More than eight centuries (8th to 15th centuries) of Muslim domination in the southern part of Iberia imparted an important cultural legacy (Conrad 1998) and probable gene exchanges between North African and Iberian populations.

Variations in DNA sequences specific to the nonrecombinant part of the Y chromosome, relating to paternal ancestry, are particularly interesting from a human population genetics point of view. The first published and most informative probe used in Southern blots for this objective is p49 (locus DYS1), which is able to identify at least five TaqI male-specific fragments (A, C, D, F, and I) that are polymorphic between individuals (Lucotte and Ngo 1985). Sixteen main corresponding haplotypes (numbered I-XVI) were identified using the p49 probe on DNA samples of unrelated males living in France (Ngo et al. 1986). Only recently has the molecular basis of the p49 TaqI polymorphisms been established (Jovelin et al. 2003); the polymorphisms correspond to variable TaqI sites located in the four DAZ genes located in the AZF-c region of the Y chromosome.

In fact, the conventional p49 TaqI polymorphisms were the most popular markers used in various populations because of their ability to detect more than 100 different haplotypes [for a compilation on the subject until the end of 1995, see Poloni et al. (1997)]. Haplotype XV (A3,C1,D2,F1,I1) was the most widespread haplotype in our initial study (Ngo et al. 1986). Haplotype XV was also predominant in the first European study we published (Lucotte and Hazout 1996), with elevated frequencies in French Basques. The geographic distribution of haplotype XV in Europe reveals a gradient of decreasing frequencies from this Basque focus toward eastern peripheral countries (Lucotte and Loirat 1999) but also toward southwestern countries. According to the Y Chromosome Consortium (2002) nomenclature, haplotype XV corresponds to the M173 lineage (Diéterlen and Lucotte 2005).

Haplotype V  (E3b1b) (A2,C0,D0,F1,I1) is the most frequent haplotype in North Africa (Lucotte et al. 2000), with a particularly high frequency (55%) in the populations with a relative predominance of Berber origin. Our previous study on the subject examined the relative frequencies of haplotype V in four Iberian populations compared with a Berber population living in North Africa (Lucotte et al. 2001). The highest frequency of haplotype V (68.9%) was observed in Berbers from Morocco, and the geographic distribution of haplotype V revealed a gradient of decreasing frequencies with latitude in Iberia (40.8% in Andalusia, 36.2% in Portugal, 12.1% in Catalonia, and 11.3% in the Basque Country) (Lucotte et al. 2001); such a cline of decreasing haplotype V frequencies from the south to the north in Iberia clearly established a gene flow from North Africa toward Iberia.

According to the Y Chromosome Consortium (2002) nomenclature, haplogroup E is characterized by the mutations SRY4064, M96, and P29 on a background defined by the insertion of an Alu element (YAP+). The third clade, E3 (defined by the mutation P2), of haplogroup E is further subdivided into two monophyletic forms, the second one (E3b) being characterized by mutations M35 and M125. All of the 110 p49 TaqI haplotype V subjects from Morocco (51 Berbers and 59 Arabs) that we had previously tested correspond to haplogroup E3b.

In the present study we have subdivided haplotype V into its Berber (Vb) and Arab (Va) components in order to distinguish the relative contributions of these two ethnicity-specific markers in the gene pools of the populations living in Iberia and in other populations in the northern part of the western Mediterranean area.

DNA Samples. This study concerns 2,196 unrelated male DNA samples (Table 1). We collected 904 new unrelated male subjects from three different countries (Portugal, France, and Italy): 79 from North Portugal and 59 from South Portugal; 243 from the Marseilles region of France; 192 from Genoa, 64 from Rome, and 128 from Naples in continental Italy; 39 from Sicily; and 100 from Sardinia. All these new samples correspond to adult males, whose origin is based on the local birthplace of their fathers and (at least) grandfathers. We have obtained informed consent from each of the French subjects studied.

We add for comparison the following subjects, already tested as bearing haplotype V in previous studies: 11 subjects from Mauritania, 51 Berbers from Morocco, 59 Arabs from Rabat, 80 subjects from Algeria, 39 subjects from Tunisia, and 17 subjects from Libya (Lucotte et al. 2000); 29 Spaniards from Sevilla (Lucotte et al. 2001); 4 Spaniards from Barcelona and 9 French Catalans from Perpignan (Lucotte and Loirat 1999); 11 French Basques, 1 subject from Montpellier, and 7 subjects from Grasse in France and 6 subjects from Milan in Italy (Lucotte and Hazout 1996); and 44 subjects from Corsica (Lucotte et al. 2002).


Table 1 summarizes the frequencies we obtained for haplotype V and sub-haplotypes Vb and Va in the 22 study populations. For the 2,196 males typed, 491 (22.3%) bear haplotype V. The frequency of haplotype V is 35.5% in Portugal, with a more elevated proportion in the south (49.2%) than in the north (25.3%). The frequency of haplotype V in the Marseilles region (11.1%) has a value similar to the mean value in continental France (9%). In Italy the highest frequency is attained in Sicily (28.2%), followed by Naples at 17.2%. As previously shown (Lucotte et al. 2000), haplotype V is found at the highest frequency (68.9%) in Berbers from Marrakech in Morocco; an apparently increasing east-west cline in haplotype V frequencies is shown in North Africa from Libya (44.7%) to Rabat (57.7%), with intermediate values for Tunisia (53.4%) and Algeria (56.7%). In Spain haplotype V is much more frequent (40.9%) in the south of the country [in Andalusia (Sevilla)] than in the north (12.9%) [in Catalonia (Barcelona)].

Subhaplotype Vb is the Berber subhaplotype because its most elevated relative value (63.5%) is obtained for the Berber population of Marrakech. In the non-Berber population of Rabat in Morocco, the frequency of subhaplotype Vb is only 20.6%, whereas the frequency of subhaplotype Va (Arab) is 37.3%. In order of decreasing values, the subhaplotype Vb frequencies are 40% in Mauritania, 35.9% in South Portugal, 25.4% in Andalusia, and 15.8% in Libya. Low frequencies of subhaplotype Vb are found in Sicily (5.1%), Algeria (2.8%), Tunisia (2.7%), and North Portugal (2.5%); frequencies less than 2% are found in French Basques (1.9%), in Naples (0.8%), and in Corsica (0.6%). Subhaplotype Vb is absent in Catalonia (Barcelona and Perpignan), in the south of France (Montpellier, Grasse, and the region of Marseilles), in continental Italy (Milan, Genoa, and Rome), and in Sardinia.

Table 2 summarizes the frequencies of subhaplotype Vb in North Africa, Iberia, the south of France, and Italy. The maximum value (63.5%) concerns the Berber population, but this frequency is notably lower (9.3%) for other populations from North Africa. In southern Iberia an elevated value (30%) is observed, but the frequency of subhaplotype Vb is only 1.8% in northern Iberia. These frequencies are less than 1% in France and Italy.

Figure 2 shows the isofrequency map of subhaplotype Vb in the western Mediterranean area (coordinates on the map: x = longitude, y = latitude). From the Berber focus in Berbers from southern Morocco, the frequencies of subhaplotype Vb decrease in North Africa to the north of Morocco and to the east in Algeria and Tunisia. For Iberia the most elevated value of subhaplotype Vb frequencies is in southern Portugal; relatively elevated values are observed in Andalusia, moderate values are observed in the southern part of Spain, and low values are seen in Catalonia.

In the present study all haplotype V non-subhaplotype Vb subjects are termed subhaplotype Va (Arab) subjects. Their maximum relative frequencies are 53.9% (Algeria), 50.6% (Tunisia), and 37.3% (Rabat) in North Africa. Table 3 summarizes the frequencies of subhaplotype Va in North Africa, Iberia, southern France, and Italy. The maximum value (45.8%) is found in North Africa. In northern Iberia a slightly more elevated value is observed (20%) compared to southern Iberia (14.6%). A frequency of 10.3% is seen in France, and in Italy the 14.6% value observed in the south is relatively more elevated than in the north (3.4%).
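Since Va is defined here simply as whatever part of haplotype V is not Vb, its frequency in any population should be the haplotype V frequency minus the Vb frequency. A minimal Python sketch of that subtraction, using only percentages quoted in this piece; the dictionary layout and the function name are illustrative assumptions, not anything taken from the paper:

```python
# Haplotype V and subhaplotype Vb frequencies (%) as quoted in the text above.
# The data layout and the function name are illustrative assumptions.
FREQUENCIES = {
    "South Portugal": (49.2, 35.9),
    "North Portugal": (25.3, 2.5),
    "Andalusia": (40.9, 25.4),
    "Sicily": (28.2, 5.1),
    "Naples": (17.2, 0.8),
    "Algeria": (56.7, 2.8),
}


def va_frequency(v_percent, vb_percent):
    """Return the Va frequency (%) as haplotype V minus subhaplotype Vb."""
    # Round to one decimal place to match the precision used in the text.
    return round(v_percent - vb_percent, 1)


va_by_population = {
    name: va_frequency(v, vb) for name, (v, vb) in FREQUENCIES.items()
}

for name, va in va_by_population.items():
    print(f"{name}: Va = {va}%")
```

The values this gives for North Portugal, Andalusia, Sicily, Naples, and Algeria line up with the Va percentages quoted elsewhere in the piece.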

Figure 3 gives the isofrequency map of subhaplotype Va. In North Africa frequencies decrease from east to west and southward. For southern Europe the map shows the relatively higher percentages observed in the south of Italy versus the north and (to a lesser degree) in the north of Iberia versus the south.

In our PCR assay the 68 Moroccan subjects with subhaplotype Vb (47 Berbers and 21 Arabs) were tested for the M81 marker: All subjects were positive for the M81 marker, so subhaplotype Vb is homologous with subhaplogroup E3b2. The 38 Moroccan non-Berber subjects were further tested for the M78 marker: Only 31 of them (80.8%) were positive for the M78 marker; we conclude that, in Morocco at least, subhaplotype Va corresponds only partly to subhaplogroup E3b1.

P49a,f TaqI haplotype V, which is homologous with haplogroup E3b according to the Y Chromosome Consortium (2002) nomenclature, is the predominant Y-chromosome haplotype in North Africa (Lucotte et al. 2000), where its geographic distribution shows an east to west cline. In the present study we have extended the research of haplotype V frequencies (Lucotte et al. 2001) in various European populations located in the western Mediterranean basin to include France, Portugal, and Italy. The frequency of haplotype V in the Marseilles region is 11.1%, a value similar to the mean value we obtained previously for continental France (Lucotte and Hazout 1996). In continental Italy we observed the highest haplotype V frequency in Naples (17.2%); Sicily, with a frequency of 28.2%, corresponds to the most elevated value we observed for Italy. In South Portugal the frequency of haplotype V is very high (49.2%); we had previously obtained a similar value for Libya and for Mauritania. The frequency of haplotype V for North Portugal (25.3%) is similar to the value we obtained for Sicily in the present study.

Y-chromosome DNA haplotypes in North African populations

To better divide haplotype V into its ethnic components, we have subdivided it into subhaplotypes Vb (Berber) and Va (Arab). We have established that subhaplotype Vb is the Berber haplotype, because it is present at very elevated frequencies (63.5%) in our Berber population from Morocco but at relatively low frequencies (20.6%) in our non-Berber population of Rabat. Such a distinction of a Berber component was also realized by Scozzari et al. (2001), because they observed that the haplogroup they named 25.2 was also more frequent in the Berber population from Morocco than in Arabs. Our present results show that subhaplotype Vb frequencies in North Africa decrease from west to east, starting from the Berber focus in Morocco; in the western Mediterranean area subhaplotype Vb is at low frequencies along the south coast of Europe but occurs at relatively elevated frequencies in southern Iberia (peaking at 35.9% in South Portugal). Flores et al. (2004), in their important study of various locations in Iberia, observed that subhaplogroup E3b2 is more frequent in southern Iberia, attaining a maximum value of 11.5% in the region of Málaga.

In the present study all the non-subhaplotype Vb subjects bearing haplotype V are classified as subhaplotype Va (Arab); they probably correspond to a heterogeneous group representing various ethnicities (our results concerning the incomplete correspondence between subhaplotypes Va and E3b1 in Morocco suggest that). We have shown here that in North Africa the focus of subhaplotype Va frequencies is in Algeria (53.9%) and Tunisia (50.6%); from this focus frequencies of subhaplotype Va decrease in the south and the west of the region.

Subhaplotype Va attains substantial frequencies along the southern coast of Europe, reaching relatively elevated values in France (Perpignan, 11.8%) and in southern Italy (Naples, 16.4%; Sicily, 23.1%). For Iberia, relatively more elevated values are attained for Andalusia (15.5%) and for North Portugal (22.8%). Brion et al. (2004) also showed relatively higher frequencies of haplogroup E* (xE3a) (up to 18.3%) in their study concerning northern Iberia.

We had previously established (Lucotte et al. 2001) that haplotype V showed a gradient of decreasing frequencies with latitude in Iberia, and we interpreted this pattern as a consequence of the historical Islamic occupation of the peninsula (Conrad 1998). The results reported in the present study concerning subhaplotypes Vb and Va (subhaplotype isofrequency maps given in Figures 2 and 3) have again shown both of these gradients. From this perspective, the opposite pattern of gradient frequencies observed in Iberia for the western European haplotype XV (Diéterlen and Lucotte 2005) is consistent with the slow reconquest of the Iberian peninsula from the north by the Christians, which lasted seven centuries and ended in Granada in 1492.

Unfortunately I can’t find the diagrams for this piece. This shows a larger Arab and Berber contribution than the other study I’ve seen on this. I shall have to have another look at it. Notably, for all you Afrocentrists who think Moors were black Africans, there are only Arab and Berber Y chromosomes found. Sub-Saharan ancestry only shows up at trace levels in these studies.

  1. Bravo, thank you Mathilda, you just confirmed what I was sure of. If you ever need some help with the medieval history (Arab/Moorish point of view), just contact me.

  2. Thanks for the info. This report shows more Arab and Berber influence in Iberia than previous reports. I think this one is closer to reality.

  3. I don’t understand. Haplotype V is homologous to E3b, old nomenclature. Spanish and Portuguese are high R1b. There are not so many E3b haplogroups there compared to R1b. The percentages of haplotype V don’t make sense. Arabs are high haplogroup J1, low haplogroup E3b. So there is an Arab version of E3b that differs from that found in Anatolia and Europe? Sorry, please explain how it is possible for such high E3b in the Iberian population or even the Sicilian one for that matter.
    I am thinking this study is a crock.

  4. I swear I keep running into conflicting studies regarding the analysis of gene flow from North African/Middle Eastern populations to southern Europe. I know how controversial of a topic this is for many people so therefore I don’t really know who to believe heh.

    My guess is that North African ancestry should be slightly higher in Iberia than in Sicily because various parts of Iberia were ruled by the Moors for c. 7 centuries. Sicily was only decidedly occupied by the Moors (a.k.a. Saracens) for roughly 2 centuries. Although the Spanish Inquisition expelled many of the Muslims (and hence presumably “cleansed” the population of substantial North African ancestry in the process), Sicily also underwent a repopulation of its own with the Norman conquest in the 11th century as a large influx of mainland Italians and Frenchmen Latinized the island as many of the Muslims voluntarily left. I’m aware that the Normans were fairly tolerant of the Greek Orthodox Christians and Muslim Arabs in Sicily, but it’s also known that they actively promoted Catholicism and Latinization at the expense of the other cultures and consequently, many Muslims chose to leave. Furthermore, Frederick II expelled some of the Muslims to Lucera in Apulia on the Italian mainland or to Tunisia.

    Maybe some of our Spanish and Portuguese posters could inform me of what mainly transpired in Iberia during the Reconquista but I’m aware that Spain was largely repopulated from the north by the descendants of the Hispano-Roman and Visigothic inhabitants, with some Frenchmen participating in the Reconquista as well.

    As someone of largely Sicilian descent, I’m aware of the fact that of all the groups to have conquered and settled in Sicily, the Greeks had the greatest impact by far in terms of genetic contribution. The first great wave of Greek settlement was in ancient Greek times, and you had a later influx of Byzantine Greeks who fled Slavic invasions in mainland Greece in the Middle Ages when Sicily was a province of the Byzantine Empire (prior to its conquest by the Moors/Saracens). Furthermore, Greeks were never expelled from Sicily in any sizable numbers and were one of the few groups to have brought large numbers of women with them. Moors, Normans, and other conquerors brought only men. I’m frequently told that I have a Greek look myself. Admittedly I’ve also had a few people who thought I looked a bit Middle Eastern so it’s possible that I could have Arab blood, anything’s possible.

    • i have read your note.
      i am concerned with myown sicilian origin, the name miranti.
      al miranti= admiral. in polizzi generosa.

      are there any documents with names in palermo of moorish/berber inhabitants.

      i have my genealogy to 1700. ill try to get friends to dig more.
      i think i may need actual moorish records?

      mark miranda, new york

  5. HALLELUYAH. Hehe, good job!! I’ve been studying this subject (the Moors, who they were, blabla) for áááááááges!!!!! And I just couldn’t find the right answers.. Until now, ’cause this definitely will help a LOT!! Thank you!!!😀

  6. Thank you Mathilda for your effort, but I think there is something missing.
    For instance, J1 existed in North Africa before the Arabs and before the Phoenicians; on top of that, it is Je1 that is specific to Arabs and not J1, and they are the ones who say so.
    I did some research and found other, more interesting indications that deserve an analysis,

    such as the text of Sallust, which I have just translated:

    Sallust devoted chapters XVII to XIX of his work The War of Jugurtha to a digression on the country of North Africa and its inhabitants, according to Numidian traditions and the Punic books of King Hiempsal II.

    After a description of the country (borders, climate, fauna and flora), the historian introduces the Gaetulians and Libyans as the first inhabitants of Africa, « hard, rude, nourished on the flesh of wild animals, eating grass as animals do. »
    The demigod Hercules died in Spain according to « African belief », and his army, composed of various peoples, broke up. The Medes, Persians, and Armenians of his army crossed by boat to Africa and settled on the coast.
    The Persians settled « nearer the Ocean » to the west, living under the overturned hulls of their boats for lack of building materials. They formed an alliance by marriage with the Gaetulians. Led to move continuously, they gave themselves the name of “Nomads” (Numidians).
    Sallust considers the houses of the Numidian peasants to be proof of this tale, as they recall the hulls overturned by Hercules’s army.
    The Medes and the Armenians united with the Libyans. They « built strong places » and « traded with Spain ». Corrupting the name of the Medes, the native Libyans are said to have started calling them Moors (Maures).
    Later, the Persians and Gaetulians grew in power and established themselves west of Carthage under the name of Numidians. Finally, they annexed Libya.
    Almost the entirety of the north of Africa was annexed by the Numidians; « the losers blended in with the victors, who gave them their name of Numidians ».

    To conclude: if Sallust told the truth, this means that J1 has existed in North Africa and southern Spain since more than 3 thousand years before Jesus,

    because J1 is found among the Medes and Persians!!

    And from my studies on the Muslim presence in Spain I learnt that Berbers were the majority, and that the Arabs encouraged Persian artisans and scientists and Kurds to settle (come to live) in Andalusia.

    I like to have answers to my questions from your research mathilda

    • In reply to your comment. These ancient Roman historians are generally not very reliable if you compare them to contemporary historians and all things they say should be taken with a pinch of salt. The general rule of thumb with people who study ancient history in university is to compare with other information and trace back to the source of the information.

      You will find that many ancient writers take information from other ancient writers and are also writing about a culture without ever having had any kind of contact with it. So the information can be, and is, changed with each step away from the original source.

      The culture that is discussed here has no historical record written at the time these events take place therefore looking back using only word of mouth is also unreliable. Furthermore if the source is a myth then it is even more unreliable.

      Personally I do love to read ancient historians, however, they are not entirely reliable and should not be used as a source when searching for historical truth.

  7. This is one of the most overlapped and ridiculous conclusions drawn from some different data.
    The Berber influence in Iberia is about 4-7% in all the major studies (Y).
    In Iberia the most important haplogroup is R1b; in Portugal R1b reaches 60-70%, and in Spain (Basque Country) too.

