All Work
Research

Byzantine DNA Ancestry Analyzer

Bioinformatics tool exploring Greek Byzantine heritage through genomic analysis

Personal Project·Feb 2026

62.04

My Byzantine Score

379

Ancient Samples

J2b-M12

My Y-DNA

7 Pages

Dashboard

Problem

Modern Greeks lack accessible tools to explore their Byzantine-era genetic heritage. Existing ancestry services focus on broad continental categories and miss the rich genetic substructure of the Byzantine world (330–1453 CE).

Approach

Built a comprehensive genomic analysis pipeline processing raw DNA data (MyHeritage ~700K SNP markers). Runs PCA with smartPCA/scikit-learn against the Allen Ancient DNA Resource (AADR v54+), supervised ADMIXTURE analysis (K=2–12 with cross-validation), genetic distance calculations (Euclidean + Fst), and haplogroup determination - all visualized in an interactive 6-page web dashboard.

Data Processing

Raw DNA ingestion

  • 23andMe / AncestryDNA parsing
  • SNP quality control
  • Reference panel merging

PCA Analysis

Population structure

  • Dimensionality reduction
  • Byzantine reference projection
  • Cluster identification

Admixture

Ancestry estimation

  • Supervised ADMIXTURE
  • K-component optimization
  • Confidence intervals

Reporting

Visual output

  • Ancestry composition chart
  • PCA scatter plot
  • Haplogroup context

Results

A working tool producing Byzantine ancestry composition reports. All screenshots show my own DNA results - Byzantine Score 62.04, Y-DNA haplogroup J2b-M12. Includes interactive PCA plots (West Eurasian overview, Mediterranean zoom, temporal animation), admixture bar charts, geographic heatmaps with Byzantine Empire boundary overlay, and haplogroup context.

Screenshots

My DNA results  - Byzantine Score 62.04 with Y-DNA J2b-M12 haplogroup and closest ancient match

My DNA results - Byzantine Score 62.04 with Y-DNA J2b-M12 haplogroup and closest ancient match

My DNA on PCA  - PC1 vs PC2, PC1 vs PC3, and Mediterranean zoom scatter plots showing my position among ancient populations

My DNA on PCA - PC1 vs PC2, PC1 vs PC3, and Mediterranean zoom scatter plots showing my position among ancient populations

My ancestry composition  - NMF-based admixture analysis at K=3 with cross-validation on my own DNA data

My ancestry composition - NMF-based admixture analysis at K=3 with cross-validation on my own DNA data

My genetic distances  - Top 50 closest ancient individuals to my DNA ranked by distance

My genetic distances - Top 50 closest ancient individuals to my DNA ranked by distance

My haplogroups  - Y-DNA J2b-M12 lineage analysis with Byzantine historical context

My haplogroups - Y-DNA J2b-M12 lineage analysis with Byzantine historical context

Methodology  - Technical details of the analysis pipeline with references

Methodology - Technical details of the analysis pipeline with references

Run Analysis  - Upload MyHeritage CSV to run the Byzantine ancestry pipeline

Run Analysis - Upload MyHeritage CSV to run the Byzantine ancestry pipeline

Technologies

PythonPLINKADMIXTUREsmartPCApandasNumPySciPyscikit-learnPlotlyFoliumFastAPIReactDocker