Reference builds and coordinates
A genomic coordinate only makes sense relative to a reference build. chr1:100000 in hg19 may not refer to the same biological position in hg38 or T2T.
The reference genome is the coordinate system for the analysis. Alignment, variant calling, annotation, liftover, microarray / SNP chip simulation, and gene filtering all depend on it. Mixing builds can silently produce wrong answers because a position can move, disappear, change chromosome naming, or overlap different annotations.
If you receive a BAM, CRAM, or VCF from a lab, save any metadata that identifies the build. If the build is unknown, treat the file cautiously until you can infer or verify it from headers, contig names, known marker positions, or the provider's documentation.
hg19 / GRCh37
Older but still common in genealogy tools, legacy annotations, and many older consumer datasets.
Some third-party tools and older SNP chip coordinates still expect hg19/GRCh37. That does not mean it is always better; it means you must match the expected coordinate system.
hg38 / GRCh38
Modern mainstream reference for many current short-read WGS pipelines and annotation resources.
Many current callers, annotation datasets, and public resources support hg38 well. If you are starting a new short-read workflow, this is often the practical default unless a downstream service requires another build.
T2T
T2T references cover many regions older references represented poorly, but tool and annotation support varies.
T2T can be valuable for difficult regions, but it may not be accepted by consumer tools, older annotation resources, or workflows that assume hg19/hg38 chromosome structures.
Do not mix builds casually. If your
BAM was aligned to hg38, call variants and annotate with hg38-compatible resources unless you intentionally lift over coordinates or re-align. Liftover is useful, but it is not perfect and can fail around complex or ambiguous regions.