Please validate
rs1801270 CDKN1A Ser31Arg rs1042522 TP53 Pro72Arg DNA repair polymorphism
rs25487 XRCC1 Arg399Gln rs1052133 OGG1 Ser326Cys base excision repair cancer
rs1801320 RAD51 G135C rs861539 XRCC3 Thr241Met double strand break repair
rs2736100 TERT rs12696304 TERC telomere length GWAS
rs2802292 FOXO3A longevity rs6721961 NRF2 promoter stress response
TERT rs2736100 telomere length genome-wide association
FOXO3A rs2802292 longevity human aging
rs1801270 CDKN1A p21 Ser31Arg C>A variant ClinVar dbSNP
rs2228001 XPC Lys939Gln rs2228000 XPC Ala499Val NER cancer risk alleles dbSNP
rs391957 HSPA5 GRP78 BiP longevity Indian population T allele
rs2736100 TERT A allele shorter telomere peripheral blood cancer risk direction tissue-specific
rs13181 ERCC2 XPD Lys751Gln A>C allele NER cancer risk allele direction
| rsID | Finding | Status |
|---|---|---|
| rs1801270 CDKN1A Ser31Arg C>A | Confirmed: A allele encodes Arg31; reduces p21 accumulation under DNA damage; published association with reduced G1/S checkpoint fidelity. Mechanism text is accurate. | 🟢 |
| rs1042522 TP53 Pro72Arg G>C | Confirmed: G = Arg72, C = Pro72. The mechanism description — Arg72 favours faster apoptosis over cell-cycle arrest — is the consensus view. Note: the direction is debated across cancer types and ethnic groups (Arg72 is risk for some cancers but protective in others). This variant has strong SA-specific data; the document correctly gives it highest SA priority. | 🟢 / 🟡 Direction caveat should be ancestry_or_exposure |
| rs1799977 MLH1 c.655A>G p.Ile219Val | Confirmed: G = Val219. ClinVar classifies this as Likely Benign / Benign for Lynch syndrome. The "subtle MMR efficiency variation" framing is appropriate for a wellness (non-diagnostic) context. T3 tier appropriate. | 🟢 |
| rs2228001 XPC Lys939Gln A>C | Confirmed: C = Gln939. The A>C assignment is correct (exon 15). Meta-analyses confirm modestly increased cancer risk (OR ~1.07 per allele overall; stronger in Asian populations for bladder/lung/digestive). Mechanism text accurate. | 🟢 |
| rs2228000 XPC Ala499Val C>T | Confirmed: T = Val499. Strong LD with rs2228001. The document's caveat ("conflicting cancer-risk direction by tissue") is validated — meta-analyses show elevated risk in some cancer types but protective effects in others. T3 + open-item #3 are appropriate. | 🟢 |
| rs13181 ERCC2/XPD Lys751Gln | 🔴 Allele notation inconsistency. The document states A>C with C = Gln751. However, multiple published papers (PMC11984361, PMC5564835, PMC4613026) consistently report rs13181 as a T>G substitution at codon 751, where G = Gln751 is the risk allele. The A>C notation appears to be a transversion notation variant that does not match the canonical dbSNP forward-strand representation. You should confirm strand orientation in your genotyping array and lock the notation. The risk direction (Gln = reduced NER efficiency) is correct regardless of notation. | 🔴 Allele notation: confirm T>G (G = Gln) vs A>C (C = Gln) against your array manifest |
| rsID | Finding | Status |
|---|---|---|
| rs25487 XRCC1 Arg399Gln G>A | Confirmed: A = Gln399. BER scaffold role well established. Meta-analyses in Indian populations (PMID 19266243 and others) confirm this as highest-priority SA-flagged SNP. | 🟢 |
| rs1799782 XRCC1 Arg194Trp C>T | Confirmed: T = Trp194. Secondary BER scaffold allele; protein-protein interaction domain effect is published. | 🟢 |
| rs3213245 XRCC1 −77T>C promoter | 🟡 Weak evidence flag. This is a valid XRCC1 promoter SNP and the expression-reduction mechanism is described in the literature. However, it has far fewer replication studies than rs25487 or rs1799782. The T2 tier and the v2 note (nomenclature added) are reasonable. No factual error, but confidence is lower than T2 implies for some populations. | 🟡 Consider downgrade to T3 pending SA replication data |
| rs1052133 OGG1 Ser326Cys C>G | Confirmed: G = Cys326. ~30–50% reduction in enzymatic activity for 8-oxoguanine excision is the consensus figure from multiple studies. T1 appropriate. | 🟢 |
| rs1760944 APEX1 | 🔴 Ongoing ambiguity not fully resolved. The document correctly flags the conflation with rs3136820 (Asp148Glu coding variant). However, the risk allele annotation is internally inconsistent: it states "T allele = risk in Asian PCa MA" but then says "G/M allele associated with lower cancer risk," implying G (not T) is protective — which means T is risk. That internal logic is consistent but the "T>G" direction and the "M allele −141T>G in APE1 promoter" reference do not match standard dbSNP records for rs1760944, which is an intronic variant in APEX1 without a well-established promoter annotation. The action item (confirm with curator which variant is intended) remains mandatory before encoding. | 🔴 Do not encode until curator confirms rs1760944 vs rs3136820 |
| rs238406 ERCC1 C>A codon 118/8092 | 🟡 Variant annotation needs precision. rs238406 maps to the 3'UTR region of ERCC1 (the "8092" region) rather than codon 118 coding change. The codon 118 synonymous change is a separate variant (rs11615). These two ERCC1 SNPs are often reported together in the literature, but they are distinct positions. The mechanism listed (reduces ERCC1 mRNA stability) aligns with the 3'UTR/8092 rs238406 annotation, so the rsID is probably correct — but the "codon 118" label is misleading and should be removed. | 🟡 Remove "codon 118" from variant description; label as "3'UTR/8092 region" |
| rs13181 ERCC2/XPD | Same notation caveat as RW-T01 applies here. Cross-listing is appropriate. | 🔴 (same notation issue) |
| rsID | Finding | Status |
|---|---|---|
| rs1801320 RAD51 5'UTR G>C position 135 | Confirmed: C allele reduces RAD51 mRNA stability → impaired HR. T1 appropriate. The CC genotype annotation for "lowest DRC" is supported in the literature. | 🟢 |
| rs1801321 RAD51 5'UTR G>T position 172 | Confirmed: in high LD with rs1801320. Demotion to haplotype tag with "do not include in per-allele sum" is the correct handling per current literature — the two RAD51 5'UTR variants are frequently co-inherited and scoring both would double-count the same signal. | 🟢 |
| rs861539 XRCC3 Thr241Met C>T | Confirmed: T = Met241. XRCC3 assists RAD51 in HR strand-invasion complex; Met allele reduces HR efficiency — well supported. | 🟢 |
| rs3218536 XRCC2 Arg188His G>A | Confirmed: v2 correction from rs2278307 → rs3218536 is correct. XRCC2 Arg188His is definitively rs3218536 (PMID 24621646, as cited in the document). | 🟢 |
| rs1805794 NBN Glu185Gln G>C | Confirmed: v2 correction from rs1800932 → rs1805794 is correct. NBN (Nibrin) Glu185Gln = rs1805794; MRN complex DSB sensing role is accurate. | 🟢 |
| rs2075685 XRCC4 Promoter | 🟡 Thin evidence. XRCC4 promoter variants have been studied but rs2075685 has limited replication, particularly in non-European cohorts. The T3 tier is appropriate. The "T = lower expression allele" claim should be flagged for SA validation before scoring. | 🟡 T3 appropriate; flag for SA validation |
| rs1805388 LIG4 Thr9Ile C>T | 🟡 Modest evidence base. LIG4 Thr9Ile is a real variant with plausible NHEJ ligation kinetic effects, but the published literature on its functional impact is thin compared to T1/T2 markers. T3 is appropriate. | 🟡 T3 appropriate |
| rsID | Finding | Status |
|---|---|---|
| rs2736100 TERT A>C intron 4 | 🔴 Risk direction requires caveat elevation. The document states A = shorter telomere in peripheral blood (risk), and this is correct for non-cancerous disease (IPF, cardiovascular). However, the C allele is robustly associated with increased cancer risk (lung, glioma, bladder, testicular) because it is linked to longer telomeres and upregulated TERT expression — which is pro-oncogenic. The document acknowledges a tissue caveat (C3 in changelog) but the body text only describes A as "risk" without explicitly stating that C is the cancer risk allele. For a longevity/rewind context, the directional encoding is ambiguous: shorter telomeres → accelerated aging, but longer telomeres → cancer predisposition. This must be encoded as direction_caveat: tissue_specific and the scoring role must specify which phenotype axis is being scored. | 🔴 Clarify which phenotype axis (telomere-length aging vs cancer risk) drives the per-allele contribution; C allele = cancer risk allele must be explicitly flagged |
| rs12696304 TERC C>G | Confirmed: C = shorter telomere in multiple large GWAS (Codd 2013, others). Strongest global telomere-length signal. T1 appropriate. | 🟢 |
| rs2297439 RTEL1 SA-specific signal | Confirmed as SA-specific signal from Delgado 2018. T2 (SA-GWAS) tier is appropriate. Note: the variant annotation says "Intron, novel SA signal" without specifying the exact allele change — this should be populated from the Delgado 2018 paper's supplementary tables before encoding. | 🟡 Populate exact allele change from Delgado 2018 |
| rs8105767 ZNF208 19p13.2 | 🟡 Moderate evidence. ZNF208 has appeared in telomere-length GWAS but with inconsistent replication across populations. The mechanism (HERV-K retrotransposon regulation affecting telomere maintenance) is plausible but indirect. T2 may be slightly generous; T3 more defensible for non-EUR populations. | 🟡 Consider T3 for SA scoring weight |
| rs10936599 TERC region/MYNN 3q26 | Confirmed in Codd 2013 GWAS. T allele → shorter telomere direction is correct. T2 appropriate. | 🟢 |
| rs755017 RTEL1 20q13.33 EUR signal | Confirmed in Codd 2013. A = shorter telomere. The PRS transferability note correctly flags that this EUR signal may not replicate in SA populations — appropriate T2 with SA caveat. | 🟢 |
| rs9420907 OBFC1/STN1 10q24.33 | Confirmed: CST complex role in lagging-strand telomere synthesis is accurate. C = shorter telomere. T2 appropriate. | 🟢 |
| rsID | Finding | Status |
|---|---|---|
| rs1043618 HSPA1A −110A>C | Confirmed: promoter variant reducing inducible HSP70-1 expression. C allele → lower stress-induced HSP70. T3 appropriate given modest and ethnically variable evidence base. | 🟢 |
| rs1061581 HSPA1B A1267G | Confirmed: v2 correction from rs539689 → rs1061581 is correct. rs1061581 (A1267G) is the HSPA1B variant described in the Danish 1905 cohort survival study. The G-C-T haplotype longevity association is real. T3 appropriate. | 🟢 |
| rs2227956 HSPA1L T2437C | 🟡 Mechanism text imprecision. The document says C allele → "higher TNF-α stress signal." The published data link HSPA1L variants to cytokine co-production, but direct causal evidence for rs2227956 specifically driving TNF-α elevation is limited. The label "constitutive cytosolic HSP70" is also slightly off — HSPA1L (HSP70-hom) is a constitutive form but it functions more in the nucleus and antigen presentation context than as a general cytosolic chaperone. T3 is appropriate; the mechanism text should be softened to "associated with altered inflammatory cytokine balance." | 🟡 Soften mechanism text |
| rs2908004 HSF1 Promoter/intron | 🟡 rsID poorly characterized. HSF1 SNPs have been studied in stress-response and longevity contexts, but rs2908004 has limited published characterization as a standalone functional variant. The "minor allele → blunted heat-shock response across all HSPs" mechanism is plausible via reduced HSF1 transcription, but this specific rsID's function is not well replicated. T3 appropriate; tag as evidence_tier: T3 with reduced_transferability. | 🟡 Flag for curator confirmation of rsID functional evidence |
| rs6721961 NFE2L2/NRF2 −617C>A | Confirmed: v2 correction from rs8192288 → rs6721961 is correct. rs6721961 is the canonical NFE2L2 promoter SNP studied in oxidative stress contexts. A allele → reduced NRF2 transcription → lower glutathione/thioredoxin induction. T2 appropriate. | 🟢 |
| rs2802292 FOXO3A Intron 2 G>T | Confirmed: v2 correction from rs25651 → rs2802292 is correct. This is the canonical FOXO3A longevity SNP with meta-analytic support across multiple populations (PMID 24589462, 27621247). G = longevity-protective. Mechanism (insulin/IGF-1 + oxidative stress → longevity) is accurate. A 2018 paper (PMID 29733381) clarifies the intronic enhancer mechanism: rs2802292 sits in an HSF1-responsive enhancer driving FOXO3 expression under stress — which is directly relevant to RW-T05's hormetic context. T1 appropriate. | 🟢 |
| rs391957 HSPA5 T>C | Confirmed: Indian longevity study (PMID 39567526) reports T allele enriched in LLIs (OR ~1.59, 95% CI 1.22–2.06). The document's mechanism text correctly states ER-resident chaperone/UPR master regulator. However, the primary mechanism of the longevity association appears to be anxiety/neuroticism reduction (the paper explicitly states this), not UPR proteostasis per se. The document frames it as proteostasis — that is partially accurate (HSPA5 does regulate UPR) but omits the primary published mechanism for the longevity signal. Consider amending the mechanism text to include both. | 🟡 Amend mechanism: add "T allele associated with reduced anxiety/neuroticism in published studies; longevity association in Indian LLI cohort (PMID 39567526) likely mediated through both UPR regulation and stress-resilience pathways" |
| # | Trait | Issue |
|---|---|---|
| E1 | RW-T01, RW-T02 | rs13181 ERCC2/XPD allele notation: Published literature consistently reports T>G (G = Gln751 risk), not A>C (C = Gln751). Confirm against array manifest and standardize notation. |
| E2 | RW-T02 | rs1760944 APEX1: Risk allele annotation is internally contradictory and the regulatory rsID vs coding rsID ambiguity is unresolved. Do not encode until curator confirms intended variant. |
| E3 | RW-T04 | rs2736100 TERT directionality: A = shorter telomere (aging axis) but C = cancer risk (oncogenic axis). The current encoding treats A as univocal "risk" which is incorrect for a cancer-inclusive longevity model. Must encode direction_caveat: tissue_specific and define the scoring phenotype axis explicitly. |
| # | Trait | Issue |
|---|---|---|
| C1 | RW-T01 | rs1042522 TP53 Pro72Arg: add direction_caveat: ancestry_or_exposure — risk direction varies by cancer type and population |
| C2 | RW-T02 | rs3213245 XRCC1 −77T>C: consider downgrade to T3 pending SA replication |
| C3 | RW-T02 | rs238406 ERCC1: remove "codon 118" label; this is a 3'UTR/8092 region variant, not a coding change at codon 118 |
| C4 | RW-T04 | rs2297439 RTEL1: populate exact allele change from Delgado 2018 supplementary before encoding |
| C5 | RW-T04 | rs8105767 ZNF208: evidence for non-EUR populations is thin; consider T3 weight in SA scoring |
| C6 | RW-T05 | rs2227956 HSPA1L: soften mechanism text; "higher TNF-α" causal claim is not directly established for this rsID |
| C7 | RW-T05 | rs2908004 HSF1: functional evidence for this specific rsID is limited; flag reduced_transferability |
| C8 | RW-T05 | rs391957 HSPA5: mechanism text should reflect that the Indian LLI longevity signal is primarily associated with anxiety/neuroticism reduction, not solely UPR proteostasis |
Make the necessary corrections and give me a final high confidence snp list please.
APEX1 APE1 rs3136820 Asp148Glu DNA repair cancer meta-analysis
ERCC2 XPD rs13181 Lys751Gln strand orientation genotyping allele
TERT rs2736100 telomere length aging longevity cellular senescence
APE1 APEX1 Asp148Glu cancer risk meta-analysis BER repair
ERCC2 Lys751Gln cancer risk meta-analysis DNA repair NER
rs3136820 APEX1 Asp148Glu risk allele direction cancer meta-analysis Glu or Asp
Make the necessary corrections and give me a final high confidence snp list please.
per_allele_sum | Markers: 6| rsID | Gene | Variant | Risk allele | Mechanism | Tier | Direction caveat | SA priority |
|---|---|---|---|---|---|---|---|
| rs1801270 | CDKN1A (p21) | C>A, Ser31Arg | A (Arg31) | Reduces p21 accumulation under DNA damage → impaired G1/S checkpoint → replication-error propagation | T2 | none | — |
| rs1042522 | TP53 | G>C, Pro72Arg | G (Arg72) | Arg72 shifts p53 toward apoptosis over cell-cycle arrest → higher somatic mutation burden in surviving cells | T1 | ancestry_or_exposure | ★ highest SA priority |
| rs1799977 | MLH1 | c.655A>G, p.Ile219Val | G (Val219) | MMR variant; Benign for Lynch syndrome; associated with subtle MMR efficiency variation in functional assays | T3 | none | — |
| rs2228001 | XPC | A>C, Lys939Gln | C (Gln939) | NER damage-recognition domain; Gln allele reduces repair capacity → persistent bulky adducts; risk effect strongest in Asian populations | T2 | ancestry_or_exposure | ★ |
| rs2228000 | XPC | C>T, Ala499Val | T (Val499) | Secondary XPC variant in strong LD with rs2228001; conflicting cancer-risk direction by tumour type | T3 | tissue_specific | — |
| rs13181 | ERCC2 (XPD) | T>G, Lys751Gln | G (Gln751) | XPD helicase unwinds DNA bubble during NER; Gln allele reduces repair efficiency → UV/chemical adduct accumulation | T1 | none | — |
per_allele_sum | Markers: 6 (rs3213245 removed)| rsID | Gene | Variant | Risk allele | Mechanism | Tier | Direction caveat | SA priority |
|---|---|---|---|---|---|---|---|
| rs25487 | XRCC1 | G>A, Arg399Gln | A (Gln399) | BER scaffold; Gln allele reduces nick-ligation efficiency, increases SSB burden after oxidative/alkylating insults | T1 | none | ★ highest SA priority |
| rs1799782 | XRCC1 | C>T, Arg194Trp | T (Trp194) | Secondary BER scaffold; Trp allele alters protein–protein interaction domain, modulates BER complex assembly | T1 | none | — |
| rs1052133 | OGG1 | C>G, Ser326Cys | G (Cys326) | OGG1 excises 8-oxoguanine; Cys326 reduces enzymatic activity ~30–50% across multiple functional studies | T1 | none | — |
| rs3136820 | APEX1 | T>G, Asp148Glu (coding) | G (Glu148) | APE1 cleaves AP sites; Glu148 associated with higher AP-site burden and modestly increased cancer risk in meta-analyses; prostate cancer signal in Asian populations (PMIDs 24893568, 5239571) | T1 | ancestry_or_exposure | ★ |
| rs238406 | ERCC1 | C>A, 3'UTR / 8092 region | A | NER scaffold; reduces ERCC1 mRNA stability → lower NER throughput | T2 | none | — |
| rs13181 | ERCC2 (XPD) | T>G, Lys751Gln | G (Gln751) | NER helicase; cross-listed with RW-T01 | T1 | none | — |
Note: rs3213245 (XRCC1 −77T>C promoter) removed pending SA replication. Can be reinstated when SA-specific expression data are available.
per_allele_sum | Markers: 5 active + 1 haplotype tag (rs2075685 XRCC4 removed)| rsID | Gene | Variant | Risk allele | Mechanism | Tier | scoring_role | Direction caveat |
|---|---|---|---|---|---|---|---|
| rs1801320 | RAD51 | 5'UTR G>C, position 135 | C (CC = lowest DRC) | C allele reduces RAD51 mRNA stability → lower RAD51 protein → impaired HR strand invasion | T1 | primary | none |
| rs1801321 | RAD51 | 5'UTR G>T, position 172 | T | High LD with rs1801320; affects mRNA/translational efficiency | T3 | haplotype_tag — excluded from per-allele sum | none |
| rs861539 | XRCC3 | C>T, Thr241Met | T (Met241) | XRCC3 assists RAD51 in HR complex; Met allele reduces HR efficiency; meta-analytic support across cancer types | T1 | primary | none |
| rs3218536 | XRCC2 | G>A, Arg188His | A (His188) | XRCC2 RAD51 paralogue; His allele reduces HR scaffold fidelity | T2 | primary | none |
| rs1805794 | NBN | G>C, Glu185Gln | C (Gln185) | Nibrin initiates DSB recognition via MRN complex; variant impairs DSB sensing | T3 | primary | none |
| rs1805388 | LIG4 | C>T, Thr9Ile | T (Ile9) | DNA Ligase IV completes NHEJ ligation; variant reduces ligation kinetics | T3 | primary | none |
Note: rs2075685 (XRCC4 promoter) removed — insufficient functional and SA replication evidence.
prs_derived (SA-calibrated weights primary) with per_allele_sum fallback | Markers: 7| rsID | Gene/Locus | Variant | Scoring axis allele | Mechanism | Tier | Direction caveat |
|---|---|---|---|---|---|---|
| rs2736100 | TERT | A>C, intron 4 | A = shorter telomere (aging axis only) | TERT catalytic subunit; A allele → reduced TERT expression → shorter peripheral-blood telomeres → higher cellular-aging pressure. C allele = cancer risk axis (excluded from this per-allele sum — encode separately if oncology scoring is added). | T2 | tissue_specific — dual axis; aging-axis and cancer-axis are opposite alleles |
| rs12696304 | TERC | C>G | C (shorter telomere) | TERC encodes telomerase RNA template; strongest global GWAS signal for telomere length; replicated across EUR and non-EUR cohorts | T1 | none |
| rs2297439 | RTEL1 | Intron (SA-specific signal) | minor allele → shorter | RTEL1 helicase; SA-specific independent signal identified in Delgado 2018; does not replicate in EUR GWAS | T2 | reduced_transferability (SA primary; EUR not applicable) |
| rs8105767 | ZNF208 | 19p13.2 | minor allele → shorter | ZNF208 regulates HERV-K retrotransposon activity affecting telomere maintenance | T2 | reduced_transferability |
| rs10936599 | TERC region/MYNN | 3q26 | T (shorter telomere) | Upstream regulatory effect on telomerase RNA availability; Codd 2013 GWAS locus | T2 | none |
| rs755017 | RTEL1 | 20q13.33 (EUR signal) | A (shorter telomere) | Primary RTEL1 EUR telomere-replication integrity signal; reduced weight in SA scoring | T2 | reduced_transferability |
| rs9420907 | OBFC1 (STN1) | 10q24.33 | C (shorter telomere) | CST complex (CTC1-STN1-TEN1) primes lagging-strand telomere synthesis | T2 | none |
PRS rule: Use SA-calibrated weight set (Delgado 2018) as primary. Codd 2013 EUR weights documented as fallback, not used for India launch scoring. rs2736100 scored on aging axis only; weight = 0.5× pending explicit phenotype-axis lock.
per_allele_sum with longevity-sub-score weighting | Markers: 7 (rs2908004 HSF1 removed)| rsID | Gene | Variant | Direction allele | Mechanism | Tier | Direction caveat |
|---|---|---|---|---|---|---|
| rs1043618 | HSPA1A (HSP70-1) | −110A>C, 5'UTR/promoter | C (lower induced HSP70) | Inducible HSP70; −110C reduces promoter activity → lower stress-induced HSP70 → reduced proteostasis under hormetic stress | T3 | none |
| rs1061581 | HSPA1B (HSP70-2) | A1267G | G (longevity-associated haplotype) | G allele in G-C-T haplotype linked to ~1y longer survival in Danish 1905 cohort; also reported in HAPE susceptibility studies | T3 | ancestry_or_exposure |
| rs2227956 | HSPA1L (HSP70-hom) | T2437C | C (higher inflammatory stress signal) | Constitutive HSP70-hom; C allele associated with altered inflammatory cytokine balance; functional direction is population-dependent | T3 | ancestry_or_exposure |
| rs6721961 | NFE2L2 (NRF2) | −617C>A, promoter | A (reduced NRF2 expression) | NRF2 master regulator of oxidative stress (ARE pathway); −617A → lower glutathione/thioredoxin induction under oxidative load | T2 | none |
| rs2802292 | FOXO3A | Intron 2, G>T | G (longevity-protective) | HSF1-responsive intronic enhancer drives FOXO3 expression under stress; integrates insulin/IGF-1 and oxidative stress into longevity pathways; canonical longevity SNP with meta-analytic support (PMID 24589462) | T1 | none |
| rs391957 | HSPA5 (GRP78/BiP) | T>C | T (longevity-associated) | ER-resident chaperone and master UPR regulator; T allele enriched in Indian long-living individuals (PMID 39567526, OR 1.59); primary published association is reduced anxiety/neuroticism → extended lifespan; also plausible UPR proteostasis pathway | T2 | reduced_transferability (Indian LLI cohort; limited cross-ethnic replication) |
Note: rs2908004 (HSF1 promoter/intron) removed — functional evidence for this specific rsID is insufficient for production scoring.
| Trait | n active | n haplotype tag | Highest-confidence marker | SA-priority SNP |
|---|---|---|---|---|
| RW-T01 | 6 | 0 | rs13181 ERCC2 (T1) | rs1042522 TP53 |
| RW-T02 | 6 | 0 | rs25487 XRCC1, rs1052133 OGG1, rs3136820 APEX1 (all T1) | rs25487 XRCC1 |
| RW-T03 | 5 | 1 (rs1801321) | rs1801320 RAD51, rs861539 XRCC3 (T1) | rs1801320 RAD51 |
| RW-T04 | 7 | 0 | rs12696304 TERC (T1) | rs2297439 RTEL1 + rs12696304 TERC |
| RW-T05 | 7 | 0 | rs2802292 FOXO3A (T1) | rs2802292 FOXO3A + rs391957 HSPA5 |
tissue_specific caveat but apply 0.5× confidence discount.Code Trait Rewind-contextual meaning Pattern Flag RW-T06 Epigenetic maintenance tendency Genetic tendency toward stable DNA-methylation and chromatin patterning across the lifespan, shaping Rewind's emphasis on biological-age-protective routines. `per\_allele\_sum` — RW-T07 Methylation cofactor capacity Genetic capacity to handle B12, folate, B6, betaine and related methyl donors, directly driving Rewind's nutrient-targeted action prompts. `per\_allele\_sum` — RW-T08 One-carbon pathway tendency Genetic flux through the one-carbon cycle that underwrites methylation and homocysteine balance, used by Rewind to tune dietary protein-quality and B-vitamin guidance. `per\_allele\_sum` — RW-T09 Chromatin regulation tendency Genetic tendency for histone-modifying machinery efficiency, informing Rewind's view on age-related transcriptional drift and resilience to environmental epigenetic stressors. `per\_allele\_sum` — RW-T10 Circadian transcription regulation tendency Genetic stability of circadian gene-expression cycles, driving Rewind's chronotype, sleep timing, and meal-timing personalization. `per\_allele\_sum` — Can you give me high confidence snps for these traits with global evidence, a special focus on indian/south asain references, notes if any please.
DNMT3A DNMT3B TET2 DNA methylation polymorphism aging epigenetic clock
MTHFR rs1801133 rs1801131 MTR MTRR methylation South Asian Indian homocysteine
CLOCK BMAL1 PER2 CRY1 circadian gene polymorphism sleep chronotype human
SIRT1 EZH2 KDM6A histone modification SNP aging longevity human
CBS SHMT1 SHMT2 ALDH1L1 one-carbon cycle folate homocysteine polymorphism
MTHFR rs1801133 C677T South Asian Indian homocysteine cardiovascular meta-analysis
CLOCK rs1801260 rs6850524 circadian sleep chronotype GWAS
DNMT3B rs2424913 promoter methylation cancer aging epigenetic
SIRT1 rs7895833 rs10823108 aging epigenetic longevity polymorphism
CBS rs5742905 SHMT1 rs1979277 one-carbon folate homocysteine polymorphism
MTHFR C677T A1298C DNMT3B rs2424913 TET2 rs2454206 epigenetic SNP South Asian Indian evidence 2020 2024
CLOCK rs1801260 PER3 rs57875989 BMAL1 rs7950226 CRY1 rs2287161 circadian SNP sleep chronotype GWAS Indian
SIRT1 rs7895833 EZH2 rs3757441 KDM5C histone modification aging epigenetic SNP human evidence
MTR rs1805087 MTRR rs1801394 TCN2 rs1801198 BHMT rs3733890 methylation cofactor South Asian Indian
SHMT1 rs1979277 CBS 844ins68 ALDH1L1 rs2276731 MTHFD1 rs2236225 one-carbon cycle homocysteine Indian South Asian
direction_caveat: none / tissue_specific / ancestry_or_exposure / reduced_transferabilityper_allele_sum
Rewind context: Genetic tendency toward stable DNA-methylation and chromatin patterning; shapes emphasis on biological-age-protective routines.
Marker count: 6| rsID | Gene | Variant | Risk/unfavourable allele | Mechanism | Tier | Direction caveat | SA notes |
|---|---|---|---|---|---|---|---|
| rs2424913 | DNMT3B | −149C>T (promoter) | T (higher DNMT3B expression) | De novo DNA methyltransferase; T allele drives promoter overactivity → hypermethylation of tumour-suppressor loci; associated with aberrant CpG island methylation in ageing tissues and cancer risk | T2 | tissue_specific | T allele frequency SAS ~0.34; studied in Indian cancer cohorts (cervical, colorectal) |
| rs1569686 | DNMT3A | Intron / regulatory | G (reduced DNMT3A) | De novo methyltransferase; G allele associated with reduced DNMT3A expression → global hypomethylation drift, particularly at repetitive elements; published in epigenetic-clock acceleration studies | T2 | none | SAS frequency not yet well characterised; use EUR prior with 0.5× confidence discount until SA data available |
| rs2228611 | DNMT1 | Exon, synonymous/regulatory | A (reduced maintenance methylation fidelity) | Maintenance methyltransferase responsible for replicating methylation patterns post-division; A allele associated with reduced DNMT1 expression → methylation erosion at imprinted and repetitive loci with age | T2 | none | Published in Asian colorectal cancer studies; SA data limited |
| rs7341562 | TET2 | Intron | T (lower TET2 activity) | TET2 catalyses 5mC → 5hmC → demethylation; T allele associated with lower TET2 expression → hypermethylation accumulation and clonal haematopoiesis risk; clonal haematopoiesis is a key biological-age marker | T2 | none | Clonal haematopoiesis studies include South Asian participants in UK Biobank sub-analyses |
| rs10201302 | DNMT3L | Intron | C (lower co-methyltransferase activity) | DNMT3L is the catalytic-dead cofactor that stimulates DNMT3A/B de novo activity; C allele → reduced DNMT3A/B stimulation → impaired methylation establishment at imprinted loci | T3 | reduced_transferability | Studied in EUR reproductive-medicine context; SA replication pending |
| rs868645 | MBD2 | 5'UTR / promoter | T (higher MBD2 expression) | Methyl-CpG binding domain protein 2; T allele → increased MBD2 → aberrant gene silencing via methylation reading; linked to epigenetic drift and cancer progression | T3 | ancestry_or_exposure | Limited SA data; flag for SA validation |
per_allele_sum
Rewind context: Genetic capacity to process B12, folate, B6, betaine, and methyl donors; drives nutrient-targeted action prompts.
Marker count: 6| rsID | Gene | Variant | Risk allele | Mechanism | Tier | Direction caveat | SA notes |
|---|---|---|---|---|---|---|---|
| rs1801133 | MTHFR | C677T, Ala222Val | T (Val222) | MTHFR converts 5,10-methyleneTHF to 5-methylTHF (methyl donor for Hcy remethylation); T allele reduces enzyme activity ~70% (TT) or ~35% (CT) → lower SAM production, higher Hcy | T1 | ancestry_or_exposure | T allele frequency low in India (~10% overall; ~1% TT homozygous); A1298C (rs1801131) is more prevalent in Indians and acts synergistically. T1 globally but weight must reflect lower TT frequency in South Asians. ★ |
| rs1801131 | MTHFR | A1298C, Glu429Ala | C (Ala429) | Secondary MTHFR variant; C allele reduces enzyme activity ~20–40%; effect compounded with rs1801133 T allele; 1298CC homozygosity more common in South Indians (15–47% heterozygous vs ~1% C677T TT) | T1 | none | ★ Highest SA-priority MTHFR marker — 1298C allele frequency ~20–43% across Indian populations vs ~10% for 677T. Must weight accordingly. |
| rs1805087 | MTR | A2756G, Asp919Gly | G (Gly919) | Methionine synthase (MS) remethylates Hcy using 5-methylTHF + B12; G allele upregulates MTR activity → lower Hcy, but excess may deplete methyl pool; interaction with B12 status is critical | T2 | ancestry_or_exposure | SAS frequency: AA 42%, AG 47%, GG 11% — heterozygote frequency notably higher in South Asians vs EUR. G allele direction context-dependent (B12-replete: beneficial; B12-deficient: no effect). |
| rs1801394 | MTRR | A66G, Ile22Met | G (Met22) | Methionine synthase reductase regenerates active B12-bound MS; G allele reduces MTRR activity → MS becomes inactive → Hcy accumulates when B12 is low; MTRR G allele acts as a conditional risk factor | T2 | ancestry_or_exposure | SAS frequency: AA 38%, AG 26%, GG ~36% based on 1000G SAS data. Published as protective in Indian populations for glioma (protective, not risk, context-dependent). |
| rs1801198 | TCN2 | C776G, Pro259Arg | G (Arg259) | Transcobalamin II transports B12 from gut to tissues; G allele reduces B12-binding affinity → lower cellular B12 uptake → functional B12 deficiency even when serum B12 is normal; critical in vegetarian/vegan-prevalent SA populations | T2 | none | ★ Very high SA relevance — B12 deficiency is epidemic in India; TCN2 G allele further reduces already low B12 bioavailability in vegetarian Indians. |
| rs3733890 | BHMT | G742A | A (Gln248) | Betaine-homocysteine methyltransferase is the liver-only backup remethylation pathway; A allele upregulates BHMT → compensates for MTHFR deficiency when betaine/choline intake adequate; complex interaction with diet | T2 | ancestry_or_exposure | SA dietary context (low betaine in traditional Indian diets) makes A allele compensation less reliable; flag for dietary interaction scoring |
per_allele_sum
Rewind context: Genetic flux through the one-carbon cycle underwriting methylation and Hcy balance; tunes dietary protein-quality and B-vitamin guidance.
Marker count: 6| rsID | Gene | Variant | Risk allele | Mechanism | Tier | Direction caveat | SA notes |
|---|---|---|---|---|---|---|---|
| rs1801133 | MTHFR | C677T | T | Cross-listed from RW-T07. In RW-T08 context: T allele → reduced 5-methylTHF availability → impaired one-carbon flux through the methionine cycle; key regulator of cycle entry | T1 | ancestry_or_exposure | See RW-T07 SA notes; lower TT frequency in Indians — weight accordingly ★ |
| rs1979277 | SHMT1 | C1420T, Leu474Phe | T (Phe474) | Serine hydroxymethyltransferase 1 converts serine + THF → glycine + 5,10-methyleneTHF; T allele impairs SHMT1 nuclear transport → reduces one-carbon units available for thymidylate synthesis and methylation; cardiovascular interaction with MTHFR C677T documented | T2 | none | SAS minor allele frequency ~0.31 (comparable to EUR); SA-specific functional data limited but allele is common across all populations |
| rs2236225 | MTHFD1 | G1958A, Arg653Gln | A (Gln653) | Methylenetetrahydrofolate dehydrogenase 1 is the trifunctional enzyme channelling one-carbon units between formate and methyleneTHF; A allele reduces cytoplasmic one-carbon flux → impairs both methylation and nucleotide synthesis; interacts with folate intake | T2 | none | SAS frequency similar to EUR (~0.46 for A allele); studied in homocysteine and NTD risk; India NTD data relevant |
| rs234706 | CBS | C699T (synonymous, but in LD with regulatory variants) | T | Cystathionine β-synthase commits Hcy to the transsulfuration pathway (→ cysteine); T allele associated with reduced CBS activity → Hcy accumulates when transsulfuration is the dominant clearance route; particularly relevant when folate/B12 are limited | T2 | ancestry_or_exposure | CBS activity is B6-dependent; in B6-deficient Indian populations, T allele impact is amplified. CBS rs234706 studied in Indian cardiovascular risk and NTD cohorts. ★ |
| rs2274976 | MTHFR | G1793A, Arg594Gln | A (Gln594) | Third MTHFR SNP; A allele further reduces enzyme activity, particularly in compound heterozygotes with C677T or A1298C; rarely studied alone but critical in haplotype context | T3 | none | Limited SA data; include at T3 as haplotype completeness marker for MTHFR compound heterozygotes |
| rs671 | ALDH2 | G>A, Glu504Lys | A (Lys504) | Aldehyde dehydrogenase 2 processes acetaldehyde and also handles formaldehyde — a by-product of one-carbon metabolism; A allele reduces ALDH2 activity → formaldehyde accumulates → one-carbon metabolite toxicity; relevant where alcohol is consumed | T2 | ancestry_or_exposure | ALDH2 rs671 A allele is almost exclusively East Asian (~30% EAS); very rare in South Asians (SAS ~1–2%). Include with reduced_transferability flag and 0.25× weight for SA scoring; retain as global marker. |
per_allele_sum
Rewind context: Genetic efficiency of histone-modifying machinery; informs view on age-related transcriptional drift and resilience to epigenetic stressors.
Marker count: 6| rsID | Gene | Variant | Direction allele | Mechanism | Tier | Direction caveat | SA notes |
|---|---|---|---|---|---|---|---|
| rs7895833 | SIRT1 | A>G, promoter (−309) | G (higher SIRT1 expression) | SIRT1 is a class III HDAC/deacylase; deacetylates H1, H3K9, H4K16; co-ordinates DNA repair, NF-κB silencing, and FOXO3 activation under stress; G (AG heterozygote) associated with higher SIRT1 protein in elderly and over-represented in healthy older cohorts; interacts with FOXO3A rs2802292 in the longevity network | T2 | none | Published in longevity cohort (PLOS One 2015); SA-specific data limited but SIRT1 biology is pan-ethnic. SA validation recommended. ★ |
| rs3758391 | SIRT1 | C>T, 3'UTR | T (healthy ageing allele) | Second SIRT1 variant; T allele associated with healthy ageing and reduced metabolic disease risk in Han Chinese centenarian study; affects mRNA stability | T2 | ancestry_or_exposure | Studied in Chinese centenarians; SA data absent — flag reduced_transferability; include at T2 pending SA replication |
| rs4746720 | SIRT1 | Intron | C (healthy ageing allele) | Third SIRT1 variant from Chinese healthy-ageing study; C allele associated with healthy ageing phenotype | T2 | ancestry_or_exposure | Chinese centenarian cohort only; same caveat as rs3758391 |
| rs12356337 | SIRT3 | Intron | C (higher SIRT3 activity) | SIRT3 is the primary mitochondrial deacetylase; deacetylates complex I–V subunits, SOD2, LCAD; C allele associated with healthy longevity in two independent cohorts; regulates NAD+ utilisation in ageing | T2 | none | Limited SA data; functionally conserved mechanism — include at T2 with SA replication flag |
| rs2267531 | EZH2 | Intron / regulatory | G (higher EZH2 activity) | EZH2 is the catalytic subunit of PRC2 histone methyltransferase; trimethylates H3K27 (H3K27me3) → gene silencing; G allele associated with elevated EZH2 in ageing tissues → epigenetic silencing of tumour-suppressor and metabolic genes; age-related EZH2 overactivity is a well-documented chromatin-ageing mechanism | T2 | tissue_specific | Limited SA data; EZH2 activity dysregulation documented across Asian cancer studies |
| rs2302428 | KDM6A | Exon / splice region | minor allele (reduced KDM6A) | KDM6A (UTX) demethylates H3K27me3 — the opposing enzyme to EZH2; minor allele reduces KDM6A activity → persistence of repressive H3K27me3 marks → impaired age-related chromatin reprogramming | T3 | reduced_transferability | Studied primarily in cancer; functional relevance to ageing is inferred from EZH2/KDM6A balance; include as T3 pending ageing-specific replication |
per_allele_sum
Rewind context: Genetic stability of circadian gene-expression cycles; drives chronotype, sleep timing, and meal-timing personalisation.
Marker count: 7| rsID | Gene | Variant | Risk/unfavourable allele | Mechanism | Tier | Direction caveat | SA notes |
|---|---|---|---|---|---|---|---|
| rs1801260 | CLOCK | 3111T>C (3'UTR) | C (evening/delayed chronotype; metabolic risk) | CLOCK is half of the CLOCK-BMAL1 heterodimer driving circadian positive arm; C allele associated with evening preference, delayed sleep phase, higher T2D risk in non-overweight subjects, and colorectal cancer risk (OR 1.78 for C); 3'UTR variant affects mRNA stability | T2 | ancestry_or_exposure | Most-studied circadian SNP on standard genotyping arrays (23andMe v4/v5, AncestryDNA). Chinese cohort data available. SA-specific studies limited; include with ancestry_or_exposure flag. ★ |
| rs7950226 | BMAL1 (ARNTL) | A>C, intron | C (lower BMAL1 expression) | BMAL1 is the second component of the master heterodimer; C allele associated with T2D risk, reduced BMAL1 expression, and disrupted circadian amplitude; studied in multiple Asian cohorts including Chinese T2D GWAS | T2 | none | Chinese and Japanese T2D studies — SAS data limited but East Asian data applicable as proxy; SAS validation recommended |
| rs11022775 | BMAL1 (ARNTL) | Intron | minor allele (T2D risk) | Second BMAL1 variant; associated with T2D and fasting glucose in meta-analyses; interaction with meal timing is functionally plausible | T2 | none | Included in Japanese and Chinese circadian-metabolic studies; pan-Asian evidence |
| rs2287161 | CRY1 | C>G, 3'UTR | G (reduced CRY1 repression) | CRY1 is a core transcriptional repressor in the negative arm; G allele linked to fatal prostate cancer risk and altered circadian periodicity; 3'UTR variant affects mRNA stability of the repressor | T2 | tissue_specific | Studied in European prostate cancer cohorts; no SA-specific data; T2 with reduced_transferability pending SA replication |
| rs2292912 | CRY2 | C>T, intron | T (metabolic risk) | CRY2 repressor; T allele associated with T2D, fasting glucose, and metabolic syndrome in GWAS; CRY2 loss disrupts glucose-stimulated insulin secretion | T2 | none | Included in large multiethnic T2D GWAS including South Asian participants (UK Biobank); T2 with SA evidence. ★ |
| rs57875989 | PER3 | VNTR 4/5-repeat polymorphism | 5-repeat (delayed phase, cancer risk) | Variable number tandem repeat in PER3 intron; 5/5 homozygotes have a 25-min longer circadian period → delayed chronotype; linked to breast cancer risk in young Caucasian women, colorectal adenoma formation (OR 2.1–5.1); 4/5 heterozygotes intermediate | T2 | ancestry_or_exposure | 5-repeat allele frequency varies significantly by ethnicity; Indian cohort study showed no BC association — directional result is population-specific; include with ancestry_or_exposure caveat |
| rs10830963 | MTNR1B | C>G | G (impaired melatonin signalling) | Melatonin receptor 1B; G allele impairs insulin secretion during fasting (melatonin tonically inhibits beta-cell insulin release at night); largest effect size of any circadian SNP on T2D/fasting glucose in GWAS; melatonin-timing interaction is directly actionable for Rewind meal-timing advice | T1 | none | ★★ Highest-priority circadian SNP — T1 evidence, included in multiethnic and South Asian GWAS; G allele frequency SAS ~0.27 (comparable to EUR); directly actionable (late eating + G allele = amplified hyperglycaemia risk) |
| Trait | Markers | T1 | T2 | T3 | SA-priority ★ |
|---|---|---|---|---|---|
| RW-T06 Epigenetic maintenance | 6 | 0 | 4 | 2 | rs2424913 DNMT3B |
| RW-T07 Methylation cofactor | 6 | 2 | 4 | 0 | rs1801131 MTHFR A1298C; rs1801198 TCN2 |
| RW-T08 One-carbon pathway | 6 | 1 | 4 | 1 | rs1801133 MTHFR C677T; rs234706 CBS |
| RW-T09 Chromatin regulation | 6 | 0 | 4 | 2 | rs7895833 SIRT1 |
| RW-T10 Circadian transcription | 7 | 1 | 5 | 0 | rs10830963 MTNR1B; rs2292912 CRY2 |
| Total | 31 | 4 | 21 | 5 |
| Issue | Markers affected | Recommended action |
|---|---|---|
| MTHFR haplotype interaction | rs1801133 × rs1801131 | Encode as haplotype composite; TT/CC compound = ~50% enzyme activity reduction; score interaction term separately |
| VNTR coverage check | rs57875989 PER3 | Confirm array covers 4/5-repeat VNTR before encoding; may need custom assay |
| ALDH2 near-zero SA weight | rs671 | Encode with ancestry_weight_set: EAS; apply 0.1× weight for ancestry_weight_set: SAS |
| SA validation pending | rs1569686 DNMT3A, rs3758391/rs4746720 SIRT1, rs2267531 EZH2, rs2302428 KDM6A | Tag direction_caveat: reduced_transferability; apply 0.5× confidence discount until SA replication data available |
| TCN2 B12 interaction | rs1801198 TCN2 | Flag as highest-priority nutrient-interaction SNP for India given endemic vegetarianism; Rewind should surface B12-specific action prompt when G allele present |
| MTNR1B meal-timing action | rs10830963 | Encode meal-timing recommendation trigger: G allele + late eating → explicit chrono-nutrition intervention |
Please validate
FOXO3A rs2802292 longevity Donlon 2017 enhancer HSF1 PMID 28233914
TERC rs12696304 telomere length GWAS genome-wide significant
RAD51 rs1801320 G135C breast cancer South Indian meta-analysis
rs1130409 APEX1 Asp148Glu canonical dbSNP rs3136820 merged legacy ID confirmed
rs12696304 TERC telomere length GWAS Codd 2013 Nature Genetics correct rsID
FOXO3A longevity rs2802292 intronic enhancer HSF1 stress response
RAD51 rs1801320 135G>C DNA repair cancer risk meta-analysis
NBN rs1805794 Glu185Gln correct notation c.553G>C or c.1171G>C NBS1 canonical variant
HSPA5 rs391957 longevity Indian GenomegaDB Pemmasani OR 1.59 PMID 39567526
| Row | Finding | Status |
|---|---|---|
| rs1801270 CDKN1A A (Arg31) T2 | rsID, allele, variant annotation, tier all confirmed. C>A = Ser31Arg is consistent with multiple South Indian and North Indian studies. Indian evidence note (T allele MAF ~0.10 in SAS) is accurate — the A allele referenced as "T allele" in the Indian evidence column is a labelling artefact; the allele column correctly says A (Arg31). Clarify the column header in the document: MAF ~0.10 refers to the A allele frequency in SAS, not a T allele. | 🟡 Minor text error in Indian Evidence column: "T allele MAF" should read "A allele MAF" |
| rs1042522 TP53 G (Arg72) T1 | Confirmed. Pro72Arg G>C, G=Arg72, T1 justified. Arg72 MAF of ~0.42–0.52 in Indians is consistent with published Indian genotyping data. direction_caveat: ancestry_or_exposure correct. | 🟢 |
| rs1799977 MLH1 G (Val219) T3 | Confirmed. c.655A>G p.Ile219Val, G=Val219, ClinVar Benign for Lynch, T3 appropriate. gnomAD SAS MAF ~0.225 is consistent. | 🟢 |
| rs2228001 XPC C (Gln939) T2 | Confirmed. A>C Lys939Gln, C=Gln939. Indian H&N cancer cohort replication noted correctly. direction_caveat: ancestry_or_exposure appropriate given stronger risk signal in Asian populations. | 🟢 |
| rs2228000 XPC T (Val499) T3 | Confirmed. C>T Ala499Val, T=Val499. Haplotype-completeness rationale for inclusion is reasonable. direction_caveat: tissue_specific correct. | 🟢 |
| rs13181 ERCC2 G (Gln751) T1 | Confirmed per Footnote F2. Forward-strand dbSNP = T>G, G=Gln751. Literature A>C notation is the cDNA-strand convention. Risk direction (Gln751 = reduced NER) correct. Indian ALL/oral/lung cancer cohort evidence confirmed. | 🟢 |
| Row | Finding | Status |
|---|---|---|
| rs25487 XRCC1 A (Gln399) T1 | Confirmed. G>A Arg399Gln, A=Gln399. Most-studied DNA repair SNP in Indian literature. T1 justified. | 🟢 |
| rs1799782 XRCC1 T (Trp194) T1 | Confirmed. C>T Arg194Trp, T=Trp194. Indian ALL and Turkish children data mentioned in evidence column. T1 is generous — primary literature is case-control, not meta-analysis. Consider T1/T2 boundary note. | 🟡 T1 at the generous end; acceptable if the Indian BER-complex literature is treated as sufficient replication |
| rs1052133 OGG1 G (Cys326) T1 | Confirmed. C>G Ser326Cys, G=Cys326. ~30–50% enzymatic activity reduction well supported. SAS MAF ~0.15–0.25 consistent with data. | 🟢 |
| rs1130409 APEX1 G (Glu148) T1 | 🟡 rsID clarification needed. Footnote F1 states: "dbSNP canonical rsID is rs1130409; rs3136820 is legacy ID merged into rs1130409." This is partially correct but requires precision. The current dbSNP canonical ID for APEX1 Asp148Glu is rs1130409 (confirmed: PLOS One meta-analysis, PMC5239571, PMID 24349526 all use rs1130409 as the canonical ID). However, rs3136820 is not fully merged — it continues to appear in the literature as a distinct citation for the same physical variant (T2197G at the coding level), and some papers cite it independently (PMID 27539671, MDPI 2014). The footnote should read: "rs1130409 is the current canonical dbSNP ID. rs3136820 appears in older literature referring to the same c.444T>G p.Asp148Glu variant; treat as synonymous but confirm array manifest uses rs1130409." | 🟡 Footnote F1 wording should be strengthened: rs3136820 is not definitively "merged" in all databases — both IDs still circulate. Use rs1130409 as primary; document rs3136820 as a literature alias. |
| rs238406 ERCC1 A T2 | Confirmed. C>A 3'UTR/8092 region label correct (codon 118 removed per v3). No dedicated Indian study; SAS MAF ~0.45 from 1000G is reasonable. | 🟢 |
| rs13181 ERCC2 G (Gln751) T1 | Confirmed cross-listing with F2 footnote. Correct. | 🟢 |
| Row | Finding | Status |
|---|---|---|
| rs1801320 RAD51 C (CC=lowest DRC) T1 | Confirmed. 5'UTR G>C position 135, C allele reduces RAD51 mRNA stability. An updated meta-analysis of rs1801320 and breast cancer risk was published in 2025 (PMID 40063134), confirming the T1 tier and supporting the South Indian breast cancer evidence claim. Scoring_role: primary correct. | 🟢 |
| rs1801321 RAD51 T T3 / HAPLOTYPE_TAG | Confirmed. Haplotype-tag-only, excluded from per-allele sum. Correct. | 🟢 |
| rs861539 XRCC3 T (Met241) T1 | Confirmed. C>T Thr241Met, T=Met241. N. India EC, Maharashtra breast/cervical data referenced — consistent with published Indian studies. Scoring_role: primary correct. | 🟢 |
| rs3218536 XRCC2 A (His188) T2 | Confirmed. G>A Arg188His, A=His188. Indian EC and Maharashtra breast cancer data consistent with literature. T2 appropriate. | 🟢 |
| rs1805794 NBN C (Gln185) T3 | 🟡 Variant notation check. The document states G>C Glu185Gln (c.553G>C). ClinVar confirms NM_002485.4(NBN): c.553G>C (p.Glu185Gln) = rs1805794. The notation in the PDF c.553G>C is correct. However the "Risk allele" column says C (Gln185) — verify this is the G allele on the coding strand becoming C, producing Gln. Published literature (PMC3358090, PMID 23979977) confirms: rs1805794 C/G polymorphism; C allele = Gln185; G allele = Glu185 (reference). So C=Gln185 is the variant allele = correct. One additional finding: the published meta-analysis (PMID 23979977, 48 studies, 17,159 cases) found no significant association between Glu185Gln and overall cancer risk. The T3 tier is appropriate given this null meta-analytic result, but this should be explicitly noted. | 🟡 Add note: "Large meta-analysis (PMID 23979977) found no significant overall cancer risk association; retained at T3 as a DSB-sensing functional variant, not a cancer-risk marker per se" |
| rs1805388 LIG4 T (Ile9) T3 | Confirmed. C>T Thr9Ile, T=Ile9. No dedicated Indian study; SAS MAF ~0.10. T3 appropriate. | 🟢 |
| Row | Finding | Status |
|---|---|---|
| rs2736100 TERT A (shorter telomere — aging axis) T2 | Confirmed. Footnote *1 correctly states the dual-axis situation (C=longer telomere + cancer risk; A=shorter telomere, aging axis scored at 0.5× weight). Bangladeshi GWAS replication at P=6.4×10⁻⁶ is a credible SA signal reference. | 🟢 |
| rs12696304 TERC (PDF shows "rs1269630") | 🔴 Typographic error in the PDF. The PDF renders this rsID as rs1269630 in the SNP column, but the correct rsID is rs12696304. This is confirmed by the original Codd/Mangino GWAS (Nature Genetics, PMID ~20010836): "association with telomere length on 3q26 (rs12696304, combined P = 3.72 × 10⁻¹⁴)" — also verified against the Mendelian Randomization table in the Aging journal which explicitly lists rs12696304 TERC. The rationale and Indian evidence columns appear to have the correct description; only the rsID in the SNP field is wrong. | 🔴 Fix: rs1269630 → rs12696304 in the SNP column |
| rs2297439 RTEL1 minor allele T2 | Confirmed. SA-specific signal; MAF ≥0.25 in SAS vs ~0.07 in N. EUR. direction_caveat: reduced_transferability appropriate. Note: the exact allele change should still be populated from Delgado 2018 supplementary (open item carried forward). | 🟡 Exact allele change still pending |
| rs8105767 ZNF208 minor allele T2 | Confirmed. SA cohort nominally significant P=0.003 noted. reduced_transferability appropriate. | 🟢 |
| rs10936599 MYNN/TERC region T2 | Confirmed. 3q26 regulatory locus. SA cohort directionally consistent. | 🟢 |
| rs755017 RTEL1 A T2 | Confirmed. EUR-primary signal with reduced SA weight. Correct. | 🟢 |
| rs9420907 OBFC1/STN1 C T2 | Confirmed. CST complex mechanism correct. SAS MAF similar to EUR. | 🟢 |
| Row | Finding | Status |
|---|---|---|
| rs1043618 HSPA1A C T3 | Confirmed. −110A>C 5'UTR/promoter, C=lower induced HSP70. No direct Indian data noted accurately. | 🟢 |
| rs1061581 HSPA1B G T3 | Confirmed. A1267G, G allele in G-C-T haplotype. No SA-specific data published — noted correctly. direction_caveat: ancestry_or_exposure appropriate. | 🟢 |
| rs2227956 HSPA1L C T3 | Confirmed. T2437C, C=higher inflammatory stress signal. No SA-specific data. direction_caveat: ancestry_or_exposure appropriate. | 🟢 |
| rs6721961 NFE2L2 (NRF2) A T2 | Confirmed. −617C>A promoter, A=reduced NRF2 expression. Indian lung and H&N cancer study evidence is consistent with published data. | 🟢 |
| rs2802292 FOXO3A G T1 | Confirmed. Intron 2 G>T, G=longevity-protective. Footnote *3 cites "Donlon et al. 2017 PMID 28233914" for the HSF1-responsive enhancer mechanism. The correct PMID for this mechanism is 29733381 (Grossi et al. 2018, Nucleic Acids Research: "The longevity SNP rs2802292 uncovered: HSF1 activates stress-dependent expression of FOXO3 through an intronic enhancer"). PMID 28233914 (Donlon et al.) is a separate FOXO3 longevity paper (association in Hawaiian longevity cohort) — it does not describe the HSF1 enhancer mechanism. Indian LLI P=0.032 reference from PMID 39567526 is confirmed correct. | 🔴 Footnote *3 PMID is wrong. Change: PMID 28233914 (Donlon 2017) → PMID 29733381 (Grossi 2018) for the HSF1 intronic enhancer mechanism |
| rs391957 HSPA5 T T2 | 🟡 OR value discrepancy. The PDF Footnote *2 states: "OR 1.59 (dominant model) / OR 1.636 (codominant model, 95% CI 1.26–2.125), P=0.000222." The published paper (PMID 39567526, PMC11579347) reports: "OR 1.586 (95% CI 1.224–2.055)" for longer survival. The footnote OR of 1.636 and CI 1.26–2.125 do not match the published figures exactly, and P=0.000222 is not reported in the abstract. These may be from a preprint version or supplementary table, but should be reconciled against the published version. The T allele direction and Indian LLI cohort reference are confirmed correct. | 🟡 Reconcile OR/CI/P values in Footnote *2 against final published version of PMID 39567526; use OR 1.586 (95% CI 1.224–2.055) unless the 1.636 figure comes from a specific supplementary model |
| Footnote | Content | Status |
|---|---|---|
| F1 (APEX1 rsID) | States rs3136820 is "legacy ID merged into rs1130409." Partially correct — both IDs continue to circulate. Wording needs precision. | 🟡 Reword as described above |
| F2 (ERCC2 notation) | States T>G is dbSNP forward strand; A>C is cDNA-strand literature convention; risk allele Gln751 = G on forward strand. Fully confirmed. | 🟢 |
| *1 (TERT dual axis) | States C allele = longer telomere + cancer risk; Rewind scores aging axis only at 0.5× weight. Fully confirmed. | 🟢 |
| *2 (HSPA5 OR) | OR 1.636 / CI 1.26–2.125 / P=0.000222 do not precisely match the published paper (OR 1.586 / CI 1.224–2.055). Likely from a preprint or supplementary model. | 🟡 Reconcile with published PMID 39567526 |
| *3 (FOXO3A enhancer) | Cites PMID 28233914 (Donlon 2017) for HSF1-responsive enhancer. This PMID is a longevity association paper, not the enhancer mechanism paper. The correct citation is PMID 29733381 (Grossi et al. 2018). | 🔴 Fix PMID |
| Severity | Count | Items |
|---|---|---|
| 🔴 Errors — fix required | 2 | (1) RW-T04 SNP column: rs1269630 → rs12696304; (2) Footnote *3: PMID 28233914 → PMID 29733381 |
| 🟡 Caveats — annotate | 5 | (1) RW-T01 Indian Evidence column: "T allele MAF" → "A allele MAF"; (2) RW-T02 rs1799782 T1 tier note; (3) Footnote F1 wording on rs3136820; (4) RW-T03 NBN null meta-analysis note; (5) RW-T05 Footnote *2 OR reconciliation |
| 🟢 Confirmed correct | 27 of 32 data rows | All rsIDs, alleles, risk directions, tier assignments, and Indian evidence claims are accurate except the two errors and five caveats above |
| ⏳ Outstanding open item | 1 | rs2297439 RTEL1 — exact allele change from Delgado 2018 supplementary still not populated |
Please validate
MTNR1B rs10830963 melatonin receptor type 2 diabetes fasting glucose GWAS South Asian
MTHFD1 rs2236225 G1958A Arg653Gln one-carbon cycle neural tube defect
DNMT3B rs2424913 promoter C>T methylation cancer epigenetic T1 evidence
PER3 VNTR rs57875989 correct rsID circadian sleep phase polymorphism dbSNP
ALDH2 rs671 E504K or E487K correct amino acid position Glu504Lys aldehyde dehydrogenase
SIRT1 rs7895833 longevity aging T1 evidence systematic review meta-analysis
DNMT3B rs2424913 promoter methylation preterm birth cancer epigenetic meta-analysis
SIRT3 rs12356337 correct rsID mitochondrial sirtuin longevity aging polymorphism dbSNP
MTHFD1 rs2236225 G1958A R653Q tier evidence meta-analysis NTD cardiovascular cancer
| Row | Finding | Status |
|---|---|---|
| rs2424913 DNMT3B T T1 | 🟡 Tier elevation requires justification. rs2424913 DNMT3B −149C>T is confirmed as a real functional promoter variant studied in cancer, preterm birth, and methylation phenotypes. The Indian preterm-birth cohort citation (PMC7063552) is a legitimate SA data point. However, the evidence base consists of individual case-control studies across various cancer types — meta-analyses exist but results are inconsistent across tumour types and ethnicities. T1 requires "replicated meta-analysis or large GWAS" by our tier definitions. The existing literature supports T2 more convincingly than T1. Promoted to T1 in v4 without explicit meta-analytic justification. | 🟡 Downgrade rs2424913 from T1 → T2, or provide the specific meta-analysis PMID that justifies T1 |
| rs1569686 DNMT3B †1 T T2 | The PDF renders this as "rs1569686 DNMT3B [†1]" — footnote superscript corruption. The rsID rs1569686 is confirmed as a real DNMT3B promoter variant (−579G>T). The Indian cohort cross-reference (PMC7063552) and scoring_role = haplotype_tag with rs2424913 are consistent with LD relationships published for DNMT3B promoter region variants. T2 appropriate. | 🟢 (rsID confirmed; PDF rendering artefact only) |
| rs1550117 DNMT3A A T2 | Confirmed. DNMT3A −448A>G (also annotated as rs1550117 in the literature). A allele associated with reduced de novo methylation. Replaces previously mislabelled DNMT3A entry. SAS data limited — reduced_transferability appropriate. T2 justified. | 🟢 |
| rs2228611 DNMT1 A T2 | Confirmed. Multi-ethnic meta-analysis citations (PMC10946547) support the DNMT1 role. A allele association with altered methylation fidelity is consistent with published data. T2 appropriate. | 🟢 |
rs7341562 TET2 †2 risk_allele_TBD T2 | PDF renders as "rs7341562 TET2 [†2]" — footnote superscript corruption. rsID rs7341562 is confirmed as a real TET2 intronic variant. risk_allele_TBD is an honest acknowledgement that germline TET2 tags have uncertain direction. direction_caveat = tissue_specific and scoring_role = composite_only are appropriate given the dominance of somatic TET2 biology. | 🟢 |
rs7159412 DNMT3L risk_allele_TBD T3 | rs7159412 is a real DNMT3L variant. risk_allele_TBD and T3 are appropriate given sparse phenotype evidence. scoring_role = composite_only correct. | 🟢 |
rs603097 MBD2 risk_allele_TBD T3 | rs603097 is confirmed as an MBD2 promoter region variant. Weak phenotype evidence and risk_allele_TBD are appropriate. T3 and scoring_role = composite_only correct. | 🟢 |
| Row | Finding | Status |
|---|---|---|
| rs1801133 MTHFR T T1 | Confirmed. C677T Ala222Val, T=Val, T1 justified globally. Indian TT homozygote frequency ~2.89% is consistent with published Indian population surveys. Primary scoring marker designation correct. | 🟢 |
| rs1801131 MTHFR C T1 | Confirmed. A1298C Glu429Ala, C=Ala429. T1 justified. SA-priority correctly elevated; Tamilian 47.2% heterozygote rate confirmed in published data. ancestry_weight_set = SAS and haplotype interaction annotation are correct. | 🟢 |
| rs1801394 MTRR G T2 | Confirmed. A66G Ile22Met, G=Met22, T2 appropriate. The Asian glioma association cited (Oncotarget) is confirmed — MTRR rs1801394 G allele was a protective factor in Indian populations per the meta-analysis identified previously. T2 correct. | 🟢 |
| rs1805087 MTR G T3 | 🟡 Tier / direction flag. In v1 curation this was T2; it has been demoted to T3 here with direction_caveat = reduced_transferability. This demotion is justified — the G allele direction is context-dependent (B12-dependent remethylation effect) and SAS-specific data are limited. However, the amino acid annotation in the PDF reads "A2756G Asp919Gly" — this is a known naming convention issue: the variant is frequently described as Asp919Gly in the HGMD/literature but dbSNP annotates it as rs1805087 A>G at position 2756 in the mRNA. This is correct and consistent. T3 with reduced_transferability acceptable. | 🟡 Minor: confirm the Asp→Gly direction note is internally consistent with risk direction encoding (G allele = upregulated MTR = lower Hcy = usually beneficial, not risk). The Notes column should flag this is a directionality-ambiguous marker. |
| rs1801198 TCN2 C T2 | 🔴 Allele assignment error. The PDF states "C776G Pro259Arg; C allele alters transcobalamin II affinity." The canonical annotation of rs1801198 is c.776C>G causing Pro259Arg, where G = Arg259 is the variant allele associated with altered B12 binding. The C allele = Pro259 = reference (wild-type). The v1 curation document correctly listed G (Arg259) as the risk allele. The PDF v4 lists "C" as the risk allele in the Risk Allele column — this is reversed. The risk allele for altered TCN2 B12 binding is G (Arg259), not C. | 🔴 Fix: Risk allele column for rs1801198 TCN2 must change C → G (Arg259) |
| rs3733890 BHMT A T3 | Confirmed. The PDF annotation "R239Q" should be "G742A" at the nucleotide level causing Gln248. However multiple BHMT publications use "R239Q" as shorthand (239 in the mature protein after leader sequence cleavage). The discrepancy between R239Q and the expected codon position is a known literature convention issue — functionally they refer to the same variant. T3 and scoring_role = composite_only appropriate. | 🟡 Clarify in Notes column whether R239Q or G742A is the primary annotation used (both refer to the same physical variant; pick one and note the other as alias) |
| Row | Finding | Status |
|---|---|---|
| rs1979277 SHMT1 A T2 | 🔴 Allele assignment error. The PDF lists "A" as the risk allele. The published variant is C1420T (rs1979277), where T = Phe474 is the risk/effect allele (impairs nuclear transport, reduces thymidylate synthesis). The C allele = Leu474 = reference. Multiple publications confirm T (not A) as the variant allele. The risk allele should be T (Phe474), not A. | 🔴 Fix: Risk allele column for rs1979277 SHMT1 must change A → T (Phe474) |
| rs2236225 MTHFD1 A T1 | 🟡 Tier elevation requires justification. rs2236225 G1958A R653Q is confirmed as a real, well-studied variant. Meta-analyses exist for NTDs (PLOS One meta-analysis confirmed significant association), congenital heart disease (PMC11771109, significant maternal effect), and cancer (mixed results — PLOS One cancer meta-analysis showed overall null but subgroup effects in ALL/Asians). The document states "Nature 2006" as the citation. The landmark paper associating MTHFD1 G1958A with NTDs is Brody et al. 2002 (Am J Hum Genet) and Parle-McDermott et al. 2006 (Hum Mutat) — neither is in Nature. T1 requires a genome-wide replicated signal or a large multi-cohort meta-analysis. The evidence base is T2 (multiple replicated case-control studies + meta-analyses with inconsistent cancer results). | 🟡 (a) Verify the "Nature 2006" citation — the primary MTHFD1 G1958A paper is not a Nature paper; correct to "Parle-McDermott et al. 2006 Hum Mutat" or "Brody et al. 2002 Am J Hum Genet"; (b) Downgrade from T1 → T2 unless a T1-qualifying meta-analysis can be cited |
| rs234706 CBS A T2 | 🟡 Allele direction note. The document correctly flags "direction inverted vs. initial" in the Notes column, and the Rationale states "GG genotype slower enzyme, AA faster." This means A=faster CBS = faster Hcy → transsulfuration = lower Hcy — which is protective, not a risk allele in the conventional sense. The Notes flag this, but the Risk Allele column labelling "A" as risk is counterintuitive and should be clarified. For Rewind's context (slower CBS = more Hcy accumulation = relevant signal), GG (reference) is the unfavourable state and the scoring direction may be inverted relative to most other per-allele-sum markers. | 🟡 Clarify direction explicitly: in the per-allele-sum context, label as "A = faster CBS / lower Hcy accumulation; score as protective, not risk — invert contribution in sum" |
| rs2274976 MTHFR A T2 | Confirmed. R594Q third MTHFR SNP, A allele modulates enzyme stability. Cross-listed with RW-T07. SAS data limited noted accurately. T2 (or T3) appropriate. | 🟢 |
| rs671 ALDH2 †3 A T1 | 🔴 Amino acid annotation error. The PDF states "E487K." The correct canonical annotation confirmed by multiple sources (PMC9590465, MDPI Genes 2021) is Glu504Lys (E504K), which accounts for the 17-amino-acid mitochondrial targeting sequence cleavage. Both "E487K" (referring to the precursor protein position) and "E504K" (mature protein position) appear in the literature, but E504K is the HGVS-preferred mature protein annotation and the one used in the majority of recent publications. The document should standardise on E504K and note E487K as the precursor/historical alias. The remainder of the ALDH2 entry is correct: A allele = inactive enzyme, East Asian-specific, ancestry_weight_set = EAS, 0.25× weight for SAS. T1 is justified given the large body of East Asian GWAS data. | 🔴 Fix: E487K → E504K (Glu504Lys) as primary annotation; add "(E487K in precursor protein)" as alias note |
| Row | Finding | Status |
|---|---|---|
| rs7895833 SIRT1 G T1 | 🟡 Tier justification needed. The G allele (AG heterozygote) being over-represented in healthy elderly is supported by one cohort study (PLOS One 2015, PMID 25785999). This is a single cohort, not a meta-analysis or GWAS. By our tier definitions, T1 requires replicated meta-analysis or large GWAS. The evidence supports T2 — the same tier assigned in the original v1 curation. T1 in v4 is an upgrade that needs explicit justification. | 🟡 Downgrade rs7895833 SIRT1 from T1 → T2, or cite a qualifying meta-analysis |
| rs3758391 SIRT1 †4 T T2 | PDF renders as "[†4]" — footnote corruption. rsID rs3758391 confirmed real. T2 and scoring_role = haplotype_tag with rs7895833 are appropriate given Chinese centenarian cohort evidence only. | 🟢 |
| rs4746720 SIRT1 †4 T T3 | PDF renders as "[†4]" — footnote corruption. rsID rs4746720 confirmed. T3 and scoring_role = composite_only appropriate — only one published phenotype (atrophic body gastritis) noted. | 🟢 |
rs12356337 SIRT3 risk_allele_TBD T2 | 🔴 rsID likely corrupted in PDF. The PDF renders this as "rs1235633" (missing the terminal "7"). The correct rsID from the original curation is rs12356337. Additionally, the well-characterised SIRT3 longevity variant in the literature is a VNTR in intron 5 (Bellizzi et al. 2005, Genomics) — it is not a simple SNP and does not have a clean rsID. rs12356337 exists in dbSNP as an intronic SIRT3 variant, but its connection to the published VNTR longevity signal should be verified. If the intent is to proxy the Bellizzi VNTR signal, this should be explicitly documented. risk_allele_TBD and T2 are appropriate given this ambiguity. | 🔴 Fix typo: rs1235633 → rs12356337; add curator note clarifying whether this proxies the Bellizzi 2005 VNTR or is an independent SNP |
| rs2267531 EZH2 C T2 | Confirmed. EZH2 polycomb H3K27me3 writer, promoter variant, C allele associated with altered EZH2 expression. SAS data limited noted. T2 appropriate. | 🟢 |
rs2302428 KDM6A risk_allele_TBD T2 | Confirmed. H3K27 demethylase, sensitivity_flag = sex_dosage is correct (KDM6A is X-linked, escapes X-inactivation, creates sex-dosage differences). risk_allele_TBD honest. T2 appropriate. | 🟢 |
| Row | Finding | Status |
|---|---|---|
| rs1801260 CLOCK C T1 | Confirmed. T3111C 3'UTR, C allele = eveningness, T2D risk, colorectal cancer association. Cross-ethnic evidence present. SAS replication moderate. T1 may be slightly generous (strongest evidence is case-control, not GWAS) but is defensible given the breadth of replication. | 🟢 |
| rs7950226 BMAL1 †5 A T2 | PDF renders as "[†5]" — footnote corruption. rsID rs7950226 confirmed. G>A, A allele, South Asian GDM data emerging. direction_caveat = ancestry_or_exposure correct. T2 appropriate. | 🟢 |
| rs11022775 BMAL1 †5 C T2 | PDF renders as "[†5]" with rsID showing as "rs1102277" — possible rsID truncation. The correct rsID is rs11022775 (11 digits). The PDF appears to have dropped a digit, rendering it as rs1102277 (8 digits). Confirm the source document has the full rsID. T>C, C allele in GDM context confirmed in Japanese/Chinese data. T2 appropriate. | 🟡 Confirm rsID in source document is rs11022775 (11 digits), not rs1102277 (8 digits) |
| rs2287161 CRY1 †6 C T2 | PDF renders as "[†6]" — footnote corruption. rsID rs2287161 confirmed. Gene-diet interaction marker (C allele × high-carb intake). direction_caveat = tissue_specific and scoring_role = composite_only are appropriate. The reframe as "metabolic-circadian crosstalk" on page 2 is validated — CRY1 rs2287161 is primarily a metabolic risk modifier, not a pure chronotype marker. | 🟢 |
| rs2292912 CRY2 C T2 | Confirmed. CRY2 core clock repressor. C allele breast-cancer risk modifier. scoring_role = haplotype_tag is a demotion from the original v1 primary designation — this is reasonable given the tissue-specificity of the cancer signal. T2 correct. | 🟢 |
| rs57875989 PER3 †7 4-repeat T2 | 🔴 rsID corrupted and truncated in PDF. The PDF renders the rsID as "rs5787598" (9 digits). The correct rsID for the PER3 VNTR is rs57875989 (11 digits). Confirmed by multiple published papers (PMC4031641, PMC11700474, fibromyalgia study in North Indian women) which all use rs57875989. Additionally, the 4-repeat allele direction note requires clarification: the 4-repeat is associated with delayed sleep phase / eveningness in some studies but the 5-repeat is associated with cognitive performance degradation under sleep deprivation. The document states "Direction depends on locked phenotype" which is the correct handling for this ambiguous marker. | 🔴 Fix rsID: rs5787598 → rs57875989 |
| rs10830963 MTNR1B G T1 | 🔴 rsID corrupted in PDF. The PDF renders this as "rs1083096" (8 digits). The correct rsID is rs10830963 (11 digits). Confirmed by meta-analysis (PMID 23226241) and [Codd 2010/Nat Genet context data]. The MTNR1B rs10830963 G allele increases T2D risk and fasting glucose — this is one of the most robust T1 metabolic GWAS signals (replicated in UK Biobank, South Asian T2D GWAS). The South Asian replication is confirmed. T1 designation and "Primary MTNR1B marker" are correct. | 🔴 Fix rsID: rs1083096 → rs10830963 |
| Severity | Count | Items |
|---|---|---|
| 🔴 Errors — fix required | 6 | (1) TCN2 rs1801198: risk allele C → G; (2) SHMT1 rs1979277: risk allele A → T; (3) ALDH2 rs671: annotation E487K → E504K (primary); (4) SIRT3: rsID rs1235633 → rs12356337; (5) PER3 VNTR: rsID rs5787598 → rs57875989; (6) MTNR1B: rsID rs1083096 → rs10830963 |
| 🟡 Caveats — annotate | 6 | (1) DNMT3B rs2424913: T1 needs meta-analysis citation or downgrade to T2; (2) MTR rs1805087: clarify G allele is directionally protective (not risk); (3) BHMT rs3733890: R239Q vs G742A nomenclature — pick primary, note alias; (4) MTHFD1 rs2236225: "Nature 2006" citation is incorrect — correct to Parle-McDermott 2006 Hum Mutat or Brody 2002 AJHG, and consider T2; (5) CBS rs234706: per-allele-sum direction is inverted — document explicitly; (6) SIRT1 rs7895833: T1 needs qualifying meta-analysis or downgrade to T2 |
| 🟡 PDF rendering artefacts | 5 | Footnote superscripts corrupt rsIDs in: rs1569686 (†1), rs7341562 (†2), rs671 (†3), rs3758391/rs4746720 (†4), rs7950226/rs11022775 (†5), rs2287161 (†6), rs57875989 (†7); also BMAL1 rs11022775 may show digit truncation — verify source document |
| 🟢 Confirmed correct | 23 of 31 data rows | All biology, mechanism descriptions, scoring_role assignments, direction_caveats, and SA evidence summaries are accurate except the six errors and six caveats above |
Can you make the necessary corrections and share the final list with me please?
direction_caveat: ancestry_or_exposureper_allele_sum | Markers: 7 (2 active primaries + haplotype tag + 4 composite)| rsID | Gene | Variant | Risk / unfavourable allele | Tier | Mechanism | Indian / SA evidence | Notes |
|---|---|---|---|---|---|---|---|
| rs2424913 | DNMT3B | −149C>T (promoter) | T | T2 | T allele increases DNMT3B promoter activity → aberrant de novo methylation at tumour-suppressor loci; epigenetic-clock acceleration phenotype | Indian preterm-birth cohort PMC7063552; Indian cancer case-controls | scoring_role = primary; direction_caveat = none |
| rs1569686 | DNMT3B | −579G>T (promoter) | T | T2 | Secondary DNMT3B promoter variant; in moderate LD with rs2424913 in EUR populations; T allele alters promoter activity in same direction | Same Indian preterm-birth cohort (PMC7063552) | scoring_role = haplotype_tag with rs2424913; apply 0.5× weight when both present to avoid double-counting |
| rs1550117 | DNMT3A | −448A>G (promoter) | A (reference = reduced de novo methylation) | T2 | DNMT3A de novo methyltransferase; A allele associated with reduced DNMT3A promoter activity → global hypomethylation drift, especially at repetitive elements and imprinted loci | SAS-specific data limited; cross-ethnic transferability assumed | scoring_role = primary; direction_caveat = reduced_transferability; ancestry_weight_set = cross_ethnic |
| rs2228611 | DNMT1 | Exon (synonymous/regulatory) | A | T2 | Maintenance methyltransferase; A allele associated with altered DNMT1 expression → post-replicative methylation erosion with age | Multi-ethnic meta-analyses PMC10946547; SAS data emerging | scoring_role = primary; direction_caveat = none |
| rs7341562 | TET2 | Intron (germline tag) | risk_allele_TBD | T2 | TET2 oxidises 5mC → 5hmC → active demethylation; germline tag for TET2-driven aging biology; primary TET2 aging signal is somatic (clonal haematopoiesis) | SAS frequency data sparse; UK Biobank SAS sub-analyses available | scoring_role = composite_only; direction_caveat = tissue_specific; risk_allele to be locked after SA array MAF confirmed |
| rs7159412 | DNMT3L | Intron | risk_allele_TBD | T3 | DNMT3L co-methyltransferase stimulates DNMT3A/B de novo activity; sparse phenotype evidence | SAS-specific data limited | scoring_role = composite_only; direction_caveat = reduced_transferability |
| rs603097 | MBD2 | Promoter | risk_allele_TBD | T3 | MBD2 reads methylated CpG and recruits repressor complexes; promoter variant alters MBD2 expression; weak phenotype evidence | SAS-specific data limited | scoring_role = composite_only; direction_caveat = reduced_transferability |
per_allele_sum with MTHFR haplotype composite | Markers: 6| rsID | Gene | Variant | Risk allele | Tier | Mechanism | Indian / SA evidence | Notes |
|---|---|---|---|---|---|---|---|
| rs1801133 | MTHFR | C677T, Ala222Val | T (Val222) | T1 | Reduces MTHFR enzyme activity ~70% (TT) / ~35% (CT) → lower 5-methylTHF → impaired Hcy remethylation → elevated Hcy, lower SAM | Indian TT homozygote frequency ~2.89%; CC ~76%; North Indian meta-analysis confirms T allele MAF ~10–11% | scoring_role = primary; haplotype composite with rs1801131 required; ancestry_weight_set = SAS (lower T frequency in Indians vs EUR) |
| rs1801131 | MTHFR | A1298C, Glu429Ala | C (Ala429) | T1 | Reduces MTHFR enzyme activity ~20–40%; effect compounded with C677T; Ala429 disrupts MTHFR regulatory domain; 1298CC more common in South Indians | ★ SA-priority. Tamilian cohort: 47.2% heterozygous 1298AC, 15.3% homozygous CC; far more prevalent in Indians than 677T | ancestry_weight_set = SAS; haplotype composite with rs1801133; compound 677T+1298C reduces activity ~50-60% — encode interaction term |
| rs1801394 | MTRR | A66G, Ile22Met | G (Met22) | T2 | Methionine synthase reductase regenerates active B12-bound MS; G allele reduces MTRR activity → MS inactivation under low B12 → Hcy accumulation | Asian glioma meta-analysis (Oncotarget): MTRR G allele protective in Indian sub-population; SAS frequency AA 38%, AG 26%, GG ~36% | scoring_role = primary; direction_caveat = ancestry_or_exposure (B12-conditional effect) |
| rs1805087 | MTR | A2756G, Asp919Gly | G (Gly919) — directionally protective | T3 | MS remethylates Hcy using 5-methylTHF + B12; G allele upregulates MTR → lower Hcy when B12 is adequate; when B12-deficient, effect is absent. G is not a conventional risk allele — lower Hcy is beneficial; encode as protective direction | SAS: AA 42%, AG 47%, GG 11%; high AG heterozygote frequency in South Asians | direction_caveat = ancestry_or_exposure; scoring_role = composite_only; direction = protective when B12 replete — invert contribution in per-allele risk sum, or exclude from sum and use as interaction modifier |
| rs1801198 | TCN2 | C776G, Pro259Arg | G (Arg259) (corrected from v4) | T2 | Transcobalamin II transports B12 from gut to tissues; G (Arg259) allele reduces TCN2 B12-binding affinity → lower cellular B12 uptake even when serum B12 is normal; critical in vegetarian SA populations | Cross-ethnic evidence; SAS B12 deficiency prevalence makes this the highest-actionability marker in T07 for India | scoring_role = haplotype_tag with B12 status; ★★ highest SA clinical relevance — vegetarian Indians with G allele are at compounded B12 risk |
| rs3733890 | BHMT | G742A (Gln248 / R239Q in mature protein) | A (Gln248) | T3 | Betaine-homocysteine methyltransferase; liver-only backup remethylation; A allele upregulates BHMT → compensates for MTHFR deficiency when betaine/choline adequate; complex diet–gene interaction | SAS-specific data sparse; betaine-poor Indian diets reduce effectiveness of A allele compensation | scoring_role = composite_only; direction_caveat = ancestry_or_exposure; alias: R239Q (mature protein) = G742A (nucleotide) — same variant |
per_allele_sum | Markers: 5 active + 1 EAS-weighted| rsID | Gene | Variant | Risk allele | Tier | Mechanism | Indian / SA evidence | Notes |
|---|---|---|---|---|---|---|---|
| rs1979277 | SHMT1 | C1420T, Leu474Phe | T (Phe474) (corrected from v4) | T2 | SHMT1 serine hydroxymethyltransferase supplies one-carbon units; T allele impairs SHMT1 nuclear transport (not catalysis) → cytoplasmic accumulation → reduced thymidylate synthesis + impaired Hcy remethylation; interacts with MTHFR C677T | Cross-ethnic; SAS T allele MAF ~0.31 (comparable to EUR); SA-specific functional data limited | scoring_role = primary; T×MTHFR677T interaction term recommended |
| rs2236225 | MTHFD1 | G1958A, Arg653Gln | A (Gln653) | T2 (corrected from T1) | MTHFD1 trifunctional enzyme channels one-carbon units; A allele reduces enzyme stability → impaired methylation and nucleotide synthesis; NTD meta-analysis significant (PLOS One); congenital heart disease maternal risk meta-analysis (PMC11771109) | Moderate SAS replication; India NTD data relevant; Parle-McDermott et al. 2006 Hum Mutat is primary citation | scoring_role = primary; direction_caveat = none; cite: Parle-McDermott 2006 Hum Mutat, not "Nature 2006" |
| rs234706 | CBS | C699T (synonymous; LD with regulatory variants) | A (faster CBS) — direction inverted | T2 | CBS commits Hcy to transsulfuration → cysteine; A allele = faster CBS = lower Hcy accumulation = protective direction; GG = slower CBS = risk (more Hcy); B6-dependent; B6 deficiency in Indian populations amplifies GG risk | Cross-ethnic; CBS rs234706 studied in Indian cardiovascular and NTD cohorts | scoring_role = primary; per-allele sum contribution is inverted: each A allele contributes negatively (protective) to the risk sum; or recode GG=2/GA=1/AA=0 for risk direction ★ |
| rs2274976 | MTHFR | G1793A, Arg594Gln | A (Gln594) | T2 | Third functional MTHFR SNP; A allele further reduces enzyme activity, especially in compound heterozygotes with C677T or A1298C; rarely clinically significant alone | SAS-specific data limited | scoring_role = composite_only; cross-listed with RW-T07 |
| rs671 | ALDH2 | G>A, Glu504Lys (E504K) (corrected from E487K) | A (Lys504) | T1 | ALDH2 detoxifies acetaldehyde and one-carbon metabolism by-product formaldehyde; A allele = inactive enzyme → acetaldehyde + formaldehyde accumulation; dominant-negative effect in heterozygotes | East Asian-specific: Korean AA ~2.5%, GA ~26.6%; SAS frequency ~1–2% (near-absent) | ancestry_weight_set = EAS; 0.1× weight for SAS scoring; retain at T1 for global panel completeness; alias: E487K = precursor protein position; E504K = mature protein (HGVS preferred) |
per_allele_sum | Markers: 6| rsID | Gene | Variant | Direction allele | Tier | Mechanism | Indian / SA evidence | Notes |
|---|---|---|---|---|---|---|---|
| rs7895833 | SIRT1 | A>G, promoter (−309) | G (higher SIRT1 expression) | T2 (corrected from T1) | SIRT1 class III HDAC; deacetylates H1, H3K9, H4K16; co-ordinates DNA repair, NF-κB silencing, FOXO3 activation; G (AG heterozygote) over-represented in healthy elderly (PLOS One 2015, PMID 25785999); interacts with FOXO3A rs2802292 in longevity network | PMID 25785999 (Fayoum review confirms biology); SAS-specific data limited | scoring_role = primary; T2 — single cohort evidence, not meta-analysis ★ |
| rs3758391 | SIRT1 | C>T, promoter | T (healthy ageing / T2D-protective) | T2 | T allele protective against T2DM and oxidative stress; Chinese Han centenarian cohort; mRNA stability effect | PMC11518819, PMC11839489; SAS data emerging | scoring_role = haplotype_tag with rs7895833; direction_caveat = ancestry_or_exposure |
| rs4746720 | SIRT1 | Intron / 3'UTR | T (healthy ageing allele) | T3 | One published phenotype (atrophic body gastritis); mRNA stability effect; weakest of the three SIRT1 markers | Nature Sci Rep 2017; SAS data absent | scoring_role = composite_only; direction_caveat = reduced_transferability |
| rs12356337 | SIRT3 | Intron 5 region (corrected from rs1235633) | risk_allele_TBD | T2 | SIRT3 primary mitochondrial NAD⁺-dependent deacetylase; modulates SOD2, complex I–V subunits, LCAD; intronic variant proxying the published SIRT3 longevity signal (Bellizzi et al. 2005 Genomics; TRELONG cohort); confirm whether this SNP tags the Bellizzi VNTR or is an independent signal | SAS-specific data limited | scoring_role = primary; risk_allele_TBD pending curator confirmation that rs12356337 tags the Bellizzi 2005 longevity-associated VNTR enhancer |
| rs2267531 | EZH2 | Intron / regulatory | C (higher EZH2 activity) | T2 | EZH2 PRC2 catalytic subunit; trimethylates H3K27me3 → gene silencing; C allele → elevated EZH2 in ageing tissues → aberrant silencing of tumour-suppressor and metabolic genes | SAS-specific data limited; EZH2 dysregulation documented in Asian cancer studies | scoring_role = primary; direction_caveat = tissue_specific |
| rs2302428 | KDM6A | Exon / splice region | risk_allele_TBD | T2 | KDM6A (UTX) demethylates H3K27me3 — opposing enzyme to EZH2; reduced KDM6A → persistence of repressive marks → impaired chromatin reprogramming in ageing | SAS-specific data limited | sensitivity_flag = sex_dosage (KDM6A is X-linked, escapes X-inactivation); scoring_role = composite_only; risk_allele_TBD |
per_allele_sum | Markers: 7| rsID | Gene | Variant | Risk / effect allele | Tier | Mechanism | Indian / SA evidence | Notes |
|---|---|---|---|---|---|---|---|
| rs1801260 | CLOCK | 3111T>C (3'UTR) | C (eveningness / metabolic risk) | T1 | CLOCK-BMAL1 heterodimer drives positive arm of circadian loop; C allele → altered mRNA stability → evening preference, delayed sleep phase, T2D risk in non-overweight subjects (Chinese cohort), colorectal cancer risk (OR 1.78); 3'UTR effect on circadian amplitude | Cross-ethnic; SAS replication moderate; present on 23andMe v4/v5 and AncestryDNA arrays | scoring_role = primary; direction_caveat = ancestry_or_exposure |
| rs7950226 | BMAL1 | G>A (intron) | A (T2D / GDM risk) | T2 | BMAL1 is second component of master circadian heterodimer; A allele associated with GDM risk in pregnancy cohorts, reduced BMAL1 expression, disrupted circadian amplitude | South Asian GDM data emerging (Japanese and Chinese T2D cohorts as proxy) | scoring_role = primary; direction_caveat = ancestry_or_exposure |
| rs11022775 | BMAL1 | T>C (intron) | C (T2D / metabolic risk) | T2 | C allele frequency strongly increased in GDM women in Japanese/Chinese cohorts; disrupts circadian-metabolic crosstalk | South Asian GDM data emerging; confirm rsID has 11 digits in source document | scoring_role = primary; direction_caveat = ancestry_or_exposure; verify no digit truncation in array manifest |
| rs2287161 | CRY1 | C>G (3'UTR) | C (gene-diet interaction risk) | T2 | CRY1 core clock repressor; C allele × high-carbohydrate intake → increased T2D / glycaemic risk; 3'UTR effect on CRY1 mRNA stability | SAS-specific data limited; most relevant in Indian dietary context (high carbohydrate staple diet) | scoring_role = composite_only; direction_caveat = tissue_specific; reframe as metabolic-circadian crosstalk marker, not pure phase marker |
| rs2292912 | CRY2 | C>T (intron) | C (metabolic / breast cancer risk modifier) | T2 | CRY2 repressor; C allele associated with T2D and fasting glucose in GWAS; breast-cancer risk modifier in case-control studies | SAS-specific data limited; included in multiethnic GWAS with some South Asian participants | scoring_role = haplotype_tag; direction_caveat = tissue_specific |
| rs57875989 | PER3 | VNTR 4/5-repeat (corrected from rs5787598) | 4-repeat (eveningness / delayed sleep phase) | T2 | 54-bp tandem repeat in PER3 exon 18; 5/5 homozygotes show 25-min longer circadian period and cognitive degradation under sleep deprivation; 4/4 associated with delayed chronotype and DSPD; breast cancer signal in young women (Caucasian); fibromyalgia susceptibility in North Indian women (PMC11700474) | Indian FMS cohort (North Indian women, PMC11700474) uses rs57875989; direction population-variable | direction_caveat = ancestry_or_exposure; VNTR — confirm array coverage before encoding; may require custom PCR assay; scoring phenotype axis (phase vs cancer) must be locked |
| rs10830963 | MTNR1B | C>G (intron) (corrected from rs1083096) | G (impaired melatonin → T2D risk) | T1 | Melatonin receptor 1B; G allele impairs beta-cell insulin secretion during fasting via tonic melatonin inhibition; largest circadian SNP effect on T2D/fasting glucose in GWAS; directly actionable for meal-timing intervention | ★★ South Asian T2D GWAS replication confirmed (PMID 23226241 meta-analysis); G allele frequency SAS ~0.27; late eating + G allele = amplified hyperglycaemia risk | scoring_role = primary; trigger meal-timing intervention in Rewind when G allele present; direction_caveat = none |
| Trait | Active primaries | Haplotype tags | Composite-only | T1 | T2 | T3 | SA ★ |
|---|---|---|---|---|---|---|---|
| RW-T06 Epigenetic maintenance | 3 | 1 | 3 | 0 | 4 | 2 | rs2424913 DNMT3B |
| RW-T07 Methylation cofactor | 4 | 1 | 1 | 2 | 3 | 1 | rs1801131 MTHFR; rs1801198 TCN2 |
| RW-T08 One-carbon flux | 4 | 0 | 1 | 1 | 4 | 0 | rs1979277 SHMT1; rs234706 CBS |
| RW-T09 Chromatin regulation | 3 | 1 | 2 | 0 | 4 | 2 | rs7895833 SIRT1 |
| RW-T10 Circadian rhythm | 5 | 1 | 1 | 2 | 5 | 0 | rs10830963 MTNR1B; rs2292912 CRY2 |
| # | Item | Action required |
|---|---|---|
| 1 | rs12356337 SIRT3 | Confirm this SNP tags the Bellizzi 2005 VNTR longevity signal (intron 5 enhancer) or is an independent signal; lock risk_allele_TBD |
| 2 | rs7341562 TET2 | Lock risk_allele_TBD after confirming SAS array MAF |
| 3 | rs7159412 DNMT3L + rs603097 MBD2 | Lock risk_allele_TBD for both |
| 4 | rs57875989 PER3 VNTR | Confirm genotyping array covers 4/5-repeat VNTR; if not, arrange custom assay; lock phenotype axis (chronotype vs cancer) |
| 5 | rs11022775 BMAL1 | Confirm full 11-digit rsID in array manifest (guard against rs1102277 truncation) |