Skip to content

What is a polygenic score? A comprehensive guide to understanding your genetic risk

5 min read

According to research published in Nature Reviews Genetics, the majority of genome-wide association studies (GWAS) have focused on European-ancestry populations, limiting the generalizability of some genetic findings. Understanding these nuances is key to grasping what is a polygenic score, a metric that aggregates genetic information to inform health insights.

Quick Summary

A polygenic score (PGS) is a quantitative measure that aggregates the combined effects of thousands or millions of genetic variants to estimate an individual's relative genetic predisposition for a specific trait or disease within a population.

Key Points

  • Aggregated Genetic Risk: A polygenic score combines the effects of many genetic variants (SNPs) across the genome into a single number to estimate an individual's genetic predisposition for a trait.

  • Relative, Not Absolute: The score provides a measure of relative risk compared to others in a specific reference population, not a guaranteed outcome.

  • Derived from GWAS: The calculation relies on data from large-scale Genome-Wide Association Studies (GWAS) to determine the weight and impact of different genetic variants.

  • Probabilistic, Not Deterministic: A high score indicates an elevated genetic likelihood but does not mean an individual will definitely develop the trait or disease; lifestyle and environment are also crucial factors.

  • Not for Monogenic Conditions: PGS are for complex traits and common diseases influenced by multiple genes, unlike monogenic conditions caused by a single, high-impact mutation.

  • Applications in Health: Can be used to personalize disease screening, aid diagnosis, inform clinical trials, and advance research into disease etiology.

In This Article

What Exactly Is a Polygenic Score?

At its core, a polygenic score (PGS) is a summary statistic that distills complex genetic information into a single number. For most traits and common diseases, a person’s risk isn't determined by a single gene mutation but by the cumulative effect of many small genetic variations, known as single-nucleotide polymorphisms (SNPs), spread across their genome. A PGS is calculated by summing up the effects of these numerous variants, with each variant weighted by how strongly it's associated with a particular trait. The result is a personalized score that places an individual on a genetic risk continuum relative to other people in a given population. When applied to health, this is often called a polygenic risk score (PRS).

How Is a Polygenic Score Calculated?

The calculation of a PGS is a sophisticated process that leverages a massive amount of genetic data. It primarily relies on data from genome-wide association studies (GWAS), which compare the genetic variants of individuals with a particular trait or disease to those without it. This allows researchers to identify specific SNPs and estimate their effect sizes—how much each variant influences the trait.

The process generally follows these steps:

  1. Data Collection: Large-scale biobanks collect genetic and health data from hundreds of thousands of individuals, such as the UK Biobank.
  2. GWAS Analysis: Researchers perform GWAS to identify and weigh the effect of millions of SNPs across the entire genome for a specific trait, such as body mass index (BMI) or risk of heart disease.
  3. Weighted Summation: An individual's DNA is then analyzed for these specific variants. For each variant, the number of risk-associated alleles they carry is multiplied by the variant's effect size (weight). These weighted values are then summed to produce the final polygenic score.
  4. Reference Population Comparison: The score is interpreted in the context of the reference population used to calculate it, often displayed as a percentile. A score in the 90th percentile, for example, means an individual's genetic predisposition is higher than 90% of that reference population.

Understanding Polygenic vs. Monogenic Traits

It's crucial to distinguish between conditions driven by many genes and those caused by a single gene mutation. This table clarifies the key differences:

Feature Monogenic Trait/Condition Polygenic Trait/Condition (PGS)
Genetic Basis Single gene mutation, often with high penetrance (e.g., Huntington's Disease, cystic fibrosis). Many genetic variants (SNPs) across the genome, each with a small effect.
Prediction Genetic testing is often highly predictive and can confirm a diagnosis. Offers an estimate of relative risk; it's a probabilistic tool, not a diagnosis.
Effect of Environment Minimal environmental influence on the presence of the trait, though environment can impact disease severity. Significantly influenced by environmental factors (lifestyle, diet, etc.), making risk modification possible.
Risk Profile Often results in clear, predictable risk inheritance patterns within families. Generates a risk profile that is distributed across the general population in a bell-curve fashion.

What a Polygenic Score Can and Cannot Tell You

A PGS is a powerful but nuanced tool. It can provide valuable information about your genetic likelihood for certain health conditions, helping to inform conversations with your doctor about preventative strategies and targeted screening. For example, a high PRS for coronary artery disease might encourage earlier or more frequent monitoring.

However, it's vital to remember that a PGS does not determine your destiny. It's a statistical estimate of genetic risk, not an absolute predictor. Your lifestyle, environment, and other clinical factors play an equally important, if not greater, role in your overall health outcomes. A person with a high genetic risk for a condition may never develop it due to a healthy lifestyle, while someone with a low score might develop it due to other factors.

The Clinical Potential of Polygenic Scores

The potential applications of PGS in clinical practice and research are significant:

  1. Personalized Screening Recommendations: High PRS for conditions like breast cancer or heart disease could trigger personalized screening protocols, such as starting mammograms or cholesterol checks earlier.
  2. Refined Diagnoses: In cases where symptoms overlap, a PGS could help differentiate between conditions. This has been explored for distinguishing Type 1 from Type 2 diabetes and for certain psychiatric diseases.
  3. Improved Clinical Trials: Clinical trials could use PGS to identify individuals most likely to benefit from a particular drug, potentially increasing treatment efficacy.
  4. Population Health Research: Scientists use PGS to understand the genetic architecture of diseases, identify shared genetic pathways between different conditions, and study gene-environment interactions.

Challenges and Limitations

Despite its promise, the clinical use of polygenic scores faces several hurdles. A major concern is the existing bias in the genetic data used to create the scores. Historically, most GWAS data has come from populations of European ancestry, meaning PGS may be less accurate and reliable for individuals of non-European descent. This raises critical issues of health equity and necessitates efforts to build more diverse genetic databases. Furthermore, there is the risk of misinterpretation, both by consumers and some healthcare providers, who may view the score as deterministic rather than probabilistic. Clear communication and genetic counseling are essential to ensure the information is used responsibly.

The Future of Genomics and Health

The field of genomics is advancing rapidly, and with it, the potential for polygenic scores to provide more precise and actionable health insights. Ongoing research focuses on incorporating more diverse populations and developing more sophisticated algorithms that can account for different genetic architectures. As these technologies mature, they will increasingly be integrated with other forms of medical data, such as electronic health records and lifestyle information, to create a more holistic view of an individual's health. The goal is to move beyond simply predicting risk to actively informing and improving health outcomes through personalized prevention and treatment strategies. For more technical information, the National Institutes of Health offers a comprehensive guide on polygenic scores.

Conclusion

In summary, a polygenic score represents a significant step forward in our ability to use genomic information for predictive health. By aggregating the effects of millions of small genetic variations, it offers a relative measure of an individual's genetic predisposition for complex traits and diseases. However, its probabilistic nature and the crucial influence of environmental factors mean it should be viewed as one piece of a much larger health puzzle. As research progresses and addresses current limitations, particularly regarding population diversity, polygenic scores will become an even more powerful tool for empowering individuals and clinicians in the pursuit of personalized preventative care.

Frequently Asked Questions

The accuracy of a polygenic score varies depending on the trait or disease and the population being studied. While predictive power has increased significantly, it is not a perfect predictor and is often less accurate in populations with different genetic ancestry than the study's reference group.

No, your polygenic score is based on the genetic code you are born with and cannot be changed. However, your behavior and lifestyle choices can significantly influence whether a genetic risk is realized.

Polygenic score reports are offered by some direct-to-consumer genetic testing companies. However, they are not yet a standard part of clinical care in most health systems, and accessibility may depend on the specific condition and a person's ancestry.

While both use genetic data, ancestry predictions use DNA markers to trace lineage and geographic origins. Polygenic scores use a different set of variants and statistical models to predict a specific trait or disease risk, rather than ancestry.

The terms are often used interchangeably. 'Polygenic score' (PGS) is a general term for any polygenic trait, while 'polygenic risk score' (PRS) specifically refers to the assessment of risk for a disease or medical condition.

Polygenic scores are primarily designed for common, complex diseases that are influenced by many genes, each with a small effect. Rare, monogenic diseases caused by a single gene variant are typically identified through other forms of genetic testing.

A primary limitation is the ancestry bias in the data used for their creation. The majority of research has been on individuals of European descent, leading to reduced accuracy and utility of scores in non-European populations and contributing to healthcare disparities.

References

  1. 1
  2. 2
  3. 3
  4. 4
  5. 5

Medical Disclaimer

This content is for informational purposes only and should not replace professional medical advice.