Science

A brief introduction to AlphaFold

A beginner's introduction to AlphaFold, the AI from DeepMind that predicts protein structure. The first article in a longer series about bioinformatics.

FELIX

22 Oct 2021 — 2 min read

In July earlier this year, the code, methodology, and database behind AlphaFold, the protein prediction AI software developed by DeepMind, was made open source through the publication of two articles in Nature.

AlphaFold is a major advancement in the quest to predict a protein’s structure from its sequence alone. In nature, proteins reliably fold into precise 3D conformations that is critical for its function based on nothing more than the sequence of amino acids that it is composed of. In fact, mutations in proteins that lead to misfolding are often associated with disease states, for example, Alzheimer’s and Parkinson’s. However, we have not been able to understand this folding process nor predict the 3D shape of a protein based on its sequence alone.

Although we have currently found sequences for millions of proteins, we have only solved the structures of about 180,000 of them. Structural biology techniques have been developed to solve structures experimentally: x-ray crystallography, nuclear magnetic resonance, and cryo-electron microscopy. These methods involve large amounts of trial and error and have been limited in the complexity of proteins they can be applied to. Outside the lab, computational methods have been developed to predict how a protein may fold based on its sequence to bypass the experimental resources. However, these traditionally relied on using templates from experimentally-solved structures, which then imposes the same limits on the range of proteins they work best for.

AlphaFold 2, which uses deep learning algorithms to predict structure to atomic accuracy (within 1 Å or 0.1 nm of error), has been the most successful computational approach so far. In brief, AlphaFold operates with three main parts. The first involves constructing an initial model for which amino acids may be in contact with each other in the folded protein. Second, it uses a machine learning method called attention to interpret which parts of the model are informative, it takes the informative parts of the model to reconstruct an improved model for amino acid contacts, and the improved model is reinterpreted. This process occurs iteratively for a number of cycles, then the final improved model is fed through the third part which produces the 3D model of the protein. The software will feed the predicted 3D structure back into the second step, and this loop occurs several times for the model to be refined.

The final output of AlphaFold is a file containing the 3D coordinates for every non-hydrogen atom in the protein. It also outputs a graph showing the confidence levels for every amino acid residue, which allows users to assess the reliability of the predicted structure.

AlphaFold is an outstanding contribution to the field of bioinformatics. In the most recent blind assessment of structure prediction software (the CASP14 initiative), it significantly outperformed competing approaches. It is considered to be the closest we’ve gotten to solving the structure prediction problem.

Science

Issue 1778

From Issue 1778

8th Oct 2021

Discover stories from this section and more in the list of contents

Explore the edition

Against rationality

Science writer Leo Zhang explores the 1975 science philosophy book, “Against Method”, by Paul Feyerabend

Mosquito Close Up With Shades Of Yellow Background 2021 09 01 03 47 24 Utc

This week in Science (21-10-15)

A weekly summary of interesting headlines in Science and Technology

The 2021 Nobel Prize in Physics

Felix Science covers the achievements that won scientists the Nobel Prize this year.

The 2021 Nobel Prize in Chemistry

Felix Science covers the achievements that won scientists the Nobel Prize this year.

Environment

Exposed: Imperial’s FFI partners don’t care about a green transition

Felix investigates whether the seven fossil fuel companies selected during the first round of the Imperial Zero Index assessment really have a “strong strategic intent to decarbonise.”

Catnip

The Felix Dating Guide, part 1: from LateX to Love

In between course work, lectures and the occasional crisis, dating at Imperial can feel impossible. However, do not fear, we at Felix (naturally) are here to help in this problem that can’t quite be solved by your graphing calculator. We have compiled a guide below, with more 5 ⭐ reviews

Books

Books for when...

Valentine's themed recommendations

Books

L’Écume des jours

My admission that Boris Vian’s L’Écume des jours (Froth on the Daydream) is my favourite book has often raised eyebrows. This 1947 novel is a classic mandatory back-to school read for French pupils, but is rarely considered a part of the adult canon. There is, indeed, something very

Science

From Issue 1778

Read more

Against rationality

This week in Science (21-10-15)

The 2021 Nobel Prize in Physics

The 2021 Nobel Prize in Chemistry

Exposed: Imperial’s FFI partners don’t care about a green transition

The Felix Dating Guide, part 1: from LateX to Love

Books for when...

L’Écume des jours