Protein Complex Prediction with AlphaFold-Multimer

Type: paper Slug: protein-complex-prediction-with-alphafold-multimer—hassabis Sources: protein-complex-prediction-with-alphafold-multimer—hassabis Last updated: 2026-05-13


Summary

Evans, O’Neill, Pritzel, Antropova, Senior, Green, Žídek, Bates, Blackwell, Yim, Ronneberger, Bodenstein, Zielinski, Bridgland, Potapenko, Cowie, Tunyasuvunakool, Jain, Clancy, Kohli, Jumper, and Hassabis (2021) introduced AlphaFold-Multimer, an extension of AlphaFold2 that predicts the structures of protein complexes (multi-chain assemblies). The system modifies the AlphaFold2 architecture to handle multiple protein chains, predicting both intra-chain and inter-chain contacts to produce full complex structures. This addressed a major limitation of the original AlphaFold2.

Core content

Problem: AlphaFold2 predicts single-chain protein structures with high accuracy but cannot natively predict how multiple proteins assemble into complexes. Most biological functions involve protein complexes, not isolated chains (paper—protein-complex-prediction-with-alphafold-multimer §Introduction).

Key modifications to AlphaFold2:

  • Input representation: Extended to handle multiple sequences (one per chain) with explicit chain identifiers in the MSA and pair representations (paper—protein-complex-prediction-with-alphafold-multimer §Methods).
  • Pair representation: Includes inter-chain residue pairs (not just intra-chain), allowing the model to predict chain-chain interactions (paper—protein-complex-prediction-with-alphafold-multimer §Methods).
  • Training data: Curated set of experimentally determined protein complex structures from the PDB (paper—protein-complex-prediction-with-alphafold-multimer §Methods).
  • Assembly module: Predicts stoichiometry and chain ordering for heteromeric complexes (paper—protein-complex-prediction-with-alphafold-multimer §Methods).

Results:

  • Achieved higher accuracy than dedicated docking methods on heterodimer benchmarks (paper—protein-complex-prediction-with-alphafold-multimer §Results).
  • Successfully predicted structures of complexes with up to ~10 chains (paper—protein-complex-prediction-with-alphafold-multimer §Results).
  • Performance was strongest for complexes where good MSAs were available; degraded for complexes with few homologs (paper—protein-complex-prediction-with-alphafold-multimer §Results).

Connections- Theme: theme—protein-folding, structural-biology

  • Project: AlphaFold-Multimer
  • Collaborators: Richard Evans (co-first), Michael O’Neill (co-first), Alexander Pritzel (co-first), Natasha Antropova (co-first), John Jumper (co-first), Andrew Senior, Tim Green, Pushmeet Kohli
  • Era: alphafold-era
  • Venue: venue—bioRxiv (preprint, not peer-reviewed)
  • Extends: paper—highly-accurate-protein-structure-prediction-with---hassabis — AlphaFold-Multimer builds on AlphaFold2

Honest Gaps

  • Metadata lists only 4 co-authors; the actual paper has ~22 authors.
  • This is a bioRxiv preprint — it has not undergone formal peer review.
  • Performance on large complexes (>5 chains) and transient interactions is limited.
  • The system cannot predict complexes involving nucleic acids, lipids, or small molecules.
  • Assembly prediction (which chains form a complex and in what stoichiometry) remains a challenge — the system requires the input chains to be specified.
  • Like AlphaFold2, performance depends heavily on MSA depth — complexes with few evolutionary homologs are poorly predicted.