An artificial intelligence group says its program has predicted the structure of almost every protein known to science – effectively solving one of biology’s “grand challenges” and paving the way for new discoveries and technologies in fields as diverse as medicine, food safety and climate science.
DeepMind, an AI firm owned by Google’s parent company Alphabet, announced Thursday that it has expanded the open online database for its AlphaFold program to include more than 200 million protein structures.
The vast catalog now encompasses “the entire protein universe,” with structures drawn from the sequenced genomes of nearly every organism on the planet, DeepMind CEO Demis Hassabis said in a news briefing.
Proteins are long, complex chains of amino acids, and they are among the building blocks of life. Scientists have long sought to unravel how these chains twist and fold into 3D shapes, because a protein’s structure offers valuable insight into its function. For example, knowing a protein’s specific shape and how it interacts with other molecules can help researchers narrow down potential targets for medical treatments.
AlphaFold’s upgraded database includes protein structures for plants, bacteria, animals and other organisms, according to DeepMind.
These updates provide “new opportunities for researchers to use AlphaFold to advance work on important issues, including sustainability, food insecurity and neglected diseases,” Hassabis wrote in a blog post published Thursday about the milestone.
“By demonstrating that AI could accurately predict the shape of a protein down to atomic precision, at scale and in minutes, AlphaFold not only provided a solution to a 50-year grand challenge, it also became the first major proof of our fundamental thesis: that artificial intelligence can dramatically accelerate scientific discovery, and in turn advance humanity,” he wrote.
The current version of AlphaFold was introduced in 2020, and last year DeepMind wowed the scientific community by unveiling a catalog of structures that included virtually every protein in the human body. That catalog, the AlphaFold Protein Structure Database, was built in collaboration with the European Molecular Biology Laboratory and included hundreds of thousands of newly predicted protein structures.
The wealth of information is already being used by researchers around the world to study topics ranging from antibiotic resistance to plastic pollution, according to Hassabis.
For example, researchers at the University of Portsmouth in the UK announced in July 2021 that they were using the database to help engineer enzymes for recycling certain types of single-use plastics.
“AlphaFold provides us with an exciting new library of templates to engineer faster, more stable and cheaper enzymes for plastic recycling,” John McGeehan, director of the University of Portsmouth’s Centre for Enzyme Innovation, said in a statement at the time.
Hassabis said DeepMind is working to further expand its database, with particular emphasis on applications related to drug development, basic biology research, climate science, quantum chemistry and fusion.
“AlphaFold is a glimpse of the future,” he wrote, “and what might be possible with computational and AI methods applied to biology.”