Comprehensive study using bioinformatics predicts the molecular causes of many genetic diseases
Research spearheaded at Buck Institute results in a web-based tool available to other scientists
"We now have a quantitative model of function using bioinformatic methods that can predict things like the stability of the protein and how its stability is disrupted when a mutation occurs," said Buck Institute faculty member Sean Mooney, PhD, who led the research team. "Traditionally people have used a very time consuming process based on evolutionary information about protein structure to predict molecular activity," Mooney said, "I think we're the first group to really quantitatively describe the universe of molecular functions that cause human genetic disease."
The research was done in the contexts of inherited single gene diseases, complex diseases such as cardiovascular and developmental disorders and mutations in cancerous tumors. The study focused on amino acid substitutions (AAS), which are genetically driven changes in proteins that can give rise to disease, and utilized a series of complex mathematical algorithms to predict activity stemming from the mutations.
As a first step, researchers used available databases of known sites of protein function and built mathematical algorithms to predict new sites of protein function said Mooney. They then applied the algorithms to proteins that have disease-associated mutations assigned to them and looked for statistical co-occurrences of mutations that fell in or near those functional sites. Because the computer algorithms are imperfect, researchers compared that information against a data set of neutral AAS, ones that don't cause human disease, said Mooney. "We looked for statistical differences between the percentage of mutations that fell into the same functional site from both non-disease and disease-associated AAS and looked to see if there was a statistically significant enrichment or depletion of protein activity based on the type of AAS . That data was used to hypothesize the molecular mechanism of genetic disease," said Mooney.
Mooney says 40,000 AAS were analyzed which represents one of the most comprehensive studies of mutations. Describing the results, he used the analogy of a car as a protein - a big molecular machine. "We are predicting how this machine will break down," said Mooney. "We've known the car isn't working properly because it has some defect; now we can hypothesize that the symptom stems from a broken water pump."
Mooney identified three different areas of research that could be furthered by use of the tool. Scientists who manage databases of clinically observed mutations for research purposes could develop hypotheses about what those mutations are causing on a molecular level; they may also be able to use the tool to correlate molecular activity to the clinical severity or subtype of a disease. Mooney says cancer researchers re-sequencing tumors could use the tool to identify mutations that drive the progression of the malignancy. He also expects non-clinical researchers who work with mutations in proteins to use the tool to gain insight into what is causing the mutations. "We are happy to collaborate with scientists, to share data and help them better identify hypotheses about the specific mutations they might be interested in," said Mooney.
Topics
Organizations
Related link
Other news from the department research and development
Get the life science industry in your inbox
From now on, don't miss a thing: Our newsletter for biotechnology, pharma and life sciences brings you up to date every Tuesday and Thursday. The latest industry news, product highlights and innovations - compact and easy to understand in your inbox. Researched by us so you don't have to.