Skip to content

A program that compares a protein structure prediction to a solved structure and evaluates the prediction's accuracy using RMSD.

Notifications You must be signed in to change notification settings

rsrchen/ecs129-project

Repository files navigation

ECS 129 Winter Quarter 2022 Project - Protein Structure Comparison Program

What

This program takes a set of multiple protein structures, where each protein structure is given by a text file containing the (x, y, z) coordinates of the protein's alpha carbons, and compares all of the structures to one another by calculating the root-mean-square deviation, or RMSD, between pairs of structures. Pretty cool.

More what

The program functions via CLI. It takes five protein structure predictions (i.e. from AlphaFold) and one experimentally determined structure (i.e. as empirically derived from crystallography or cryo-EM). The program also shows you a heatmap generated from the RMSDs of all comparisons of all protein structures and serves you a bar graph giving you a closer look at how the protein structure predictions stack up against one another when compared to the experimentally-determined structure.

How

We use the quaternion-based RMSD algorithm provided by Coutsias, Seok, and Dill in this paper. We also graciously use the open-source community packages numpy and matplotlib. Thanks. We love you.

Why

The science around protein structure prediction is, and has for decades been, absolutely booming. People around the globe are working on computational methods for protein structure prediction day in and day out. You've heard about CASP, the major biannual research competition dedicated to 3D protein structure work. You've heard about AlphaFold, the major recent breakthrough in structure prediction. Everyone's talking about this stuff. It's changing the future. The point is—it's a huge area of research, and it's incredibly interesting. That's why we chose to focus on it for our ECS 129 project and write a program that evaluates just how good a protein structure prediction really is by comparing it to the empirically-determined true structure.

Who

  • Raymond Chen - programming, debugging, refactoring, report writing, writing this readme
  • Emily Cheng - programming, report writing
  • Daniel Cardenas - programming, report writing

Report

You may have noticed credit for "report writing" in the previous section. We wrote a report on this program for ECS 129 - Computational Structural Bioinformatics. Took the class winter quarter 2022 with Professor Patrice Koehl, who I think is a great person and a very passionate instructor. Here's a link to the original report. This report was written when the project was in a different state than it is now, before I merged my own branch containing my own individual work with the master branch, so some stuff (such as the presence of the CLI) might be different.

About

A program that compares a protein structure prediction to a solved structure and evaluates the prediction's accuracy using RMSD.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages