Answer any of the following questions by Shuguang Zhang:
Since 1 gr is around 6.022e+23 Daltons. Assuming a piece of meat has no fat and only protein will mean such piece of meat will have on average 3.011e+24 amino acids
Humans do not become a cow or a fish because when proteins from food are digested, enzymes in the stomach and small intestine break them down into individual amino acids. These amino acids are universal building blocks shared by all living organisms. The human body then reassembles them into new proteins according to human genetic instructions, rather than those of the cow or fish
The set of 20 standard amino acids is largely determined by the constraints of genetic encoding and evolutionary stability. The genetic code is optimized to efficiently translate triplet codons into a limited but diverse set of amino acids that provide chemical functionality while ensuring accurate and reliable protein synthesis. The available tRNA and ribosome machinery support these 20 amino acids in a way that balances diversity, stability, and efficiency in biological systems.
.
Yes, non-natural amino acids can be synthesized through chemical modifications or by engineering tRNA and ribosome systems to incorporate them into proteins. Some modifications include:
Amino acids likely formed naturally on early Earth from simple molecules present in the primordial environment. Experiments, such as the Miller-Urey experiment, have demonstrated that lightning, UV radiation, and other energy sources could drive the formation of amino acids from gases like methane, ammonia, and water. Additionally, some theories suggest that amino acids may have arrived on meteorites, as certain extraterrestrial samples contain amino acids.
A left-handed helix would form because D-amino acids are the mirror image of the naturally occurring L-amino acids, which typically form right-handed alpha-helices.
Yes, in addition to alpha-helices, other helical structures such as pi-helices and 3_10 helices exist. These shorter or less common helices can be identified using advanced structural analysis techniques, including X-ray crystallography, cryo-EM, and AI-based structure prediction tools like AlphaFold.
Most naturally occurring amino acids are L-amino acids, which inherently favor right-handed helices. This handedness is also energetically favorable, as it optimizes hydrogen bonding and steric interactions, leading to more stable protein structures.
Beta-sheets tend to aggregate because their structure allows for extensive intermolecular hydrogen bonding, leading to the formation of large, stable sheet-like assemblies. The driving force behind this aggregation is often the presence of hydrophobic amino acids, which promote hydrophobic interactions and further stabilize the aggregated state.
Amyloid diseases involve beta-sheets because misfolded proteins can adopt an energetically stable beta-sheet conformation, leading to fibrillar aggregates that accumulate in tissues. These fibrils are highly ordered and resistant to degradation, contributing to diseases like Alzheimer’s and Parkinson’s.
Yes, amyloid beta-sheets can be used as materials due to their high stability and self-assembling properties. They have been explored in tissue engineering, drug delivery, and biomaterials applications.
In this part of the homework, you will be using online resources and 3D visualization software to answer questions about proteins.
Pick any protein (from any organism) of your interest that has a 3D structure and answer the following questions.
I picked Ferritin protein which play a crucial role in iron storage and metabolism. I chose it because it looks like a promising protein which could be used as a self-assembled liposome structure I could use for my final project of bio-sponge decorated with nano-body specific to the target which we are trying to remove. I specifically chose human Ferritin light chain for ease of design in my future application.
Identify the amino acid sequence of your protein.
The Ferritin light chain has 175 amino acids; Leucine (Leu, L) is the most frequent amino acid with 27 appearances.
Identify the structure page of your protein in RCSB
Structure was solved in 2006-07-11; it had resolution of 1.90 Å
There are two ligands mentioned: CADMIUM ION and SULFATE ION
It belongs the the iron binding protein family
Open the structure of your protein in any 3D molecule visualization software:
cartoon

ribbon

ball

sticks

The protein seem to have helices but no sheets



It looks like there pockets which can be used for iron (possibly)
<aside> <img src="/icons/snippet_lightgray.svg" alt="/icons/snippet_lightgray.svg" width="40px" /> Resources: https://colab.research.google.com/drive/1hXStRY9VCyw52n17uWdWQBj__IcR2ztK?usp=sharing#scrollTo=38gFJBazNdzJ
</aside>
Using alphafold server I was able to put it the sequence of 175 amino acids and go the following output:


it looks like the majority of the structure has high confidence (in blue) showing alpha helices matching with the experimental protein. The low confidence regions are likely regions of less stability where alpha helices which form a stable structure are higher in confidence.
ESM fold

Inverse-fold your structure with ProteinMPNN
MSSQIRQNYSTDVEAAVNSLVNLYLQASYTYLSLGFYFDRDDVALEGVSHFFRELAEEKREGYERLLKMQNQRGGRALFQDIKKPAEDEWGKTPDAMKAAMALEKKLNQALLDLHALGSARTDPHLCDFLETHFLDEEVKLIKKMGDHLTNLHRLGGPEAGLGEYLFERLTLKHD
When running pBLAST with the original sequence it is showing perfect macth- ProteinMPNN’s default behavior (T=0) already confirms that the original sequence is well-fitted to the structure. Higher temperatures allow for more divergence from the original sequence but the default behavior is good at predicting the original sequence.