Classification of Protein Structure (RMSD <= 6A˚) using Physicochemical Properties

Sonal Mishra; Yadunath Pathak; Anamika Ahirwar

Classification of Protein Structure (RMSD <= 6A˚) using Physicochemical Properties

원문정보

Sonal Mishra, Yadunath Pathak, Anamika Ahirwar

보안공학연구지원센터(IJBSBT) International Journal of Bio-Science and Bio-Technology Vol.7 No.6 2015.12 pp.141-150 SCOPUS

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

The quality of the protein structure can be determined by physical and chemical properties, therefore it has been used to distinguish native or native like structure from other predicted structures. In this study, the machine learning classification models are explored with six physical and chemical properties to classify the root mean square deviation (RMSD) of the protein structure in absence of its true native state and each protein structure lies between 0A˚ to 6A˚ RMSD space. Physical and chemical properties used in this paper are total surface area, Euclidean distance, total empirical energy, secondary structure penalty, residue length, and pair number. There are total 24294 decoys, having 4919 native structures. Artificial bee colony algorithm is used to determine the feature importance. The K-fold cross validation is used to measure the robustness of the best classification model. The results show that random forest method outperforms other machine learning models in the classification of protein structure prediction with sensitivity of 0.72 and accuracy of 70.33% on testing data set. The data set used in the study is available at http://bit.ly/RMSD-Classification-DS.

키워드

저자정보

Sonal Mishra Maharana Pratap College of Technology TechnologyGwalior - 474006, India
Yadunath Pathak Maharana Pratap College of Technology TechnologyGwalior - 474006, India
Anamika Ahirwar ABV-IIITM, Gwalior - 474015, India Gwalior - 474006, India

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle