Protein Sequence Analysis using Generative Language Models

Centre for Smart Systems, AI and Cybersecurity (SSAICS)

This research will develop new embedding techniques for language models such as BERT and apply them to protein sequence analysis and the classification of viral proteins.

This project aims to develop new techniques for protein sequence analysis using language models such as BERT, which have shown significant promise in improving accuracy and reducing computation time in biological applications. The limited availability of high-quality data is a challenge when training machine learning models; to address this, new data augmentation techniques will be developed to generate synthetic data. The research will focus on new embedding techniques that convert amino-acid sequences into vector representations, capturing both the structural and the functional traits of the sequences while preserving the integrity of the original data. This will enable more accurate and expressive sequence representations, which are important in protein sequence analysis. The developed techniques will be applied to the classification of viral proteins, with influenza type A as a case study. Classifying these proteins is essential for understanding the structure and function of the virus and for developing effective treatments. The project also aims to incorporate robust biological assumptions into these models and to make them accessible to the wider scientific community.
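As a rough illustration of the two ideas above (turning an amino-acid sequence into a vector, and generating synthetic variants for augmentation), here is a minimal Python sketch. It is not the project's method: the k-mer frequency vector is a simple baseline standing in for a learned BERT-style embedding, and the function names, the residue similarity groups, and the example sequence are all assumptions made up for illustration.

```python
import random
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues

# Hypothetical groups of physico-chemically similar residues, used only
# for illustration; a real project would justify these biologically.
SIMILAR_GROUPS = ["ILVM", "FWY", "KRH", "DE", "ST", "NQ"]


def kmer_tokens(sequence, k=3):
    """Split a protein sequence into overlapping k-mers ("words")."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def composition_vector(sequence, k=1):
    """Return normalised k-mer frequencies as a fixed-length vector.

    This is a simple baseline representation, not a learned embedding.
    """
    vocab = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    counts = Counter(kmer_tokens(sequence, k))
    # Only k-mers made of standard residues are counted; avoid dividing by 0.
    total = sum(counts[t] for t in vocab) or 1
    return [counts[t] / total for t in vocab]


def augment(sequence, rate=0.1, seed=None):
    """Create a synthetic variant by swapping residues within a group."""
    rng = random.Random(seed)
    out = []
    for residue in sequence:
        group = next((g for g in SIMILAR_GROUPS if residue in g), None)
        if group and rng.random() < rate:
            # Replace with a different residue from the same group.
            out.append(rng.choice([r for r in group if r != residue]))
        else:
            out.append(residue)
    return "".join(out)


if __name__ == "__main__":
    seq = "MKAILVVLLYTFA"  # made-up example fragment, not real data
    print(kmer_tokens(seq)[:3])
    print(len(composition_vector(seq)))
    print(augment(seq, rate=0.3, seed=0))
```

In a language-model setting, the k-mer tokens would typically be fed to a tokenizer and encoder rather than counted directly. The substitution-based augmentation keeps the sequence length fixed, which is one simple way to stay close to the original data.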

Funding

This PhD is offered on a self-funded basis.

Details of all postgraduate fees and funding can be found on the Fees and finance page of the Staffordshire University website (staffs.ac.uk).

Supervisory team

Dr Saeed Shiry Ghidary

Lecturer

I hold a Ph.D. in Robotics from Kobe University. With 20 years of teaching experience in AI and Robotics, I have published numerous papers. My research interests include robotics, AI, machine learning, telerobotics, mobile robots, and theoretical ML.

Saeed's profile

Course requirements

MSc in computer science or a related subject

Experience of conducting health-related research

How to apply

To apply for a self-funded PhD, please complete the Enquiry Form and clearly indicate which PhD project you are interested in.


Contact Us

Saeed Shiry Ghidary

Lecturer

Start dates
Friday 30 June 2023
Saturday 30 September 2023