QSAR-COVID-19-App

Try it as a web app here https://sars-covid-3cl-prediciton.streamlit.app/

App framework built with Python + Streamlit, by Chanin Nantasenamat (aka Data Professor).
Descriptors are calculated using PaDEL-Descriptor [Read the Paper].
Reference 1: https://github.com/dataprofessor/bioactivity-prediction-app
Reference 2: https://github.com/quantaosun/QSAR-COVID-19
For convenience, the model provided here targets the SARS-CoV 3C-like protease, not SARS-CoV-2, because there are not yet enough data points for the latter. It can be retrained as a COVID-19 model once enough data are available.

Only works on Linux and Mac.


Model training

Protein target

The model is a random forest regressor trained on 133 bioactivity data points retrieved from the ChEMBL database in July 2023.

Image from https://www.ebi.ac.uk/chembl/target_report_card/CHEMBL3927/

Model Performance

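from sklearn.ensemble import RandomForestRegressor

# X: fingerprint matrix, Y: pIC50 values (both from the training set)
model = RandomForestRegressor(n_estimators=500, random_state=42)
model.fit(X, Y)
r2 = model.score(X, Y)  # R^2 evaluated on the same data used for fitting
r2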

gives r2 = 0.8635050710434334 (computed on the training data, not on a held-out set).
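For reference, `score` on a scikit-learn regressor returns the coefficient of determination, R² = 1 - SS_res / SS_tot. The sketch below computes the same quantity in plain Python. The numbers are made up for illustration and are not taken from the ChEMBL set, and the function name `r_squared` is hypothetical.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_y = sum(y_true) / len(y_true)
    # Residual sum of squares: spread of the prediction errors.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: spread of the observations around their mean.
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical experimental and predicted pIC50 values (illustration only).
experimental = [4.0, 5.0, 6.0, 7.0]
predicted = [4.5, 5.0, 5.5, 7.0]
print(r_squared(experimental, predicted))
```

Because the value reported here is computed on the molecules the model was fitted to, it describes how well the model fits the training set. It is likely to be higher than what the model would achieve on molecules it has not seen.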

Predicted vs. experimental values for this model (note: these are not external predictions; all points come from the ChEMBL training set)

Run the app locally

Prerequisites

streamlit
scikit-learn (imported as sklearn)

Java needed

On Linux, run

sudo apt install default-jdk

Java is added to PATH automatically.

On Mac, run

brew install java

Then add it to PATH:

echo 'export PATH="/opt/homebrew/opt/openjdk/bin:$PATH"' >> ~/.zshrc
source ~/.zshrc
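PaDEL-Descriptor is a Java program, so descriptor calculation needs a `java` executable on PATH. One way to check the installation from Python is sketched below using only the standard library. The helper name `java_available` is hypothetical and not part of this repo.

```python
import shutil
import subprocess


def java_available():
    """Return True if a `java` executable is found on PATH."""
    return shutil.which("java") is not None


if java_available():
    # `java -version` writes its report to stderr, not stdout.
    result = subprocess.run(["java", "-version"], capture_output=True, text=True)
    print(result.stderr.strip())
else:
    print("java not found on PATH; install a JDK first")
```

If the check fails right after editing `~/.zshrc`, open a new terminal so the updated PATH is picked up.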

Then clone this repository to your local machine and enter its directory:

git clone https://github.com/quantaosun/QSAR-COVID-19-App.git
cd QSAR-COVID-19-App

Launch the app with

streamlit run app.py

Your browser will open the app's interface.

Next, click Upload and select a .txt file containing the SMILES strings you want to predict, then click Predict. The result is shown at the bottom of the page. The file can have any name as long as its extension is .txt. For example, its content could be:

c1ccccc1 benzene

For the moment, you are advised to predict one molecule at a time.
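Judging from the example line, the input file appears to hold one molecule per line: a SMILES string, whitespace, then a name. A minimal sketch of reading such a file, assuming that layout, follows. The function `read_smiles_file` is hypothetical and is not taken from app.py.

```python
import os
import tempfile


def read_smiles_file(path):
    """Return a list of (smiles, name) tuples from a SMILES text file."""
    molecules = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            parts = line.split(maxsplit=1)
            if not parts:
                continue  # skip blank lines
            smiles = parts[0]
            # The name is optional; everything after the first whitespace run.
            name = parts[1].strip() if len(parts) > 1 else ""
            molecules.append((smiles, name))
    return molecules


# Write a one-molecule example file and read it back.
with tempfile.TemporaryDirectory() as tmp:
    example = os.path.join(tmp, "my_molecule.txt")
    with open(example, "w", encoding="utf-8") as handle:
        handle.write("c1ccccc1 benzene\n")
    print(read_smiles_file(example))
```

Checking a file this way before uploading it is a quick way to confirm that it contains exactly one molecule.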

What happens after you click the Predict button

1. The SMILES string is converted into a binary fingerprint of 264 bits, the length the model expects.
2. The fingerprint is assigned to the feature matrix X.
3. X is fed into the trained model, which returns the Y value, i.e. the predicted pIC50.
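The shape check behind these steps, and the meaning of the returned value, can be sketched in plain Python. This is an illustration under stated assumptions, not the code in app.py: the fingerprint below is made up, the helper names `bits_to_row` and `pic50_to_ic50_nm` are hypothetical, and the model call itself is left out. The only outside fact used is the standard definition pIC50 = -log10(IC50 in mol/L).

```python
N_BITS = 264  # fingerprint length expected by the model, as stated above


def bits_to_row(bit_string):
    """Turn a string of '0'/'1' characters into one row of the X matrix."""
    if len(bit_string) != N_BITS or set(bit_string) - {"0", "1"}:
        raise ValueError(f"expected {N_BITS} characters, each '0' or '1'")
    return [int(ch) for ch in bit_string]


def pic50_to_ic50_nm(pic50):
    """pIC50 = -log10(IC50 in mol/L), so IC50 in nM is 10 ** (9 - pIC50)."""
    return 10 ** (9 - pic50)


# Made-up fingerprint with two bits set (illustration only).
fingerprint = "100001" + "0" * (N_BITS - 6)
X = [bits_to_row(fingerprint)]  # a 1 x 264 matrix for a single molecule
print(len(X), len(X[0]), sum(X[0]))

# A pIC50 of 6 corresponds to an IC50 of about 1000 nM (1 µM).
print(pic50_to_ic50_nm(6.0))
```

A higher pIC50 means a lower IC50, that is, a more potent predicted inhibitor.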
