pyDRMetrics

pyDRMetrics - A Python toolkit for dimensionality reduction quality assessment, Heliyon, Volume 7, Issue 2, 2021, e06199, ISSN 2405-8440, https://doi.org/10.1016/j.heliyon.2021.e06199. (https://www.sciencedirect.com/science/article/pii/S2405844021003042)

A more friendly GUI tool using pyDRMetrics can be accessed at http://spacs.brahma.pub/research/DR

File list:

src/pyDRMetrics.py - the main module
src/other py files - dependent modules
data/ovarian-cancer-nci-pbsii-data-no-header.csv - SELDI-TOF-MS dataset used in the case study. 253 samples. Each sample has 15154 dimensions.
data/cancer.csv - A subset of ovarian-cancer-nci-pbsii-data containing 10 normal and 10 cancer samples. DOI: 10.1016/S0140-6736(02)07746-2
data/digits.csv - 40 samples from the MNIST handwritten digits dataset. URL: http://yann.lecun.com/exdb/mnist/
data/raman.csv - Another dataset containing the Raman spectra of 46 infant formula milk powder samples. DOI: 10.1016/j.talanta.2019.120681
doc.pdf - the code and result for the case study

Installation

pip install pyDRMetrics
Install the R runtime. Then install the ECoL package to the R environment.

How to use this package (with sample code):

Download any sample dataset from the /data folder
Use the following sample code to use the package:

# import the library
from pyDRMetrics.pyDRMetrics import *

# load the dataset
import pandas as pd
data = pd.read_csv('raman.csv')
cols = data.shape[1]
# convert from pandas dataframe to numpy matrices
X = np.array(data.iloc[:,1:-1]) # skip first and last cols
y = np.array(data.iloc[:,-1])
X_names = list(data.columns.values[1:-1]) # -1 for removing the last column
labels = list(set(y))

# perform DR, e.g., PCA
from sklearn.decomposition import PCA
import matplotlib.ticker as mticker
K = 2
pca = PCA(n_components = K) # keep the first K components
pca.fit(X)
Z = pca.transform(X)
Xr = pca.inverse_transform(Z)

# Create DRMetrics object. This object contains all DR metrics and main API functions
drm = DRMetrics(X, Z, Xr)
drm.report() # this will generate a detailed report. You can also access each metric, e.g., drm.QNN, drm.LCMC, etc.

You may also check doc.pdf for more sample codes.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
data		data
src/pyDRMetrics		src/pyDRMetrics
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
doc.pdf		doc.pdf
install_ECoL.png		install_ECoL.png
licence.txt		licence.txt
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

pyDRMetrics

Installation

How to use this package (with sample code):

About

Licenses found

Releases 1

Packages

Contributors 2

Languages

License

Licenses found

zhangys11/pyDRMetrics

Folders and files

Latest commit

History

Repository files navigation

pyDRMetrics

Installation

How to use this package (with sample code):

About

Resources

License

Licenses found

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages