- San Francisco, CA
- https://AboutDataScience.wordpress.com
Stars
Compilation of resources for aspiring data scientists
Code for AMIA CRI 2016 paper "Learning Low-Dimensional Representations of Medical Concepts"
System for Medical Concept Extraction and Linking
Extract CUIs from MIMIC notes and represent them using cui2vec
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins
Synthetic Patient Population Simulator
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Scrape job websites into a single spreadsheet with no duplicates.
NYC WiMLDS scikit-learn open source sprint (Aug 24, 2019)
Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.
Generative adversarial network for generating electronic health records.
Semantic segmentation on aerial and satellite imagery. Extracts features such as: buildings, parking lots, roads, water, clouds
Simple PyTorch Tutorials Zero to ALL!
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Introduction to NLP with PyTorch Workshop Project
💡 Looking for inspiration for your next open source project? Or perhaps you've got a brilliant idea you can't wait to share with others? Open Source Ideas is a community built specifically for this! 👋
Open or Easy Access Clinical Data Sources for Biomedical Research
MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases
Cluster Similar Customers of a Retailer using Machine Learning.
A curated list of awesome network analysis resources.
An introduction to network analysis and applied graph theory using Python and NetworkX