Lists (1)
Sort Name ascending (A-Z)
Starred repositories
A corpus of lyrics of Tamil Movie Songs
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
[EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia
jesman / fabric
Forked from danielmiessler/fabricfabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
This research focused on the case of customer payment defaults (credits) in Taiwan and compared the predictive accuracy of default probability among data mining methods.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Simple text to phones converter for multiple languages
Tesseract Open Source OCR Engine (main repository)
jesman / llama3
Forked from meta-llama/llama3The official Meta Llama 3 GitHub site
jesman / tamil-llama
Forked from abhinand5/tamil-llamaA New Tamil Large Language Model (LLM) Based on Llama 2
A New Tamil Large Language Model (LLM) Based on Llama 2
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social …
Examples and guides for using the Gemini API
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
PyTorch original implementation of Cross-lingual Language Model Pretraining.
Contains data resources to replicate results from the paper “Re-contextualizing Fairness in NLP: The Case of India”.
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Notebook to go along with a lecture for the MIT course 8.16: Data Science in Physics on neural simulation-based inference.