- Technion
- mttk.github.io
- @mtutek
- @mtutek.bsky.social
Stars
A benchmark with locally sourced multilingual questions for 31 languages.
Toolkit for linearizing PDFs for LLM datasets/training
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
A bibliography and survey of the papers surrounding o1
A unified interface for computing surprisal (log probabilities) from language models! Supports neural, symbolic, and black-box API models.
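Surprisal is just the negative log probability a model assigns to a token given its context. A minimal, self-contained sketch of the quantity itself (using a toy hand-written distribution in place of a real language model; this is illustrative and not the toolkit's actual API):

```python
import math

# Toy stand-in for a language model's next-token distribution after some
# context; a real model would produce these probabilities from its softmax.
next_token_probs = {"cat": 0.5, "dog": 0.25, "xylophone": 0.25}

def surprisal(token, probs):
    """Surprisal in bits: -log2 p(token | context)."""
    return -math.log2(probs[token])

print(surprisal("cat", next_token_probs))        # likely token -> low surprisal (1.0 bit)
print(surprisal("xylophone", next_token_probs))  # unlikely token -> higher surprisal (2.0 bits)
```

Higher surprisal means the model found the token less predictable, which is why the same quantity works across neural, symbolic, and black-box API models: anything that yields token log probabilities supports it.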
A professionally curated list of awesome Conformal Prediction videos, tutorials, books, papers, PhD and MSc theses, articles and open-source libraries.
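The core idea behind conformal prediction can be shown compactly. A minimal split-conformal sketch for regression intervals (a generic textbook construction, not code from any library in the list; the helper name and data are illustrative):

```python
import math

def conformal_interval(cal_preds, cal_labels, test_pred, alpha=0.1):
    """Split conformal prediction: a (1 - alpha) interval around test_pred."""
    # Nonconformity scores: absolute residuals on a held-out calibration set.
    scores = sorted(abs(y - p) for y, p in zip(cal_labels, cal_preds))
    n = len(scores)
    # Finite-sample corrected rank ceil((n+1)(1-alpha)), clipped to n.
    rank = min(math.ceil((n + 1) * (1 - alpha)), n)
    q = scores[rank - 1]
    return test_pred - q, test_pred + q

# Toy calibration data: predictions of 0 with residuals 1..9.
lo, hi = conformal_interval([0.0] * 9, list(range(1, 10)), 5.0, alpha=0.1)
print(lo, hi)  # interval wide enough to cover ~90% of future residuals
```

The coverage guarantee is distribution-free: it relies only on exchangeability of calibration and test points, not on the model being correct.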
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Dermatology ddx dataset, JAX implementations of Monte Carlo conformal prediction, plausibility regions, and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23…
The official PyTorch implementation of Google's Gemma models
Run code inference-only benchmarks quickly using vLLM
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Large-scale multi-document summarization dataset and code
Enhancing small language models with LLM generated counterfactuals.
Code and files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?"
Code associated with NLPeer: A unified resource for the study of peer review
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.
Hackable and optimized Transformers building blocks, supporting a composable construction.
Generate automated tests for your Node.js app via LLMs without developers having to write a single line of code.