Skip to content
View dratman's full-sized avatar
  • Maximum Software
  • New Jersey, USA
  • 21:50 (UTC -05:00)

Highlights

  • Pro

Block or report dratman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI

Python 270 19 Updated Nov 7, 2024

The Python <-> Objective-C Bridge with bindings for macOS frameworks

Python 593 51 Updated Jan 14, 2025

see github.com/understanding-search/maze-transformer

Jupyter Notebook 9 1 Updated Dec 8, 2023

Exploring the minimal architecture required for coherent English language generation.

Python 3 2 Updated Aug 19, 2023

Creating a mini GPT-2 model from scratch by training it with data obtained from TinyStories

Jupyter Notebook 1 Updated Sep 15, 2023

A 2M parameter neural language model trained on the TinyStories corpus.

Python 2 Updated Oct 3, 2023

Character Level Small Language Model trained on TinyStories datasets

Python 1 Updated Jul 9, 2023

Collection of experiments related to small language models, mostly seq2seq models

Python 9 Updated Jan 13, 2025

Reproduction of TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Python 3 1 Updated May 29, 2023

code to train a gpt-2 model to train it on tiny stories dataset according to the TinyStories paper

Jupyter Notebook 37 6 Updated Nov 24, 2023
Python 1 Updated Apr 22, 2024

A Mathematica and Matlab toolboxes for Clifford algebras of n-dimensional Euclidean vector spaces

Mathematica 11 1 Updated Jan 24, 2018