Skip to content
View dsa-shua's full-sized avatar
🖥️
Hustling
🖥️
Hustling

Block or report dsa-shua

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Extremely simple yet powerful header-only C++ plotting library built on the popular matplotlib

C++ 4,384 1,133 Updated Nov 21, 2023

Study parallel programming - CUDA, OpenMP, MPI, Pthread

Cuda 55 15 Updated Jul 3, 2022

Processing-In-Memory (PIM) Simulator

C++ 113 41 Updated Jul 9, 2024

A GPU performance profiling tool for PyTorch models

Python 494 46 Updated Jul 13, 2021

LLM inference in C/C++

C++ 66,224 9,520 Updated Oct 14, 2024

Modified version of PyTorch able to work with changes to GPGPU-Sim

C++ 45 25 Updated Nov 18, 2022

Xilinx Embedded Software (embeddedsw) Development

HTML 936 1,067 Updated Jul 26, 2024

Serving multiple LoRA finetuned LLM as one

Python 966 45 Updated May 8, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,754 455 Updated May 3, 2024

GPT-3: Language Models are Few-Shot Learners

15,669 2,298 Updated Sep 18, 2020

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,371 5,505 Updated Aug 14, 2024

Build and run containers leveraging NVIDIA GPUs

Go 2,332 251 Updated Oct 13, 2024

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,103 505 Updated Aug 21, 2024

NPUsim: Full-system, Cycle-accurate, Value-aware NPU Simulator

C++ 13 3 Updated Jul 15, 2024

NeuroSpector: Dataflow and Mapping Optimization of Deep Neural Network Accelerators

C++ 16 3 Updated May 9, 2024

Demonstration of a video processing design for the Digilent Zybo, using Web Camera for input and VGA interface for output.

VHDL 24 10 Updated Aug 28, 2016

An esolang in TypeScript, for heaven's sake.

TypeScript 562 34 Updated Aug 5, 2024

Magic VLSI Layout Tool

C 482 100 Updated Oct 11, 2024

RISC-V Proxy Kernel

C 592 308 Updated Oct 8, 2024
Scala 3 Updated May 15, 2020

Microarchitecture implementation of the decoupled vector-fetch accelerator

Scala 147 42 Updated Jan 25, 2024

A Rocket-based RISC-V superscalar in-order core

Scala 26 2 Updated Oct 3, 2024

Kite: Architecture Simulator for RISC-V Instruction Set

C++ 14 3 Updated Mar 22, 2024

Flexible Intermediate Representation for RTL

Scala 724 176 Updated Aug 20, 2024

A conda-forge distribution.

Shell 6,304 325 Updated Oct 12, 2024

Rocket Chip Generator

Scala 3,208 1,122 Updated Oct 8, 2024

An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more

Scala 1,607 637 Updated Oct 14, 2024

And Twitter API library for the ESP32 that can tweet

C++ 22 1 Updated May 8, 2023