-
Yonsei University
- Seoul
-
11:47
(UTC +09:00) - github.com/dsa-shua
Stars
Extremely simple yet powerful header-only C++ plotting library built on the popular matplotlib
Study parallel programming - CUDA, OpenMP, MPI, Pthread
A GPU performance profiling tool for PyTorch models
Modified version of PyTorch able to work with changes to GPGPU-Sim
xupgit / Advanced-Embedded-System-Design-Flow-on-Zynq
Forked from parimalp/Advanced-Embedded-System-Design-Flow-on-ZynqXilinx Embedded Software (embeddedsw) Development
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Code for the paper "Language Models are Unsupervised Multitask Learners"
Build and run containers leveraging NVIDIA GPUs
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…
NPUsim: Full-system, Cycle-accurate, Value-aware NPU Simulator
NeuroSpector: Dataflow and Mapping Optimization of Deep Neural Network Accelerators
Demonstration of a video processing design for the Digilent Zybo, using Web Camera for input and VGA interface for output.
Microarchitecture implementation of the decoupled vector-fetch accelerator
Kite: Architecture Simulator for RISC-V Instruction Set
Flexible Intermediate Representation for RTL
An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more
And Twitter API library for the ESP32 that can tweet