Skip to content
View lipracer's full-sized avatar
😀
Freebooks
😀
Freebooks
  • shanghai
  • 19:05 (UTC +08:00)

Organizations

@llvm

Block or report lipracer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Next generation BLAS implementation for ROCm platform

C++ 360 173 Updated Feb 5, 2025

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

C++ 4,945 757 Updated Feb 8, 2024

Fast and memory-efficient exact attention

Python 15,318 1,444 Updated Feb 4, 2025
C 1 Updated Jul 21, 2024

How do we integrate AI generation tools into actual work? | 关于 Ai 绘画的Wiki | Wiki about Ai painting | Prompts Engineering| 指南 Guide | Seeking Maintainer&Translator🙌

HTML 1,920 119 Updated Oct 31, 2024

Xv6 for RISC-V

C 7,549 2,770 Updated Sep 6, 2024
Python 16 1 Updated Dec 31, 2024

GNU toolchain for RISC-V, including GCC

C 3,710 1,201 Updated Jan 20, 2025

RISC-V Instruction Set Manual

TeX 3,850 666 Updated Feb 6, 2025

FlagGems is an operator library for large language models implemented in Triton Language.

Python 408 63 Updated Feb 6, 2025

Official QEMU mirror. Please see https://www.qemu.org/contribute/ for how to submit changes to QEMU. Pull Requests are ignored. Please only use release tarballs from the QEMU website.

C 10,831 5,740 Updated Feb 3, 2025

AMD ROCm™ Software - GitHub Home

Shell 4,923 403 Updated Feb 5, 2025

ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime

C++ 235 115 Updated Feb 5, 2025

hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

Assembly 76 99 Updated Feb 6, 2025

Hook function calls by replacing PLT(Procedure Linkage Table) entries.

C 791 161 Updated Sep 2, 2024

Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .

C++ 108 106 Updated Feb 6, 2025

CUDA on non-NVIDIA GPUs

Rust 10,563 683 Updated Feb 3, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 6,140 1,054 Updated Feb 2, 2025

收藏的一些经典的历史、政治、心理、哲学、数学、计算机方面电子书(约10万本)

JavaScript 4,962 683 Updated Sep 28, 2023

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 30,736 12,614 Updated Feb 6, 2025

Optimized primitives for collective multi-GPU communication

C++ 3,420 851 Updated Jan 27, 2025

Demo project for building Python wheels for Linux with Travis-CI

C 227 90 Updated Feb 24, 2021

Python wheels that work on any linux (almost)

Shell 1,497 220 Updated Feb 2, 2025

C/C++ Performance Profiler

C++ 4,258 351 Updated Jan 31, 2025

Development repository for the Triton language and compiler

Python 105 31 Updated Feb 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 36,492 5,516 Updated Feb 6, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,074 419 Updated Jan 28, 2025

CUDA Core Compute Libraries

C++ 1,429 188 Updated Feb 6, 2025

Shared Middle-Layer for Triton Compilation

MLIR 220 50 Updated Feb 6, 2025
Next