Skip to content
View lcq012's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report lcq012

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 4 Updated Dec 6, 2024

EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!

Python 41 4 Updated Apr 12, 2024

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,177 2,328 Updated Feb 10, 2025

[NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an …

Python 906 45 Updated Jan 31, 2025
Python 4 Updated Jan 7, 2025

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,066 46 Updated Jul 31, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,156 1,032 Updated Feb 8, 2025

[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Python 173 12 Updated Mar 25, 2024

Reading list of aspect-based sentiment analysis.

217 34 Updated Jul 29, 2023

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,415 419 Updated Aug 7, 2024

A Framework of Small-scale Large Multimodal Models

Python 733 80 Updated Jan 28, 2025

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Python 392 61 Updated Apr 24, 2024

[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model

Python 314 17 Updated Nov 4, 2024

Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。

Python 27,401 2,027 Updated Feb 11, 2025

How to run the distributed TensorFlow in a Kubernetes cluster

Python 7 Updated Jul 22, 2020
Java 8 1 Updated Oct 28, 2021

Tetris, a model predictive control (MPC)-based container scheduling strategy to judiciously make migration decisions for long-running containerized workloads. Tetris can achieve the long-term optim…

Python 22 4 Updated Dec 30, 2024

DelayStage is a simple yet effective stage delay scheduling strategy to interleave the cluster resources across the parallel stages, so as to increase the cluster resource utilization and speed up …

Scala 14 2 Updated Sep 7, 2023

spotDNN is a heterogeneity-aware spot instance provisioning framework to provide predictable performance for DDNN training workloads in the cloud.

Python 15 2 Updated Sep 7, 2023

iSpot is a lightweight and cost-effective instance provisioning framework for Directed Acyclic Graph (DAG)-style big data analytics, in order to guarantee the application performance on cloud tran…

Scala 11 3 Updated Sep 7, 2023

ebrowser, an energy-efficient and lightweight human interaction framework without degrading the user experience in mobile Web browsers.

C++ 12 1 Updated Sep 7, 2023

iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.

Python 37 7 Updated Jun 11, 2024

Prophet is a predictable communication scheduling strategy to schedule the gradient transfer in an adequate order, with the aim of maximizing the GPU and network resource utilization.

Python 15 1 Updated Sep 13, 2023
Jupyter Notebook 12 1 Updated Sep 20, 2023

λDNN is a cost-efficient function resource provisioning framework to minimize the monetary cost and guarantee the performance for DDNN training workloads in serverless platforms.

Python 23 4 Updated Oct 25, 2023

Reading paper list for iCloud group

13 7 Updated Feb 10, 2025

Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs.

Python 19 2 Updated Dec 19, 2024

基于知识迁移的情感-原因对抽取

Python 2 Updated Jun 11, 2022

ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

7,119 557 Updated Jan 4, 2025

搜索引擎原理

1,574 131 Updated Apr 19, 2024
Next