-
Soochow University
- Suzhou, Jiangsu
-
12:28
(UTC +08:00) - https://www.cnblogs.com/charlton-99ing/
Lists (2)
Sort Name ascending (A-Z)
Stars
EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
[NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an …
Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Reading list of aspect-based sentiment analysis.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
A Framework of Small-scale Large Multimodal Models
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。
How to run the distributed TensorFlow in a Kubernetes cluster
Tetris, a model predictive control (MPC)-based container scheduling strategy to judiciously make migration decisions for long-running containerized workloads. Tetris can achieve the long-term optim…
DelayStage is a simple yet effective stage delay scheduling strategy to interleave the cluster resources across the parallel stages, so as to increase the cluster resource utilization and speed up …
spotDNN is a heterogeneity-aware spot instance provisioning framework to provide predictable performance for DDNN training workloads in the cloud.
iSpot is a lightweight and cost-effective instance provisioning framework for Directed Acyclic Graph (DAG)-style big data analytics, in order to guarantee the application performance on cloud tran…
icloud-ecnu / ebrowser
Forked from ebrowser-cloud/ebrowserebrowser, an energy-efficient and lightweight human interaction framework without degrading the user experience in mobile Web browsers.
iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.
Prophet is a predictable communication scheduling strategy to schedule the gradient transfer in an adequate order, with the aim of maximizing the GPU and network resource utilization.
λDNN is a cost-efficient function resource provisioning framework to minimize the monetary cost and guarantee the performance for DDNN training workloads in serverless platforms.
Reading paper list for iCloud group
Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs.
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型