Course project for STAT 154 at UC Berkeley
"Trade-off" explores a lightweight BERT model for Chinese Question Answering (QA), demonstrating its effectiveness and efficiency compared to Large Language Models (LLMs).
- DRCD: Delta Reading Comprehension Dataset.
- ODSQA: Open-Domain Spoken Question Answering Dataset.
- BERT and its variants (ALBERT, RoBERTa).
- Comparative analysis with larger LLMs (Qwen-7B, Baichuan 2).
Fine-tuning BERT for extractive QA, focusing on preprocessing (mapping character-level answer spans to token positions), training, and postprocessing (decoding start/end logits back into answer text); see the sketch below.
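A minimal sketch of this fine-tuning pipeline using Hugging Face `transformers` on SQuAD-style DRCD records. The checkpoint name, file path, and hyperparameters are illustrative assumptions, not the project's actual settings:

```python
# Sketch of extractive-QA fine-tuning; checkpoint, data path, and
# hyperparameters below are illustrative assumptions.
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForQuestionAnswering,
                          TrainingArguments, Trainer, default_data_collator)

checkpoint = "hfl/chinese-roberta-wwm-ext"  # hypothetical Chinese RoBERTa choice
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForQuestionAnswering.from_pretrained(checkpoint)

# Assumes DRCD has been flattened into SQuAD-style JSON records with
# "question", "context", and "answers" fields.
raw = load_dataset("json", data_files={"train": "drcd_train.json"})

def preprocess(examples):
    # Tokenize question/context pairs and map each character-level answer
    # span to token-level start/end positions via the offset mapping.
    enc = tokenizer(examples["question"], examples["context"],
                    truncation="only_second", max_length=384,
                    return_offsets_mapping=True, padding="max_length")
    starts, ends = [], []
    for i, offsets in enumerate(enc["offset_mapping"]):
        ans = examples["answers"][i]
        s_char = ans["answer_start"][0]
        e_char = s_char + len(ans["text"][0])
        seq_ids = enc.sequence_ids(i)
        s_tok = e_tok = 0  # default to [CLS] if the answer was truncated away
        for idx, (span, sid) in enumerate(zip(offsets, seq_ids)):
            if sid != 1:          # only look at context tokens
                continue
            if span[0] <= s_char < span[1]:
                s_tok = idx
            if span[0] < e_char <= span[1]:
                e_tok = idx
        starts.append(s_tok)
        ends.append(e_tok)
    enc["start_positions"] = starts
    enc["end_positions"] = ends
    enc.pop("offset_mapping")
    return enc

train = raw["train"].map(preprocess, batched=True,
                         remove_columns=raw["train"].column_names)

args = TrainingArguments(output_dir="qa-model", learning_rate=3e-5,
                         num_train_epochs=2, per_device_train_batch_size=16)
Trainer(model=model, args=args, train_dataset=train,
        data_collator=default_data_collator).train()
```

At inference time, postprocessing reverses this mapping: the model's start/end logits are decoded back to character spans in the context through the offset mapping, keeping the highest-scoring pair with start ≤ end.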
BERT variants, particularly RoBERTa, achieved high accuracy, outperforming some larger LLMs on specific tasks.
Expanding the dataset, diversifying QA tasks, and adjusting language settings for a more comprehensive analysis.