Skip to content

dadwadw233/QBot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

QBot: Clone your QQ friends

Overview

QBot is an pipeline for training large language models (LLMs) using QQ chat data. The project covers the full workflow: decrypting chat records, extracting and preprocessing data, LoRA fine-tuning, reward model training, RLHF, and Huggingface-based demos.


Quick Start

1. Install dependencies (TBD)

pip install -r requirements.txt

2. Decrypt QQ chat database

please quit your linux qq before runing script

python scripts/dump_qq_data.py

3. Extract chat data to JSON (interactive script)

python scripts/extract.py --profile_db data/raw/path-to-profile_info.decrypt.db --msg_db data/raw/path-to-nt_msg.decrypt.db

4. Preprocess data

python scripts/preprocess.py /path/to/data /path/to/sensitive_words.txt

TBD


About

🤖 Clone your friends from QQ chat history — personality reconstruction from messages.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages