metadata

language:
  - zh
  - en
pipeline_tag: automatic-speech-recognition

SongPrep

Demo | Paper | Code | Dataset

This is the official weight repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Transcription. It provides the SongPrep-7B model, which was trained on the Million Song Dataset.

Model Versions

Model	#Params	HuggingFace
SongPrep	7B	you are here

Citation

@misc{tan2025songpreppreprocessingframeworkendtoend,
      title={SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Transcription}, 
      author={Wei Tan and Shun Lei and Huaicheng Zhang and Guangzheng Li and Yixuan Zhang and Hangting Chen and Jianwei Yu and Rongzhi Gu and Dong Yu},
      year={2025},
      eprint={2509.17404},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2509.17404}, 
}

License

The code and weights in this repository are released under the terms of the LICENSE file.