---
title: README
emoji: ⛅
colorFrom: green
colorTo: blue
sdk: static
pinned: false
---

<img src="https://pyke.io/assets/pyke-banner.png" width="170" />

# Data

- [👻 **OshiChats v2**](https://huggingface.co/datasets/pykeio/oshichats-v2), a dataset of 56 million chat messages from VTuber live streams, with smarter filtering, neural quality scores, and even more talents.
- [🎙️ **LibriVox Tracks**](https://huggingface.co/datasets/pykeio/librivox-tracks), a dataset of all 411K audio tracks uploaded to LibriVox before 26 September 2023, complete with reader IDs and links to the original texts.
- [👁️‍🗨️ **OSHIChats v1 (August 2023)**](https://huggingface.co/datasets/pykeio/oshichats-v1-2308), a dataset of 8 million high-quality chat messages collected and filtered from more than 1,000 VTuber live streams.