DialogExtractor

DialogExtractor is a specialized tool designed to extract emotionally rich dialogues from audiobooks of novels. This project stems from the need to enhance Text-to-Speech (TTS) systems by providing them with high-quality, emotion-laden audio data, which is crucial for creating more expressive and realistic speech outputs.

Overview

Traditional TTS systems can usually only synthesize simple emotions such as happiness and sadness. However, complex emotions in real life, such as bitterness and sadness, are difficult to control through text prompts, and there is even a lack of relevant data sets. To this end, DialogExtractor constructs an emotionally rich audio database by accurately extracting the dialogues of characters in rich audio novels. By leveraging this tool, developers can significantly improve the emotional responsiveness of their TTS applications, making them more attractive and realistic. The whole process is shown in the figure below:

Challenges

Accurate positioning of character dialogues: Since it is difficult to mark " " with current ASR technology, it is difficult to determine the location of dialogues only through audio files. To this end, we first extract all the dialogues from the novel text content, and then mark the audio text extracted by ASR to determine the location of the dialogues.
Extraction of dialogue audio: We need to determine which parts of an audio are dialogues and extract them accurately. To do this, we use FunASR to timestamp and then extract them.
Audio filtering: After extraction, some dialogues may be too short to be used, so we need to filter the audio again.

How to use it

Create the Env:

conda create -n name python=3.9
conda activate name
pip install -r requirements.txt

Get Dialog From Webpage:

In this step, you need to extract the dialogues of the characters based on the HTML code of the page where your target novel is located. Here, we take https://m.xbiqugew.com/ as an example to crawl the dialogues of the novel. The parameters that need to be changed are: 1. Web link. 2. Start page. 3. End interface. 4. Crawl location.

python get_web.py

Preprocessing

Due to a series of issues such as data format, audio length, etc., the audio data needs to be initially screened.

python mp3twav.py
python delnonewav.py
python cut.py

ASR

Perform ASR processing on all audio and identify the corresponding text.

cd asr
python asr.py

Filtering

Filter out text containing dialogue.

python get_d.py

Crop audio

The original audio is trimmed to obtain the character dialogue.

python cutaudio.py

Other tools

Get the number of audio in a folder

python number.py

Get the total time of audio in a folder

python gettime.py

Rename all audios in a folder

python rename.py

Emotion Recognition

Click here

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DialogExtractor

Overview

Challenges

How to use it

Create the Env:

Get Dialog From Webpage:

Preprocessing

ASR

Filtering

Crop audio

Other tools

Get the number of audio in a folder

Get the total time of audio in a folder

Rename all audios in a folder

Emotion Recognition

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
asr		asr
img		img
.DS_Store		.DS_Store
README.md		README.md
cut.py		cut.py
cutaudio.py		cutaudio.py
delnonewav.py		delnonewav.py
get_d.py		get_d.py
gettime.py		gettime.py
getweb.py		getweb.py
index.html		index.html
mp3twav.py		mp3twav.py
number.py		number.py
rename.py		rename.py
requirements.txt		requirements.txt

LuckyBian/DialogExtractor

Folders and files

Latest commit

History

Repository files navigation

DialogExtractor

Overview

Challenges

How to use it

Create the Env:

Get Dialog From Webpage:

Preprocessing

ASR

Filtering

Crop audio

Other tools

Get the number of audio in a folder

Get the total time of audio in a folder

Rename all audios in a folder

Emotion Recognition

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages