Skip to content

seungheondoh/music_caps_dl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Music Caps Donwloader

Unofficial download repository for MusicCaps

The MusicCaps dataset contains 5,521 music examples, each of which is labeled with an English aspect list and a free text caption written by musicians.

Current Status

  • ( 5495 / 5521 ), 26 audio missing

Quick Start

conda create -n YOUR_ENV_NAME python=3.8
conda activate YOUR_ENV_NAME
pip install -r requirements.txt
python main.py

ydl_opts

I've crawled the all audio. If you want to use storage space efficiently, please uncomment below and download 10sec.

def _download_audio(x):
    (
        ytid,
        start,
        end,
        out_dir,
    ) = x

    ydl_opts = {
        "outtmpl": f"{out_dir}/%(id)s.%(ext)s",
        # "outtmpl": f"{out_dir}/[{ytid}]-[{start//1000}-{end//1000}].%(ext)s",
        "format": "bestaudio[ext=webm]/bestaudio/best",
        "external_downloader": "ffmpeg",
        "external_downloader_args": [
            "-loglevel",
            "panic",
            "-http_proxy",
            "socks5://127.0.0.1:1080"
        ],
        # "external_downloader_args": [
        #     "-ss",
        #     str(start_dt),
        #     "-to",
        #     str(end_dt),
        #     "-loglevel",
        #     "panic",
        # ],
        "postprocessors": [
            {
                "key": "FFmpegExtractAudio",
                "preferredcodec": "mp3",
            }
        ],
        "quiet": True,
        "no-mtime": True,
    }
    try:
        with YoutubeDL(ydl_opts) as ydl:
            ydl.download([f"https://www.youtube.com/watch?v={ytid}"])
    except KeyboardInterrupt:
        raise
    except Exception:
        pass

issue

# invalid start time
WARNING: [youtube] Invalid start time (53886.0 < 0) for chapter "26-08-09 Melding : 13-119 A1 AMSTERDAM BOLSTOEN -- RIT:175"
WARNING: [youtube] Invalid start time (380.0 < 0) for chapter "Genre:"
WARNING: [youtube] Invalid start time (380.0 < 0) for chapter "Duur"                                        
...

# issue: Private video & unavailable & terminated
ERROR: [youtube] 0J_2K1Gvruk: Private video. Sign in if you've been granted access to this video
ERROR: [youtube] 63rqIYPHvlc: Private video. Sign in if you've been granted access to this video
ERROR: [youtube] Ah_aYOGnQ_I: Private video. Sign in if you've been granted access to this video
...
ERROR: [youtube] Akg1n9IWSrw: Video unavailable. This video is no longer available because the YouTube account associated with this video has been terminated.
ERROR: [youtube] B7iRvj8y9aU: Video unavailable. This video is no longer available because the YouTube account associated with this video has been terminated.
ERROR: [youtube] NIcsJ8sEd0M: Video unavailable. This video is no longer available because the YouTube account associated with this video has been terminated.
...

# issue: country(KR)
ERROR: [youtube] KMQmM12G9Z4: Video unavailable. This video contains content from WMG, who has blocked it in your country on copyright grounds

Reference

About

Unofficial download repository for MusicCaps

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages