Repository Details

The dataset of all music sheets and users on musescore.com (unmaintained/discontinued since Sep 30, 2021)

musescore-dataset

🚨 The dataset has been left unmaintained since Sep 30, 2021.
Help is appreciated, if you are willing to risk becoming a victim of personal harassment.

The unofficial dataset of all music sheets and users on musescore.com, dedicated to big data analytics / data science / machine learning.

All data is collected by iterating through musescore.com's public API.

The jsonl files are in the Newline-delimited JSON (JSON Lines) format.
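
Because every line is a standalone JSON document, ordinary line-oriented tools can work on these files without loading them whole. A minimal sketch, assuming the file has already been downloaded locally; the helper names `jsonl_count` and `jsonl_record` are made up for illustration and are not part of the dataset's tooling:

```bash
#!/bin/bash

# Count the records in a JSON Lines file (one record per non-empty line).
jsonl_count() {
    grep -c . "$1"
}

# Print the n-th line (1-based) and stop reading right after it.
jsonl_record() {
    sed -n "${2}{p;q;}" "$1"
}

# Example usage (assumes score.jsonl is in the current directory):
#   jsonl_count score.jsonl
#   jsonl_record score.jsonl 1
```

The output of `jsonl_record` is a single JSON object, so it can be piped into any JSON tool you have installed (for example `jq .`) to pretty-print it.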

Only need the sheet files to learn music? Try musescore-downloader.

View/Query in Google BigQuery

User Data

Updated manually.
Last Updated: Nov 9, 2020

https://musescore-dataset.xmader.com/user.jsonl

Music Sheet Metadata

Last Updated: Sep 30, 2021

https://musescore-dataset.xmader.com/score.jsonl

All mscz files

Last Updated: Sep 30, 2021

https://musescore-dataset.xmader.com/mscz-files.csv

# The CSV file itself is on IPFS
# ipns://QmSdXtvzC8v8iTTZuj5cVmiugnzbR1QATYRcGix4bBsioP
cid=$(curl -s https://musescore-dataset.xmader.com/csv-ipfs-ref | grep -o '\w\{46\}')
wget -O mscz-files.csv "https://ipfs.io/ipfs/${cid}/mscz-files.csv"

This CSV file has two columns: the score id (id) and the IPFS reference (ref) of the corresponding mscz file.
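
To fetch a single score rather than the whole collection, the ref for one id can be looked up directly in the CSV. A rough sketch, assuming the file has a header row followed by `id,ref` rows with no quoted fields; `lookup_ref` is a hypothetical helper name, and the values shown in the usage comment are placeholders:

```bash
#!/bin/bash

# Print the IPFS ref for one score id from mscz-files.csv.
# Assumes a header row and two comma-separated columns: id,ref.
lookup_ref() {
    local id="$1" csv="${2:-mscz-files.csv}"
    # Appending "" forces a string comparison, so "0100" does not match "100".
    awk -F, -v id="$id" 'NR > 1 && ($1 "") == (id "") { print $2; exit }' "$csv"
}

# Example usage (the id is a placeholder):
#   ref=$(lookup_ref 123456)
#   [ -n "$ref" ] && echo "ref for 123456 is $ref"
```

Nothing is printed when the id is not found, so callers can test for an empty result.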

All files are available on IPFS.
NO ONE CAN TAKE IT DOWN NOW!

Bulk Download

We (LibreScore team) don't condone mass downloads using regular methods.
USE AT YOUR OWN RISK

See https://discord.com/channels/774491656643674122/777457743983411221/1032054445422420039

(You must join the LibreScore Community Discord first to see the message.)

Download mscz files via IPFS HTTP Gateways

#!/bin/bash
while IFS=, read -r id ref
do
    if [ -f "$id.mscz" ]; then
        echo "$id.mscz exists."
    else
        echo "$id.mscz does not exist."
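        # Quote the variables, and remove the empty file wget leaves behind on failure
        wget -nv --read-timeout=3 "https://ipfs.io$ref" -O "$id.mscz" || rm -f "$id.mscz"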
    fi
done < <(sed '1d' mscz-files.csv)

Using CURL

#!/bin/bash
while IFS=, read -r id ref
do
    if [ -f "$id.mscz" ]; then
        echo "$id.mscz exists."
    else
        echo "$id.mscz does not exist."
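        # Quote the variables, and remove a partial file if the transfer fails or times out
        curl -\# -f "https://ipfs.io$ref" -o "$id.mscz" -m 3 || rm -f "$id.mscz"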
    fi
done < <(sed '1d' mscz-files.csv)

Or using local IPFS daemon

#!/bin/bash

# Install IPFS https://docs.ipfs.io/how-to/command-line-quick-start/#install-ipfs

ipfs daemon --init &

while IFS=, read -r id ref
do
    ipfs get "$ref" -o "$id.mscz"
done < <(sed '1d' mscz-files.csv)
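
The gateway loops above hard-code https://ipfs.io and treat any existing file as complete. As a sketch only, the same idea can be wrapped in two small functions so that the gateway is configurable and failed transfers do not leave partial files behind. The names `gateway_url` and `fetch_score`, the 30-second timeout, and the assumption that every ref starts with `/ipfs/` are illustrative choices, not part of the original scripts:

```bash
#!/bin/bash

# Build a download URL from a ref such as /ipfs/<cid> and a gateway base URL.
# Falls back to https://ipfs.io when no gateway is given.
gateway_url() {
    local ref="$1" gateway="${2:-https://ipfs.io}"
    # Strip one trailing slash from the gateway so the URL has no "//".
    printf '%s%s\n' "${gateway%/}" "$ref"
}

# Download one score unless a non-empty copy already exists.
# On failure the partial file is removed so a later run retries it.
fetch_score() {
    local id="$1" ref="$2" gateway="$3"
    [ -s "$id.mscz" ] && return 0
    curl -sS -f -m 30 "$(gateway_url "$ref" "$gateway")" -o "$id.mscz" || {
        rm -f "$id.mscz"
        return 1
    }
}

# Example usage, with the same CSV loop as the scripts above:
#   while IFS=, read -r id ref; do
#       fetch_score "$id" "$ref" || echo "failed: $id" >&2
#   done < <(sed '1d' mscz-files.csv)
```

Passing a second gateway base as the third argument of `fetch_score` makes it easy to retry failed ids against a different gateway.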

Help hosting files

You could help musescore-dataset become more accessible by:

  • Hosting (ipfs pin) those mscz files on your own IPFS nodes

    #!/bin/bash
    while IFS=, read -r id ref
    do
        ipfs pin add -r --progress "$ref"
    done < <(sed '1d' mscz-files.csv)

    or,

  • Asking a public IPFS gateway to periodically fetch and cache file requests

    #!/bin/bash
    # run in a cron job
    while IFS=, read -r id ref
    do
        echo "fetching $id.mscz"
        curl -\# -f "https://ipfs.io$ref" -o "$id.mscz" -m 0.5
        rm -f "$id.mscz"
    done < <(sed '1d' mscz-files.csv | shuf)

Contact me if you have any questions.

The purpose of the project is to make the data of musescore.com accessible to anyone in need and to bring a clean, high-quality music dataset to the world of computer science. It is not meant for individuals who only want to hoard the data without putting it to use.
