Settings: In terms of tag-conditioned music generation, we initiate with a “start-of-song” event, then feed relevant tags into either MST-Tags-Small or MST-Tags. This is followed by a “start-of-notes” event to allow the model to begin generating notes. For text-conditioned music generation, we input the text embedding corresponding to the provided text into MST-Text and use a “start-of-song” event to prompt the model to start generating notes.
Model | Condition | Audio Sample |
---|---|---|
MST-Tags-Small | Classical/Traditional, Hard, ludwig van beethoven, Piano | |
MST-Tags | Classical/Traditional, Hard, ludwig van beethoven, Piano | |
MST-Text | This is an easy classical piano piece composed by wolfgang amadeus mozart. |
Model | Condition | Audio Sample |
---|---|---|
MST-Tags-Small | Rock/Metal, Electric-guitar, Electric-bass, Piano | |
MST-Tags | Rock/Metal, Electric-guitar, Electric-bass, Piano | |
MST-Text | This is a rock or metal music. It has piano, guitar and bass within it. |
Model | Condition | Audio Sample |
---|---|---|
MST-Tags-Small | Drum, Synth_Bass | |
MST-Tags | Drum, Contrabass, Piccolo | |
MST-Text | This music has drum within it |
Dataset | Description | Number of Samples | Tag Information | Text Information |
---|---|---|---|---|
MetaScore-Raw | The raw MuseScore files and metadata scraped from the MuseScore forum | 963K | ✔ | |
MetaScore-Genre | A subset of MetaScore-Full that contains ground truth genre tags and LLM-generated Captions | 181K | ✔ | ✔ |
MetaScore-Plus | MetaScore-Raw were missing genre tags are completed by the trained genre tagger. LLM captions based on those tags are also provided | 963K | ✔ | ✔ |
Model | Training Dataset | Input Type | Model Size | Training Samples |
---|---|---|---|---|
MST-Tags-Small | MetaScore-Genre | Tag | 87.36M | 150K |
MST-Tags | MetaScore-Plus | Tag | 87.36M | 901K |
MST-Text | MetaScore-Plus | Text | 87.44M | 560K |
We show annotation examples in MetaScore in the following table. The first two examples are from genre tagger test set with both true genre tags and inferred genre tags. Example 3,4, 6 and 7 are from those without true genre tags. Example 5 doesn’t have inferred genre tags because they are from genre tagger training set. Note that the following samples are not generated by AI.
Audio Sample | True Genre | Inferred Genre | Extracted Metadata | LLM-Captions |
---|---|---|---|---|
Classical/Traditional | Classical/Traditional | Classical/Traditional; Easy; William Marshall; 1 view | A easy classical music piece composed by William Marshall. | |
Rock/Metal | Rock/Metal | Rock/Metal; Intermediate; Piano; 5 comments; 158 favorites; 5862 views | A music piece with a rock/metal vibe. | |
✘ | Soundtrack/Stage | Sountrack/Stage; Trombone, Piano | This is a soundtrack with a trombone and piano in it | |
✘ | Jazz/Blues | Jazz/Blues; intermediate;Piano; 1 comment, 6 favorites and 462 views | A intermediate piano music piece. This music has jazz/blues vibe. | |
Rock/Metal | ✘ | Rock/Metal; Bass; 2 comments; 255 favorites; 5670 views; acaphella; bad romance; cover; lady gaga | A bass music piece with rock/netal vibe. This music piece is a cover of Lady Gaga’s Bad Romance. | |
✘ | Rock/Metal | Rock/Metal; intermediate; Comp: JJ Lin”, “Arr: Daniel Cheah; 4/4 time, Piano, 2 comments, 200 favorites; 3662 vides; 80 tempo | A piano piece in the style of JJ Lin, in 4/4 time, with a tempo of 80 bpm | |
✘ | Folk/Country | Folk/Country; easy; D major; 4/4 time; 2 views | A simple 4/4 piece in D major |
Note: LLM-captions are generated using true genre tags when these are available. If true tags are not available, the captions are generated using inferred genre tags instead.
The following music pieces are generated by MST-Tags-Small given tag conditions.
Condition | Audio Sample |
---|---|
Easy, Piano | |
Advanced, Piano | |
Jazz/Blues, Piano, tenor-saxophone | |
Classical/Traditional, Wolfgang Amadeus Mozart, Easy, Piano | |
Electronic/Dance |
The following music pieces are generated by MST-Tags given tag conditions.
Condition | Audio Sample |
---|---|
Easy, Piano | |
Advanced, Piano | |
Classical/Traditional, Wolfgang Amadeus Mozart, Easy, Piano | |
Electronic/Dance | |
Jazz/Blues |
The following music pieces are generated by MST-Text or BART-base Text-to-Music given text prompts. In each row, we present one example generated with MST-Text and another with BART-base Text-to-Music, both generated from the same text prompt.
Text Prompts | Audio Sample | Audio Sample |
---|---|---|
Generated with MST-Text | Generated with BART-base | |
This music piece is in a sad mood. | ||
A classical music piece composed by Wolfgang Amadeus Mozart. | ||
This is a soundtrack with electronic/dance vibe. | ||
A classical music piece with Jazz/Blues vibe. This music piece is influenced by George Gershwin. | ||
This is an easy folk/country music piece. | ||
This is a intermediate classical music piece. This music piece has piano in it. | ||
This is a cozy music piece. | ||
A classical music piece composed by Michael Jackson. |