Artist Consolidation

Let us know when something isn't working correctly, or if you find a typo. Do not post complaints or suggestions here.

Moderator: Mods

Lunkhead
Princess Zelda
Posts: 5183
Joined: Sat Sep 25, 2004 12:14 pm
Instruments: many
Recording Method: cubase/reason/mac/tascam4x4
Submitting as: Berkeley Social Scene, Merisan, Tiny Robots
Location: Berkeley, CA
Contact:

Artist Consolidation

Postby Lunkhead » Sun Jun 29, 2014 9:13 am

Dear FMs, what do you think of consolidating the following artists? I initially came upon two of these by chance, but then it occurred to me to whip up a bit of code to find similar artist names (using Levenshtein distance for now) and found many more. I think it'd be an improvement to the archive. I might just go ahead and consolidate them all on the Jukebox as I've set up a simple form for doing that.

Tobin's Spirit Guide -> Tobin's Spirit Guide (merge the one with the fancy curly apostrophe into the the one with the simple straight apostrophe)

Thanks Glenny for the Frisbee -> Thank Glennny for the Frisbee (the second one has three entries and has "Glennny" spelled right)

The Bersfordians -> The Beresfordians (second one shows up in Google search results other than Song Fight! related results)

Big Matt Hyatt and his Rusty Red Riders -> Big Matt Hyatt and His Rusty Red Ryders (second one has two songs, first only has one)

ChinMusic -> Chin Music (second has way more entries)

Chips, Abbott & Ray -> Chips, Abbott, & Ray (serial comma)

A Werkenhorse -> Daj Werkenhorse
Dan Werkenhorse -> Daj Werkenhorse (Daj has a bunch of entries, the others each have only one, they all sound like the same dude)

Dr Spectacular's Power Circus -> Dr. Spectacular's Power Circus

Elastic Wasteband -> Elastic Waste Band (second has two entries, first only one)

Evil E -> Evil-E

Flvxxvm Forvm -> Flvxxvm Florvm

Fortune's Favorite -> Fortune's Favorites

Freddie Love -> Freddielove

Hey, It's Romer -> Hey it's Romer

The Interchangables -> The Interchangeables (although I like the ring of "The Inter-chang-ables" a la Ben Chang from "Community")

JeebasJones -> Jeebas Jones

Links vs. Music -> Link vs. Music

MC Milk Plus -> MC Milk-Plus (Google search results indicate he uses the dash)

Meat Knob -> Meatknob

The Mexican Champanzees -> The Mexican Chimpanzees (although the misspelled one is funny)

Napoleons Toes -> Napoleon's Toes

Nobody, et al -> Nobody, et al.

Project-D -> Project D

RadioShow -> Radio Show

Ratt Poizon -> RattPoizon

rice Henry and the Transformers -> Brice Henry and the Transformers

RioMondo -> Rio Mondo

Ryan Rickenback -> Ryan Rickenbach

Star Crossed Voyager -> Star-Crossed Voyager

Also, dear eclectic spoons guy, wtf? Is it "her" or "the" spoons? Is it "Eclectic" or "@eclectic"? Is it "SpOOns" or "spOOns" or "sp00ns"? Oy!
Lunkhead
Princess Zelda
Posts: 5183
Joined: Sat Sep 25, 2004 12:14 pm
Instruments: many
Recording Method: cubase/reason/mac/tascam4x4
Submitting as: Berkeley Social Scene, Merisan, Tiny Robots
Location: Berkeley, CA
Contact:

Re: Artist Consolidation

Postby Lunkhead » Sun Jun 29, 2014 9:14 am

There's also "Space Pub" and "Spacepub", but I'm not sure which way to go with that one. Maybe j$ could clarify. It sounds like him singing on those.
User avatar
fluffy
Ganon
Posts: 9364
Joined: Sat Sep 25, 2004 10:56 am
Instruments: sometimes
Recording Method: Logic Pro X
Submitting as: Sockpuppet
Location: The Plaidlands (also, Seattle)
Contact:

Re: Artist Consolidation

Postby fluffy » Sun Jun 29, 2014 10:26 am

IANAFM, but sounds good to me, at least for the cases where it isn't someone purposefully slightly changing their name every week for humor value.

What'd be really awesome is a properly normalized scheme where there's an abstract ID that multiple entry names can map onto but it's hard enough getting people to submit correctly as it is.
User avatar
Spud
Notable Hylian
Posts: 4739
Joined: Fri Sep 24, 2004 10:25 am
Instruments: Bass, Keyboards, eHorn
Submitting as: Octothorpe
Location: Seattle
Contact:

Re: Artist Consolidation

Postby Spud » Sun Jun 29, 2014 11:56 am

What'd really be awesome would be if people would spell their band name the same every time. You know where these come from, right, people?

Not putting down your idea for an abstract ID, fluffy, that would be cool, but it would require some sort of additional registration on the part of entrants.
"I only listen to good music. And Octothorpe." - Marcus Kellis
Song Fight! The Rockening
User avatar
Caravan Ray
poopbutt
Posts: 8010
Joined: Sat Sep 25, 2004 1:51 pm
Instruments: Penis
Recording Method: Garageband
Submitting as: Caravan Ray,G.O.R.T.E.C,Lyricburglar,The Thugs from the Scallop Industry
Location: Toowoomba, Queensland
Contact:

Re: Artist Consolidation

Postby Caravan Ray » Sun Jun 29, 2014 2:30 pm

"Caravan Ray" has 70 entries, and "Caravan ray" has one. I am guessing that one of those may have been a typo.

Oh - there is also a Caravan Ray 1 and Caravan Ray 2 - which actually were not my typos or the names I used - they came from a special fight for JB where I did 2 songs, and they came up under those names, I assume for voting purposes. They look a bit funny in the archive that way.
User avatar
fluffy
Ganon
Posts: 9364
Joined: Sat Sep 25, 2004 10:56 am
Instruments: sometimes
Recording Method: Logic Pro X
Submitting as: Sockpuppet
Location: The Plaidlands (also, Seattle)
Contact:

Re: Artist Consolidation

Postby fluffy » Sun Jun 29, 2014 4:28 pm

Spud wrote:What'd really be awesome would be if people would spell their band name the same every time. You know where these come from, right, people?

Not putting down your idea for an abstract ID, fluffy, that would be cool, but it would require some sort of additional registration on the part of entrants.

Well, yeah, that's what I was trying to imply with the idea of an abstract ID in the first place. And of course people will completely lose their registration information and whatever and just open a new account and not really help anything.
Lunkhead
Princess Zelda
Posts: 5183
Joined: Sat Sep 25, 2004 12:14 pm
Instruments: many
Recording Method: cubase/reason/mac/tascam4x4
Submitting as: Berkeley Social Scene, Merisan, Tiny Robots
Location: Berkeley, CA
Contact:

Re: Artist Consolidation

Postby Lunkhead » Sun Jun 29, 2014 5:41 pm

Caravan Ray wrote:"Caravan Ray" has 70 entries, and "Caravan ray" has one. I am guessing that one of those may have been a typo.

Oh - there is also a Caravan Ray 1 and Caravan Ray 2 - which actually were not my typos or the names I used - they came from a special fight for JB where I did 2 songs, and they came up under those names, I assume for voting purposes. They look a bit funny in the archive that way.


If you go to the profile page for either "Caravan Ray" or "Caravan ray" you'll see that the case doesn't matter and there is only one profile page, with all 71 songs. This is the case on the official archive and the Jukebox. On the Jukebox there can't be both "Caravan Ray" and "Caravan ray" as artist names, but it looks like, even though there is only one profile for both in the official archive, the list of artists still shows both, which seems a little weird.

With the other ones, the numbers are there so people can tell the entries apart on the fight page. To deal with it better would be a bit complicated. There'd have to be a piece of data about every song indicating what number entry it was for that artist for that fight. For most songs, that would just be "1", and in the few odd cases like yours it would be "1" and "2". Then the songs could both be associated with the artist "Caravan Ray" and there'd be some other data to show to distinguish them if necessary. On the Jukebox side, I would add a new column to the songs table in my database and default the value to 1 for all songs, then munge the data either manually or with a script from there. Then I'd have to change the display code to check if an artist has >1 songs in a fight, and if so, show those numbers next to their name... Not sure how it could be handled on the official archive song but it seems like it might potentially be more complicated because of how the data is stored. Just speculating though.
User avatar
fluffy
Ganon
Posts: 9364
Joined: Sat Sep 25, 2004 10:56 am
Instruments: sometimes
Recording Method: Logic Pro X
Submitting as: Sockpuppet
Location: The Plaidlands (also, Seattle)
Contact:

Re: Artist Consolidation

Postby fluffy » Sun Jun 29, 2014 6:10 pm

The way the official archive is structured right now it would be impossible to handle it well. A normalized abstract entrant and entry ID (with different display names for the two entries from the same entrant) would work better, although still not perfect for voting purposes.
User avatar
fluffy
Ganon
Posts: 9364
Joined: Sat Sep 25, 2004 10:56 am
Instruments: sometimes
Recording Method: Logic Pro X
Submitting as: Sockpuppet
Location: The Plaidlands (also, Seattle)
Contact:

Re: Artist Consolidation

Postby fluffy » Sun Jun 29, 2014 6:11 pm

Also, while normalizing solves one problem, it introduces another - how do we handle collaborations?
Lunkhead
Princess Zelda
Posts: 5183
Joined: Sat Sep 25, 2004 12:14 pm
Instruments: many
Recording Method: cubase/reason/mac/tascam4x4
Submitting as: Berkeley Social Scene, Merisan, Tiny Robots
Location: Berkeley, CA
Contact:

Re: Artist Consolidation

Postby Lunkhead » Sun Jun 29, 2014 6:31 pm

Yeah, handling those better would be nice. In the case of there being a "primary" artist and a guest/featured/etc. artist, a simple option would be to add a piece of text data to every song that could hold the "featuring So-and-So" info, to keep it out of the primary artist's name, so the artist name could stay clean, and the collaborations could then be consolidated under the "primary" artist. More better of course would be for songs to have a many-to-many relationship to artists. Maybe those relationships would have a "primary" flag or not, maybe a text field holding the text for describing the relationship (in case people didn't want it to just be a list of the artists' name, e.g. if they wanted "Example Artist with So-and-So").

Whenever I think of this kind of stuff now though I think of Google's knowledge graph and how ultimately if you really want to keep breaking things down that's one place where you'd wind up. That's a bit beyond me though. ;)
User avatar
Spud
Notable Hylian
Posts: 4739
Joined: Fri Sep 24, 2004 10:25 am
Instruments: Bass, Keyboards, eHorn
Submitting as: Octothorpe
Location: Seattle
Contact:

Re: Artist Consolidation

Postby Spud » Tue Jul 01, 2014 12:41 am

By the way, did I forget to mention that I did in fact do the consolidations, as requested?

SPUD
"I only listen to good music. And Octothorpe." - Marcus Kellis
Song Fight! The Rockening
User avatar
fluffy
Ganon
Posts: 9364
Joined: Sat Sep 25, 2004 10:56 am
Instruments: sometimes
Recording Method: Logic Pro X
Submitting as: Sockpuppet
Location: The Plaidlands (also, Seattle)
Contact:

Re: Artist Consolidation

Postby fluffy » Tue Jul 01, 2014 12:54 am

Thanks, Spud!
Lunkhead
Princess Zelda
Posts: 5183
Joined: Sat Sep 25, 2004 12:14 pm
Instruments: many
Recording Method: cubase/reason/mac/tascam4x4
Submitting as: Berkeley Social Scene, Merisan, Tiny Robots
Location: Berkeley, CA
Contact:

Re: Artist Consolidation

Postby Lunkhead » Tue Jul 01, 2014 8:07 am

Awesome, thanks Spud!
Lunkhead
Princess Zelda
Posts: 5183
Joined: Sat Sep 25, 2004 12:14 pm
Instruments: many
Recording Method: cubase/reason/mac/tascam4x4
Submitting as: Berkeley Social Scene, Merisan, Tiny Robots
Location: Berkeley, CA
Contact:

Re: Artist Consolidation

Postby Lunkhead » Tue Jul 01, 2014 8:32 am

OK, done on the Jukebox side too. Hooray for cleaner data!

Return to “Quality Control”

Who is online

Users browsing this forum: No registered users and 1 guest