How can I batch report spammers?

spam
Tags: #<Tag:0x00007fe3d41230a8>

#1

I believe I found about 892 spammers:

https://musicbrainz.org/search?limit=25&method=advanced&page=36&query=buy+%2Bmg&type=editor

another example of a search revealing probably mostly spammers (930):

https://musicbrainz.org/search?query=service+%2Bcenter&type=editor&limit=25&method=advanced

Most of these probably never made any edits and just add spammy links and stuff on their profile, but both of these lists have at least one active spammer.


#2

There’s no way to batch report spammers. There might be something in SpamBrainz once that rolls around, but please, please, please do not report spammers until we have SpamBrainz, unless there’s something that really needs urgent attention. (E.g., a spammer that continues to add spam edits after its first few edits, or a spammer that links to something truly heinous. Pharmacy, real estate, etc. spammers are just annoying, but we’ll deal with them when we have a better framework for it (SpamBrainz).)


Voting/Auto-editor Request Thread
#3

:open_mouth: That’s new to me. Why not?


#4

Because combing through ~2000 editor reports will 1) take hours and hours and hours to handle, hours that will not be going to e.g., getting ready for GCI or handling reports of actual human editors, 2) burn me out (I don’t want to look at 2000+ spam accounts, repeating the same steps over and over to delete them), 3) be a drop in the ocean—we probably get at least about 400,000 new spam accounts/month, or around 13,000 accounts per day. I wouldn’t be able to go through 2000 in one day, and even if I could, there’d already be 11,000 new ones waiting for me when I was done.

The spam issue is too severe to be handled manually (which is the only way we can currently deal with it) → we need better tooling to deal with it, and for now we’re putting our eggs in the SpamBrainz basket.


#5

I didn’t know the situation was that bad :frowning: Yes, sounds like this can only be handled by some automation. After all the spammers automate themselves, fighting against this with human labor is going to be a loosing battle.


#6

That’s insane! If you’re ever looking for a blog post (or conference…) topic I would be fascinated to hear more about the ins and outs of why and how and what.


#7

Especially if you see the actual Editor numbers:
From 1’977’737 valid Editors are 772’009 inactive. Another 992’776 has “validated email only”. From the difference (212’952 ever active Editors) only 1’309 “edited and/or voted in the last 7 days”.


#8

Hm. Looking at the MB stats, the numbers I have may be slightly (or drastically) off. If we got 400,000 users in 31 days, we’d be way past 1,977,737 editors by now. The 400k was a figure I got from @Zas, and I didn’t actually verify it myself. I’ll try and investigate more tomorrow.


#9

Okay, so, @Zas and I both misread the numbers since the report we were looking at apparently didn’t actually take the dates given to it into account. :man_facepalming:

We’re looking at some other stats and the forum is getting around 90–140 new “users” per day. Most of these are likely bot-created, which means most likely spam accounts (passive, active, or “sleepers”). The forum also gets less than MusicBrainz.org, since 1) I have blacklisted a number of known spammer-only e-mail domains from getting accounts on the forum at all, so forum stats are not fully representative, and 2) accounts only get created here once the e-mail used for MB signup has been verified—so there are probably a fair number more “bad” accounts on MB, but probably not in the 1000s/day range… yet/luckily. :slight_smile:

Still, the numbers are too damn high, but we need better tooling before we can do anything effective about it, so the plea to not report spammers still hold.