Talk:Spam blacklist

From Meta, a Wikimedia project coordination wiki
This is an archived version of this page, as edited by Jorunn (talk | contribs) at 17:00, 16 February 2010 (Essay spam). It may differ significantly from the current version.

Shortcut:
WM:SPAM
WM:SBL
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any meta administrator can edit the spam blacklist. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.
Proposed additions
Please provide evidence of spamming on several wikis. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report; there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (e.g. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-links (connect) - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived (search) quickly. Additions and removals are logged.

snippet for logging
{{sbl-log|1860629#{{subst:anchorencode:SectionNameHere}}}}

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.

Libel website

Would someone with the power to blacklist a website globally please email me? I do not want to post a link to it here. Thank you in advance. PCHS-NJROTC 18:29, 13 February 2010 (UTC)

Hello PCHS-NJROTC, if your request involves private or sensitive information, please email it to OTRS at info-en-l@wikimedia.org. OTRS volunteers are tasked with handling cases that involve private or sensitive information. Thank you, -- Dferg (talk) 18:38, 13 February 2010 (UTC)
When emailing them, I received a response indicating refusal to work with this via email. I prefer not to post links to this site where search engines may find it, so... PCHS-NJROTC 01:27, 14 February 2010 (UTC)
What ticket number did you get? FWIW, I think it already did get blacklisted. -- Mike.lifeguard | @en.wb 01:32, 14 February 2010 (UTC)
I spoke to PCHS; the site has already been Added, when Alison noticed it from the oversight request. James (T C) 06:34, 14 February 2010 (UTC)

Essay spam

Links spammed by IPs 122.49.210.50, 62.80.184.178, 119.111.124.194, 24.107.14.189 and lots of single purpose accounts. See also research-service.com request

Already blacklisted:

--Jorunn 17:00, 16 February 2010 (UTC)

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports; please check the records and the link thoroughly, as the bot may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (fewer than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).

Sysops
  • If the report contains links to fewer than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
COIBot's currently open XWiki reports
List | Last update | By | Site IP | R | Last user | Last link addition | User | Link | User - Link | User - Link - Wikis | Link - Wikis
bluemeranterquacksalber.blogspot.de | 2024-09-24 00:14:20 | COIBot | 172.253.115.132 | R | Cherrynoglu, Fano, Heiko Gerber | 2024-09-23 23:48:25 | 45 | 2 | | |
btobits.com | 2024-09-23 22:55:45 | COIBot | 34.195.92.76 | R | OlehVasyliev | 2024-09-22 21:53:43 | 16 | 12 | 0 | 0 | 4
casaimperialdemexico.com | 2024-09-21 20:56:02 | COIBot | 213.130.145.229 | | Adrienne Kempelen | 2024-09-21 19:58:47 | 22 | 16 | 0 | 0 | 13
tales.org.ua | 2024-09-24 05:07:22 | COIBot | 185.68.16.90 | | Liubomyr Ch | 2024-09-20 06:44:58 | 87 | 72 | 0 | 0 | 6

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also /recurring requests for repeatedly proposed (and refused) removals.

The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.

for bdodderer.blogspot.com



I am puzzled that my edit was rejected when I cited the link "dodderer.blogspot.com". I see here that a link "\bdodderer\.blogspot\.com\b" is banned; however, there is no such page when I enter this URL. I don't think the link I cited is spam. -- The preceding unsigned comment was added by 210.184.231.8 (talk)

dodderer.blogspot.com is blacklisted here. It was blacklisted on the basis of this report.
"\bdodderer\.blogspot\.com\b" is the regex used to blacklist dodderer.blogspot.com. --Jorunn 16:19, 10 February 2010 (UTC)

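To illustrate the point about the regex: "\b" is a word-boundary anchor in regular-expression syntax, not a literal letter "b", and "\." is a literal dot. The following is a minimal sketch in Python, assuming the entry is simply searched for anywhere in the URL; the real extension wraps entries in a larger pattern, so the helper name `entry_matches` and the test URLs are illustrative only.

```python
import re

# The blacklist entry quoted above. "\b" is a word-boundary anchor,
# not a literal letter "b"; "\." matches a literal dot.
ENTRY = re.compile(r"\bdodderer\.blogspot\.com\b", re.IGNORECASE)


def entry_matches(url: str) -> bool:
    """Return True if the entry matches anywhere in the URL (simplified check)."""
    return ENTRY.search(url) is not None


print(entry_matches("http://dodderer.blogspot.com/"))        # True
print(entry_matches("http://bdodderer.blogspot.com/"))       # False
print(entry_matches("http://dodderer.blogspot.community/"))  # False
```

Under this reading, the entry blocks dodderer.blogspot.com itself, and a hostname such as "bdodderer.blogspot.com" is not what the entry refers to.
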
But I wonder, is it spam? It is a page about education in Chinese. 210.184.231.8 17:27, 10 February 2010 (UTC)

"Spam" in Spam blacklist relates to the way the link gets inserted in the articles on the wikis and by whom, not to the content of the websites. According to the report, this link was inserted in a bunch of articles on zh. and en.wikipedia by IPs from two Hong Kong IP ranges, to one of which your IP also belongs. Adding links to websites you own or edit is considered spamming. --Jorunn 22:03, 11 February 2010 (UTC)

Thanks for your explanation first. I had confused the meaning with what "spam" means in my email inbox.

Do other Hong Kong people also refer to this page? I would say that is not surprising, since this page is quite well known in the Hong Kong accounting and law fields. For example, it is recommended by the Hong Kong Institute of Certified Public Accountants, the statutory body of the Hong Kong accounting profession, and by some other bodies. See the last page of the institute's newsletter, http://app1.hkicpa.org.hk/APLUS/0901/Jan09_blogging.pdf, where you may find this page listed alongside a "law-maker" (Paul Chan is the representative of the accounting industry in the Hong Kong Legislative Council; Deloitte and Price are called "big 4 firms" in the accounting field).

So I am not surprised that others use the same source as I do when talking about something related to accounting and law (and their education), e.g. Juris Doctor. 210.184.231.8 08:50, 14 February 2010 (UTC)

correiodamanha.pt



It's a daily newspaper; see en:Correio da Manhã. Maybe it's not the best example of a newspaper (like The Sun in the U.K.), but it is one of the best-selling newspapers in Portugal. It was reported in Talk:Spam_blacklist/Archives/2009-09#Xaman79_spam and added to the blacklist, although someone noted that it was a newspaper. Mosca 11:32, 11 February 2010 (UTC)

By the way, I only noticed now that they are redirecting correiodamanhã.pt to http://www.cmjornal.xl.pt/, but on the Portuguese Wikipedia we have a lot of links to that redirect. Mosca 11:44, 11 February 2010 (UTC)

adultwiki.net



I recently added a link in good faith to a model page, and it was subsequently deemed spam. I believed the link to be on topic and filled with relevant content, but this was obviously not so. I would like the link to be reconsidered, and if necessary I shall not repost it. 81.138.124.117 16:18, 11 February 2010 (UTC)

Troubleshooting and problems

This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).

None currently

Discussion

This section is for discussion of Spam blacklist issues among other users.

New regex behaviour

Contrary to the warnings about ^ and $ we're probably all familiar with, those will soon match against the start & end of URLs. This change was introduced in rev:60869. I don't anticipate we should change old regexes, and this won't be extraordinarily useful for most use cases, but we should be aware of this for the future. -- Mike.lifeguard | @en.wb 17:02, 10 January 2010 (UTC)

Hi!
Thanks for that information, but ^ will still be useless, because it would result in regexps like /http:\/\/^example.org/. This change affects $ only. -- seth 10:47, 31 January 2010 (UTC)
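
The point about ^ can be checked with a small sketch. It assumes, as in the regexp quoted above, that each entry is appended to an "http://" prefix; the `wrap` helper and that exact prefix are hypothetical stand-ins, and the real extension's wrapper may differ.

```python
import re


def wrap(entry: str) -> re.Pattern:
    """Hypothetical wrapper: put a scheme prefix in front of a blacklist entry."""
    return re.compile(r"http://" + entry, re.IGNORECASE)


# "^" now sits after the prefix, so it can never be at the start of the string.
caret = wrap(r"^example\.org")
# "$" still anchors the entry to the end of the URL.
dollar = wrap(r"example\.org$")

print(caret.search("http://example.org"))                   # None
print(dollar.search("http://example.org") is not None)      # True
print(dollar.search("http://example.org/page") is not None) # False
```

With the prefix in front, an entry starting with ^ matches nothing at all, while an entry ending in $ matches only URLs that end right after the domain.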