Wikipedia:Link rot/URL change requests/Archives/2021/August

seapower-digital.com hijacked by gambling site; references to be nullified

Would someone please run a bot through and nullify any reference that points to seapower-digital.com as that domain has been hijacked and is now a gambling site.

billinghurst sDrewth 13:00, 14 August 2021 (UTC)

Domain is globally blacklisted due to spambot activity, and has been locally whitelisted. Please ping me or ask another admin to remove it from the whitelist once the fixes have been undertaken. Thanks. — billinghurst sDrewth 03:23, 15 August 2021 (UTC)

Seapower is done. Basically 19 articles, each with a CS1|2 cite now marked |url-status=unfit. I can blacklist it in the IABot database so it propagates out to other wiki sites, but it will be limited to only adding archives, not flipping to |url-status=unfit, a limitation of IABot, and WaybackMedic is presently limited to Enwiki and Commons. -- GreenC 04:29, 15 August 2021 (UTC)

Thanks. Don't worry about the xwiki, there were only a few, and they were done manually. It is blacklisted due to the spam, so it should be managed, and we can review if necessary. — billinghurst sDrewth 12:24, 16 August 2021 (UTC)

This covers many references to the fringe Journal of Cosmology. Headbomb {t · c · p · b} 03:22, 15 August 2021 (UTC)

Given that Journal of Cosmology was effectively a predatory journal, surely they shouldn't be directly cited anyway? Hemiauchenia (talk) 03:35, 15 August 2021 (UTC)
It's often cited as an example of nonsense e.g. [1]. Headbomb {t · c · p · b} 05:08, 15 August 2021 (UTC)
Passing on citation deletion, bot not designed, probably context sensitive. But will archive dead links. -- GreenC 04:52, 15 August 2021 (UTC)

It is done. Added archives to 23 citations in 11 articles. There are so few, deleting manually would not take long. I also updated IABot so it will archive on 120+ wikis (set blacklisted since the site is returning soft-404 200s). -- GreenC 01:51, 16 August 2021 (UTC)

On 56 globally including enwiki. -- GreenC 02:06, 16 August 2021 (UTC)
@GreenC: Saw the bot go to town on those links. Thanks. There's also the affiliated/mirror http://www.cosmology.com website, but this one isn't down (yet?). Might be a good idea to archive that one too. Headbomb {t · c · p · b} 03:05, 16 August 2021 (UTC)
@GreenC: Any updates? Headbomb {t · c · p · b} 22:09, 26 August 2021 (UTC)
On adding archive URLs for a domain not yet dead? The bot isn't really set up for that; nobody has requested it before, and there are some complications with square and bare URLs. The links are probably already archived at Wayback, so when the domain dies, archives can easily be added on wiki. -- GreenC 01:26, 27 August 2021 (UTC)

news.asiaone.com subdomain is dead

Please update IABot to list the news.asiaone.com subdomain as dead. The main site is still live. What they did was shift all content under that subdomain to www.asiaone.com and change their information architecture completely, e.g. http://news.asiaone.com/news/showbiz/13-year-old-embarks-singing-path -> https://www.asiaone.com/entertainment/13-year-old-embarks-singing-path. Some of the content didn't survive the move, e.g. http://news.asiaone.com/news/showbiz/bigbang-release-world-tour-movie (which somehow is present on their staging site: https://stage-a1.asiaone.com/entertainment/bigbang-release-world-tour-movie). – robertsky (talk) 17:30, 26 August 2021 (UTC)

@Robertsky: theoretically it might be possible to determine the new URL and do a move rather than archive, but anecdotally, looking at a few links, this does not look like a well-maintained site, and the chances of running into more problems are high. So I think the best thing would be to process the entire domain looking for 404s and soft-404s (redirects to the home page etc.), including the news sub-domain. -- GreenC 19:03, 26 August 2021 (UTC)
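For illustration, the 404 / soft-404 triage described above could be sketched roughly as below. This is only a sketch under assumptions, not how IABot or WaybackMedic actually work: the function name `classify_link`, the three labels, and the rule "a deep link that ends on the site root counts as a soft-404" are all hypothetical. The status code and final URL are assumed to come from a separate check that has already followed redirects.

```python
from urllib.parse import urlsplit


def classify_link(requested_url: str, final_url: str, status: int) -> str:
    """Label one checked link as 'dead', 'soft-404', or 'live'.

    requested_url: the URL as cited on wiki.
    final_url:     where the request ended up after following redirects.
    status:        HTTP status code of the final response.
    """
    # Assumption: 403, 404 and 410 are all treated as hard failures.
    if status in (403, 404, 410):
        return "dead"

    requested = urlsplit(requested_url)
    final = urlsplit(final_url)

    # Assumption: a request for a deep path that lands on the bare
    # home page with a 200 is a soft-404 (redirect to the home page).
    requested_deep = requested.path.strip("/") != ""
    landed_on_home = final.path.strip("/") == "" and final.query == ""
    if status == 200 and requested_deep and landed_on_home:
        return "soft-404"

    return "live"
```

A link that redirects to a different deep path, such as the showbiz to entertainment move, comes out as "live" here, so it would still need the new URL verified separately.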
GreenC, I actually have the list of new URLs for the /news/showbiz/ links that are on enwiki, after being alerted by Justanothersgwikieditor. I was looking into moving these URLs, but after a search on the subdomain here, there are a lot more links whose location in the new permalink structure I have yet to determine. – robertsky (talk) 19:10, 26 August 2021 (UTC)
Ok great! When you are ready, and if you want help making the changes, let me know; the bot can take a map as input. For any not on the map, it can add archives. If you edit yourself, that is fine. There are three basic types: CS1|2 templates, square links, bare links. It's usually a good idea to verify the new URL is working with a header check, including following redirects; you can't assume anything about the remote site working as it should, including soft-404s. If the old URL already has an archive URL in place, you would either remove it or keep it, depending on your philosophy (there are arguments either way). Currently I switch the archive URL to account for the new URL (assuming a new archive URL exists). -- GreenC 19:34, 26 August 2021 (UTC)
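As a rough sketch of what "taking a map as input" might look like for the three link types mentioned above (template parameters, square links, bare links), the snippet below swaps any URL found in the wikitext that appears as a key in the map. The name `move_urls`, the regular expression, and the trailing-punctuation rule are assumptions made for this example; the actual bot's logic is not shown in this thread and is likely more careful.

```python
import re

# Assumption: a URL runs until whitespace or a character that normally
# ends it in wikitext (closing bracket, pipe, closing brace, angle
# brackets, double quote).
URL_RE = re.compile(r"https?://[^\s\]\|\}<>\"]+")


def move_urls(wikitext: str, url_map: dict) -> tuple:
    """Replace every mapped URL in wikitext; return (new_text, count)."""
    moved = 0

    def swap(match):
        nonlocal moved
        raw = match.group(0)
        # A bare URL at the end of a sentence picks up the punctuation,
        # so split it off before the lookup and put it back afterwards.
        url = raw.rstrip(".,;:")
        trailing = raw[len(url):]
        new_url = url_map.get(url)
        if new_url is None:
            return raw  # not on the map: leave untouched
        moved += 1
        return new_url + trailing

    return URL_RE.sub(swap, wikitext), moved
```

Because the pattern matches from the leftmost `http`, an archive URL that embeds the old URL is matched as one whole string, is not found in the map, and so stays as it is. Unmapped URLs are likewise left alone for a later archive pass.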
GreenC, I have gone through all the links on en.wiki (see Excel file) and run them through Screaming Frog. For rows without new URLs or indicated as 403/404, you can set url-status=dead on the existing URL. The remaining ones are 200/301. For old URLs that already have an archive URL, I don't have a preference. – robertsky (talk) 21:37, 26 August 2021 (UTC)
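The rule stated above for the spreadsheet rows could be expressed roughly as follows. This is a hedged sketch: the row layout `(old_url, new_url, status)` and the name `plan_actions` are assumptions for the example, since the actual column layout of the Excel file is not shown here.

```python
def plan_actions(rows):
    """Split crawl results into a move map and a list of dead URLs.

    rows: iterable of (old_url, new_url, status) tuples, where new_url
          may be empty or None and status is the HTTP code reported
          for the new URL (hypothetical layout).
    """
    moves = {}
    dead = []
    for old_url, new_url, status in rows:
        # No new URL, or the new URL returned 403/404: mark the
        # existing URL as dead (url-status=dead).
        if not new_url or status in (403, 404):
            dead.append(old_url)
        else:
            # Remaining rows (200/301 in this data set) are moved.
            moves[old_url] = new_url
    return moves, dead
```

The resulting `moves` dictionary is the kind of old-to-new map a bot could take as input, while the `dead` list holds the URLs left for archiving or a dead-link tag.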

@Robertsky: The bot ran (August 27 2021 starting at 16:11 GMT), example diff. I decided to leave existing archive URLs as-is. It edited around 1,000 pages; 877 links were moved via the Excel map; about 600 new archive URLs added; about 99 {{dead link}} added; various other stuff. If you see anything it missed or other problems let me know. -- GreenC 00:18, 28 August 2021 (UTC)

GreenC, thanks! will circle back if there's an issue. :) – robertsky (talk) 06:47, 28 August 2021 (UTC)