Wikipedia:Link rot/URL change requests/Archives/2021/December

From Wikipedia, the free encyclopedia

who.int

Some links to www.who.int are now broken after the site was updated (on this page, for example). Can these links be updated? Jarble (talk) 01:28, 7 December 2021 (UTC)

The example link http://www.who.int/mental_health/policy/services/Belize.pdf appears to work. -- GreenC 03:29, 7 December 2021 (UTC)
@GreenC: But the other example link (here) is still broken. Jarble (talk) 16:25, 7 December 2021 (UTC)
Do you know if it moved a new URL, or is it just some links are now dead? -- GreenC 17:57, 7 December 2021 (UTC)
@GreenC: I found it on the Internet Archive, but I don't know if it moved to another URL. Jarble (talk) 00:50, 8 December 2021 (UTC)
Jarble OK WHO links exist in about 5,000 pages. Most are working, but some like https://www.who.int/gho/publications/world_health_statistics/2016/Annex_B/en/ are dead with status 200 ie. a soft-404. IABot would not be able to detect those. This would be a good fit for custom work. I'll take a look once finished the above project with Billboard a couple days. -- GreenC 03:33, 8 December 2021 (UTC)
Interesting the PDF content is still there, but they added a JavaScript redirect (window.location.replace("https://www.who.int/home/cms-decommissioning");) to indicate it has been decommissioned, so you see the original page flash by as it loads then redirects away. Argh. There is actually a new page for the content, for example old is now at new (sort of - the old page had more intent). Determining the new from the old is possible, in this case, but would not be in this case. It could be a messy and error prone job to migrate, why they probably didn't do it themselves with that message basically saying search for it yourself maybe you'll find it. So I think any page that has the JavaScript redirect to cms-decommissioning should be treated as dead and archives added. It will be more clear once I start running can check the logs to see what the data shows. -- GreenC 04:12, 8 December 2021 (UTC)

Results

  • Checked 6,403 pages
  • Edited 2,635 pages
  • Added 3,181 archive URLs for decommisioned URLs Example
  • Added 761 archive URLs for soft-404s Example
    • Soft-404 regex (see inline comment)

@Jarble: completed. Thanks for the notice. -- GreenC 05:06, 13 December 2021 (UTC)

Gartner Newsroom

There are ~120 articles that link to www.gartner.com/newsroom/id/<number>. These URLs all redirect to https://gartner.com/en/newsroom, making them functionally dead. * Pppery * it has begun... 18:33, 9 December 2021 (UTC)

OK. After who.int is completed (days) -- GreenC 06:24, 10 December 2021 (UTC)

Results

  • Articles checked: 668
  • Articles edited: 265
  • New archive URL added: 228
  • Existing |url-status=live changed to |url-status=dead: 54
  • Add {{dead link}}: 64
  • Swap old URL with new redirected URL: 24

It includes other soft404 types discovered, in addition to newsroom/id/<number> .. any problems let me know, thanks. @Pppery: -- GreenC 02:00, 15 December 2021 (UTC)

Official Charts (Germany)

Hello. There are a lot of articles using the old website URL officialcharts.de that is now offiziellecharts.de. The problem is, there is no clear swapping between the URLs. For example, this is now that. Therefore, I would like to request an archive copy of many of these links as possible in the search results above while tagging deadlink to the ones that can't be archived. These account for 2300+ links. The dead links would need to be manually swapped over to the new URL. As that would be time-consuming, I would like to see how many links can be salvaged. Thanks! --MrLinkinPark333 (talk) 22:01, 9 December 2021 (UTC)

Ok after Gartern Newsroom is done days to week+ -- GreenC 06:25, 10 December 2021 (UTC)

Results

  • New archive link: 732
  • Switch existing |url-status=live to dead: 64
  • Add {{dead link}}: 787

@MrLinkinPark333: results are in looks like it was about 50/50 archive vs. dead link. -- GreenC 02:55, 17 December 2021 (UTC)

@GreenC: Not too bad. Thanks! --MrLinkinPark333 (talk) 02:58, 17 December 2021 (UTC)
Wikipedia:Link rot/cases/officialcharts.de has the list in case you ever need it. -- GreenC 03:07, 17 December 2021 (UTC)

Fossilworks

Moved from WP:BOTREQ

Fossilworks, a website that acts as a mirror to the Paleobiology Database, has been down for over a month now, with no sign that the website will come back online. We currently have over 7,000 links to this website per fossilworks.org HTTPS links HTTP links Entries on the Paleobiology database are stored in a numerical string, shared for fossilworks and the paleobiology database website. For example the Killer whale on fossilworks is: http://fossilworks.org/bridge.pl?a=taxonInfo&taxon_no=64541, which is equivalent to the Paleobiology database entry https://paleobiodb.org/classic/basicTaxonInfo?taxon_no=64541. Would it be possible to create a bot to automatically take a fossilworks url and convert the entry to the working Paleobiology Database url? Hemiauchenia (talk) 01:14, 16 December 2021 (UTC) GreenC 01:26, 16 December 2021 (UTC)

Hemiauchenia, I can work on this. -- GreenC 01:28, 16 December 2021 (UTC)
I should have mentioned this initially, but there are two types of entry, the first I have already mentioned is the taxon entry, the second is the collections entry, which has a similar but different set numerical strings. For instance, the fossilworks entry for the "Rio Fras" collection is http://fossilworks.org/bridge.pl?a=collectionSearch&collection_no=176666 while the equivalent url in the Paleobiology database is https://paleobiodb.org/classic/displayCollResults?a=basicCollectionSearch&collection_no=176666 . My apologies for not mentioning that to begin with. Hemiauchenia (talk) 01:32, 16 December 2021 (UTC)
No problem. Will log everything that doesn't fit these two patterns, will let you know; we can work out what it should be if anything (otherwise archive). Might be a few days before the bot is tooled and test dry runs. -- GreenC 01:36, 16 December 2021 (UTC)
@Hemiauchenia: please see discussions Template_talk:Fossilworks. Unclear the site is dead eg. [1] - I believe there will need to be migrations for example http://fossilworks.org/bridge.pl?a=taxonInfo&taxon_no=64541 is now http://www.fossilworks.org/cgi-bin/bridge.pl?a=taxonInfo&taxon_no=64541 .. we need to create migration rules I started a discussion on the page -- GreenC 17:16, 16 December 2021 (UTC)
@Hemiauchenia: done. All former http://fossilworks.org URLs should be working now, migrated to the new form. There were about 10,000 URLs in 7,300 articles. -- GreenC 04:10, 18 December 2021 (UTC)