Ignore:
Timestamp:
Feb 2, 2021, 9:10:39 PM (4 years ago)
Author:
iritscen
Message:

ValExtLinks: Changed --suggest-snapshots to --suggest-snapshots-ng and added --suggest-snapshots-ok for getting snapshot URLs for all good links. This can be used to confirm that sites are backed up in case they die in the future, but note that this argument will take hours to run due to the API rate limit. Added awareness of API rate limit so Archive.org will not start blocking script.

File:
1 edited

Legend:

Unmodified
Added
Removed
  • Validate External Links/Sample files/ValExtLinks report.htm

    r1144 r1147  
    55<body>
    66<h2>Validate External Links report</h2>
    7 <h3>generated Sun, 06 Sep 2020 17:19:16 GMT<br />
    8 from data of Sun, 06 Sep 2020 14:20:01 GMT
     7<h3>generated Tue, 02 Feb 2021 17:37:31 GMT<br />
     8from data of Tue, 02 Feb 2021 15:20:01 GMT
    99<br />
    1010script by Iritscen (<a href="http://iritscen.oni2.net" target="_blank">contact</a>)</h3><br />
     
    1414Downloading list of reporting exceptions from https://wiki.oni2.net/Validate_External_Links/Exceptions...
    1515 success.<br />
    16 Found 3134 links to process.<br />
     16Found 3191 links to process.<br />
    1717<br />
    1818<h3>Config</h3>
    1919Links to consider:
    20 3134<br />
     203191<br />
    2121Site query timeout: 10 seconds<br />
    2222Show OK links:
     
    2424Take screenshots:
    2525No<br />
    26 Suggest archive.org snapshots:
     26Suggest archive.org snapshots for NG pages:
    2727Yes<br />
     28Suggest archive.org snapshots for OK pages:
     29No<br />
    2830Ignore slash-adding redirects:
    2931Yes<br />
     
    5052<h3>Links</h3>
    5153<table>
    52 <tr><td style="white-space:nowrap">NG (000-28)</td><td align="right">page</td><td><a href="http://www.okita.com/home.htm" target="_blank">http://www.okita.com/home.htm</a></td></tr>
    53 <tr><td colspan="2" align="right">linked from</td><td><a href="https://wiki.oni2.net/Credits" target="_blank">Credits</a></td></tr>
    54 <tr><td colspan="2" align="right">IA suggests</td><td><a href="http://web.archive.org/web/20060702100621/http://www.okita.com/home.htm" target="_blank">http://web.archive.org/web/20060702100621/http://www.okita.com/home.htm</a></td></tr><tr><td>&nbsp;</td></tr>
    55 <tr><td style="white-space:nowrap">NG (500)</td><td align="right">page</td><td><a href="https://bytes.com/topic/visual-basic-net/answers/630751-how-do-i-pass-command-line-another-instance-my-application-already-running" target="_blank">https://bytes.com/topic/visual-basic-net/answers/630751-how-do-i-pass-command-line-another-instance-my-application-already-running</a></td></tr>
    56 <tr><td colspan="2" align="right">linked from</td><td><a href="https://wiki.oni2.net/Talk:Mod_Tool/OniTools_addon" target="_blank">Talk:Mod_Tool/OniTools_addon</a></td></tr>
    57 <tr><td colspan="2" align="right">IA suggests</td><td><a href="http://web.archive.org/web/20150926205808/http://bytes.com/topic/visual-basic-net/answers/630751-how-do-i-pass-command-line-another-instance-my-application-already-running" target="_blank">http://web.archive.org/web/20150926205808/http://bytes.com/topic/visual-basic-net/answers/630751-how-do-i-pass-command-line-another-instance-my-application-already-running</a></td></tr><tr><td>&nbsp;</td></tr>
     54<tr><td style="white-space:nowrap">NG (000-28)</td><td align="right">page</td><td><a href="https://www.tagesspiegel.de/wissen/patienten-ueber-80-jahre-werden-nicht-mehr-beatmet-deutsche-katastrophenaerzte-verfassen-alarmbericht-ueber-strassburg/25682596.html" target="_blank">https://www.tagesspiegel.de/wissen/patienten-ueber-80-jahre-werden-nicht-mehr-beatmet-deutsche-katastrophenaerzte-verfassen-alarmbericht-ueber-strassburg/25682596.html</a></td></tr>
     55<tr><td colspan="2" align="right">linked from</td><td><a href="https://wiki.oni2.net/User:Paradox-01/brain_dump" target="_blank">User:Paradox-01/brain_dump</a></td></tr>
     56<tr><td colspan="2" align="right">IA suggests</td><td><a href="http://web.archive.org/web/20210125232036/https://www.tagesspiegel.de/wissen/patienten-ueber-80-jahre-werden-nicht-mehr-beatmet-deutsche-katastrophenaerzte-verfassen-alarmbericht-ueber-strassburg/25682596.html" target="_blank">http://web.archive.org/web/20210125232036/https://www.tagesspiegel.de/wissen/patienten-ueber-80-jahre-werden-nicht-mehr-beatmet-deutsche-katastrophenaerzte-verfassen-alarmbericht-ueber-strassburg/25682596.html</a></td></tr><tr><td>&nbsp;</td></tr>
     57<tr><td style="white-space:nowrap">NG (000-28)</td><td align="right">page</td><td><a href="http://www.aegisub.org/" target="_blank">http://www.aegisub.org/</a></td></tr>
     58<tr><td colspan="2" align="right">linked from</td><td><a href="https://wiki.oni2.net/Capturing_game_footage" target="_blank">Capturing_game_footage</a></td></tr>
     59<tr><td colspan="2" align="right">IA suggests</td><td><a href="http://web.archive.org/web/20201215063947/http://www.aegisub.org" target="_blank">http://web.archive.org/web/20201215063947/http://www.aegisub.org</a></td></tr><tr><td>&nbsp;</td></tr>
     60<tr><td style="white-space:nowrap">RD (301)</td><td align="right">page</td><td><a href="http://www.flickr.com/photos/34154526@N07/4674281745/" target="_blank">http://www.flickr.com/photos/34154526@N07/4674281745/</a></td></tr>
     61<tr><td colspan="2" align="right">linked from</td><td><a href="https://wiki.oni2.net/Oni2:Slaves_of_War/Neo-Biology" target="_blank">Oni2:Slaves_of_War/Neo-Biology</a></td></tr>
     62<tr><td colspan="2" align="right">Server suggests</td><td><a href="https://www.flickr.com:443/photos/34154526@N07/4674281745/" target="_blank">https://www.flickr.com:443/photos/34154526@N07/4674281745/</a></td></tr><tr><td>&nbsp;</td></tr>
    5863</table><br /><br />
    59 <h3><span id="summary">Summary (19 min. 9 sec. elapsed)</span></h3>
    60 I finished processing 3134 of 3134 links (there were 703 file links and 2204 page links).<br />
    61 3134 processed links:<br />
    62 - 9 links could not be processed<br />
    63 - 218 Archive.org links were not checked<br />
    64 - 18 processed links had issues<br />
    65 &nbsp;&nbsp;(excepted 16 links from report)<br />
    66 - 2889 processed links were OK<br />
    67 &nbsp;&nbsp;(counted 673 trivial redirections as OK)<br />
    68 9 link errors (see <a href="ValExtLinks report.rtf" target="_blank">RTF</a> or <a href="ValExtLinks report.txt" target="_blank">TXT</a> report for specific links):<br />
     64<h3><span id="summary">Summary (21 min. 9 sec. elapsed)</span></h3>
     65I finished processing 3191 of 3191 links (there were 705 file links and 2216 page links).<br />
     663191 processed links:<br />
     67- 11 links could not be processed<br />
     68- 262 Archive.org links were not checked<br />
     69- 22 processed links had issues<br />
     70&nbsp;&nbsp;(excepted 19 links from report)<br />
     71- 2896 processed links were OK<br />
     72&nbsp;&nbsp;(counted 665 trivial redirections as OK)<br />
     7311 link errors (see <a href="ValExtLinks report.rtf" target="_blank">RTF</a> or <a href="ValExtLinks report.txt" target="_blank">TXT</a> report for specific links):<br />
    6974- 2 missing/unknown namespaces<br />
    7075- 6 links on JavaScript pages<br />
    71 - 1 unknown URL suffix<br />
    72 16 link problems excepted (see <a href="ValExtLinks report.rtf" target="_blank">RTF</a> or <a href="ValExtLinks report.txt" target="_blank">TXT</a> report for specific links):<br />
    73 - 9/11 NG links<br />
    74 - 4/4 redirections<br />
    75 - 2/2 external internal links<br />
     76- 3 unknown response codes<br />
     7719 link problems excepted (see <a href="ValExtLinks report.rtf" target="_blank">RTF</a> or <a href="ValExtLinks report.txt" target="_blank">TXT</a> report for specific links):<br />
     78- 10/12 NG links<br />
     79- 5/6 redirections<br />
     80- 3/3 external internal links<br />
    7681- 1/1 potential intrawiki link<br />
    77 2 link issues:<br />
     823 link issues:<br />
    7883- 2 NG links<br />
     84- 1 redirection<br />
    7985ValExtLinks says goodbye.<br />
    8086</body>
Note: See TracChangeset for help on using the changeset viewer.