1 |
< |
{\rtf1\ansi\ansicpg1252\cocoartf2578 |
1 |
> |
{\rtf1\ansi\ansicpg1252\cocoartf2639 |
2 |
|
\cocoatextscaling0\cocoaplatform0{\fonttbl\f0\fswiss\fcharset0 Helvetica;} |
3 |
|
{\colortbl;\red255\green255\blue255;} |
4 |
|
{\*\expandedcolortbl;;} |
5 |
< |
\margl1440\margr1440\vieww12600\viewh5580\viewkind0 |
5 |
> |
\margl1440\margr1440\vieww12600\viewh6980\viewkind0 |
6 |
|
\deftab720 |
7 |
|
\pard\pardeftab720\qc\partightenfactor0 |
8 |
|
|
9 |
|
\f0\fs28 \cf0 \ul \ulc0 ValExtLinks to-do list\ulnone \ |
10 |
|
\ |
11 |
|
\pard\pardeftab720\partightenfactor0 |
12 |
< |
\cf0 - Add argument --only-200-ok which doesn't treat the other entries in my OK_CODES array as OK. This may allow us to catch additional problems.\ |
12 |
> |
\cf0 - Don't rely on extlinks table to find external links, as only links within square brackets are tracked there. Use a Pywikibot script like find_external_images.py to find \ul all\ulnone external links. Consider porting entire script to Python. Will need to make sure to ignore links marked with <nowiki>.\ |
13 |
|
\ |
14 |
< |
- Audit exceptions list so it doesn't acquire cruft: mention URLs in list that aren't in extlinks.csv, and URLs that were found but returned different codes (or were OK instead of NG). This can be done with a parallel array of 0/1s that track whether each exception in the exceptions list was used.\ |
14 |
> |
- Add argument --only-200-ok which doesn't treat the other entries in my OK_CODES array as OK. This may allow us to catch additional problems.\ |
15 |
|
\ |
16 |
|
- Automatically highlight pages that do not have archive.org or archive.is snapshots (see {\field{\*\fldinst{HYPERLINK "http://mementoweb.org/depot/native/archiveis/"}}{\fldrslt here}} for the archive.is API). When checking for snapshots, we shouldn't just ask archive.org if it has one.\ |
17 |
|
\ |