Additions to the database and feedWatcher

feedWatcher: added the parsing of HTML feeds to get the title tag in the
    <head> area; new database fields relating to the copyright check
    done, and why the feed was allowed in if done so manually; added dry
    run mode; changed the way -load and -delete work so each can be
    given URLs on the command line; starting to report settings at start
    time (needs work); -load and -delete not allowed together; more
    logging; addition of a _debug function; enhancement of reportFeed to
    show one feed and a summary of relevant details (more useful than
    dumping the entire database this way); added getHTMLTitle for
    parsing out the HTML title; enhanced checkCopyright to get a reason
    if in manual mode and a feed is allowed in; needs a lot of
    clean-up!

feedWatcher.{html,json,mkd,opml,pdf}: various reports.

feedWatcher_3.tpl: For making Markdown which is turned into PDF.
    'Licence' becomes 'Copyright'

feedWatcher_5.tpl: for dumping all the URLs in the database
    & regenerating everything

feedWatcher_schema.sql: new fields added
This commit is contained in:
Dave Morriss
2023-01-14 23:13:49 +00:00
parent 01ec2cf92f
commit db39655199
10 changed files with 1047 additions and 393 deletions

View File

@@ -29,6 +29,8 @@ CREATE TABLE urls (
link varchar(1024),
image varchar(1024),
copyright varchar(80),
check_type varchar(10) DEFAULT 'none',
reason_accepted text,
generator varchar(80),
language varchar(40),
parent_id integer
@@ -131,6 +133,8 @@ CREATE VIEW all_episodes AS
urls.link as urls_link,
urls.image as urls_image,
urls.copyright as urls_copyright,
urls.check_type as urls_check_type,
urls.reason_accepted as urls_reason_accepted,
urls.generator as urls_generator,
urls.language as urls_language,
urls.parent_id as urls_parent_id,