This repository was originally called hpr-admin but has been cleaned up and rebuilt as hpr-tools. It contains the various tools used behind the scenes to prepare shows for release, upload them to the Internet Archive and other things.
Go to file
Dave Morriss 31eb5d200f Updates for missing asset "repair"
InternetArchive/recover_transcripts: Bash script to be run on 'borg'
    which collects files missing on the IA ready for upload as part of
    the missing asset repair process.

InternetArchive/repair_assets: Bash script to take assets from the IA
    (after they had been repaired on 'borg') and copy them to the HPR
    server for the notes to access. The local machine, where this was
    run, was used to store files being uploaded. The planned script to
    modify the notes to reflect the new file locations was never
    finished. Notes were edited with Vim using a few macros.

InternetArchive/repair_item: Bash script which is best run on 'borg',
    which repairs an IA item by comparing the files on the IA with the
    files on 'borg' (or a local machine). These files are either in
    '/data/IA/uploads/' or in the temporary file hierarchy used by
    'recover_transcripts' (which calls it). Used after a normal IA
    upload to check for and make good any missed file uploads (due to
    timeouts, etc). Also used during asset repairs, but that project is
    now finished.

InternetArchive/snapshot_metadata: Bash script which collects detailed
    metadata from the IA in JSON format and saves it locally (run on
    a local PC). Older shows on the IA often contained derivative files
    which were identified by the script 'view_derivatives'. These files
    were never needed, they were IA artefacts, so can be deleted (see
    the script header for how).

InternetArchive/view_derivatives: Perl script to interpret a file of
    JSON metadata from the IA for an HPR show in order to determine the
    parent-child hierarchy of files where there may be derivatives. We
    don't want IA-generated derivatives, but this process was hard to
    turn off in earlier times. Generates a hierarchical report and
    a list of unwanted derivatives (see 'snapshot_metadata' for more
    details of how this was used).
2024-11-23 22:28:52 +00:00
Comment_system Moved project directories and files to an empty local repo 2024-06-04 16:35:44 +01:00
Community_News Moved project directories and files to an empty local repo 2024-06-04 16:35:44 +01:00
Database Updates since 2024-06-15 2024-07-16 21:39:28 +01:00
FAQ Updates from previous repo 2024-06-14 16:00:04 +01:00
InternetArchive Updates for missing asset "repair" 2024-11-23 22:28:52 +00:00
Link_Checker Updates from previous repo 2024-06-14 16:00:04 +01:00
Miscellaneous Attempting to generate fix_tags.bin 2024-06-14 23:13:45 +01:00
PostgreSQL_Database Updates from previous repo 2024-06-14 16:00:04 +01:00
Show_Submission Moved project directories and files to an empty local repo 2024-06-04 16:35:44 +01:00
.gitignore Moved project directories and files to an empty local repo 2024-06-04 16:35:44 +01:00
LICENSE Initial commit 2024-06-04 15:06:29 +00:00
README.md Initial commit 2024-06-04 15:06:29 +00:00

hpr-tools

This repository was originally called hpr-admin but has been cleaned up and rebuilt as hpr-tools.

It contains the various tools used behind the scenes to prepare shows for release, upload them to the Internet Archive and other things.