Cleaning up an import

Once you have imported a piece of legislation, it needs to be cleaned up before we publish it.

Content and structure are more important than matching exactly what the text looks like in the gazette (presentation).

  1. In the imported document, click Show source to see the PDF (if available) on the right-hand side of the screen.

  2. Remove superfluous text: The title, number, year, place, date, etc. should all have been captured as metadata already and should show on the automatically generated cover page (the top of the document in View mode). Remove them from the text, along with the Table of Contents, which is also generated automatically (on the left-hand side). Click Save draft once this is done.

  3. Check the structure: Does the automatically generated Table of Contents reflect all the Chapter / Part / Section headings and numbers that are in the PDF? (Crossheadings are not reflected in the Table of Contents.)

    • If there are any differences, investigate and resolve them before doing anything else.

    • Click Save draft once this is done.

  4. Back to the top: Use the Preface, Long title and Preamble keywords as necessary. Save draft.

  5. Run 'Analysis | References' and 'Analysis | Italicised terms'. This will automatically link references and italicise certain non-English terms. Save draft.

  6. In general:

    • Do correct OCR (e.g. 0 instead of O) and whitespace (e.g. TheBoard instead of The Board) errors.

    • Don't correct spelling, grammar, punctuation, numbering, or cross-references; rather leave a comment starting with 'note:', e.g. 'note: numbering as in original'. (There's no need to actively look for these types of errors, but it's fine for you to note them when you do notice them.)

    • If text was incorrectly linked or italicised automatically earlier, remove the link or the italics manually (not the text itself). See Working with links and Working with italicised terms.

    • Save often :)

  7. Structure:

  8. Sections:

    • Check the first and last words of paragraphs against the PDF.

    • Check line breaks: should two paragraphs be joined together, or one paragraph be split into two?

    • Check indents, especially of dangling text after the end of a list of paragraphs.

    • Briefly scan the text for OCR errors – but no need to check word for word.

  9. You may also need to insert a table, insert an image, or work with Schedules.

  10. Before submitting the task for review:

  11. Submitting the task for review:

    • On the tasks page there is an "Add a comment" button if you need to alert the reviewer to anything before submitting for review.

    • Submit for review.
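
The OCR and whitespace checks in step 6 can be partly supported by a small script run over a plain-text copy of the document. The sketch below is not part of the editor; the function name flag_suspects and both heuristics are assumptions made for illustration. It only flags candidates for a person to check against the PDF, and it will produce false positives (names such as McDonald, for example).

```python
import re

# Hypothetical heuristics for illustration only; adjust them to your documents.
ORDINAL = re.compile(r"\d+(?:st|nd|rd|th)")      # e.g. 1st, 10th: legitimate, not OCR errors
JOINED = re.compile(r"[a-z][A-Z][a-z]")          # e.g. the "eBo" in TheBoard


def flag_suspects(text):
    """Return (line_number, token, reason) tuples for a human to review."""
    suspects = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for token in re.findall(r"[A-Za-z0-9]+", line):
            if ORDINAL.fullmatch(token):
                continue
            letters = sum(ch.isalpha() for ch in token)
            # A 0 or 1 inside a token that is otherwise mostly letters often
            # means O/0 or l/1 confusion. Requiring two letters skips
            # section numbers such as 10A.
            if (letters >= 2 and re.search(r"[01]", token)
                    and re.fullmatch(r"[A-Za-z01]+", token)):
                suspects.append((line_no, token, "digit inside word"))
            # lower-UPPER-lower in one token suggests a missing space.
            elif JOINED.search(token):
                suspects.append((line_no, token, "missing space"))
    return suspects


sample = "TheBoard shall meet.\nThe 0fficer may act under section 10A on the 1st day."
for line_no, token, reason in flag_suspects(sample):
    print(f"line {line_no}: {token} ({reason})")
```

Anything the script flags still has to be checked against the PDF: correct it only if it is a genuine OCR or whitespace error, and leave a 'note:' comment for anything that appears that way in the original.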
