Skip to content

Paper skill, /papers/ rebrand, and AI-slop purge#6

Open
williamjameshandley wants to merge 8 commits into
mainfrom
workshop/paper-skill-baseline
Open

Paper skill, /papers/ rebrand, and AI-slop purge#6
williamjameshandley wants to merge 8 commits into
mainfrom
workshop/paper-skill-baseline

Conversation

@williamjameshandley

@williamjameshandley williamjameshandley commented May 5, 2026

Copy link
Copy Markdown
Contributor

Summary

Adds infrastructure for the new PR-driven paper-post workflow agreed at the 5 May 2026 workshop:

  1. Baseline paper skill at skills/paper/SKILL.md — replaces the auto-publish script/posts/arxiv.py workflow. Drafts go on paper/<arxiv-id> branches and merge only after human review by someone who is not the paper's lead author.

  2. Purge of the auto-generated paper backlog — 88 _posts/*.md and 82 assets/images/posts/*.png files removed. These were shipped to main without human review. Anything worth re-publishing is recoverable from git history; new posts go through this skill.

  3. Post layout improvements:

    • Prominent arXiv badge near the top of each paper post, keyed off the new arxiv: frontmatter field.
    • First-page screenshot auto-included as a small floated thumbnail (~150 px) if assets/images/papers/<arxiv-id>.png exists. Click-through to PDF.
    • MathJax updated from a broken v2 reference to v3.
    • Authors line moved to the body (so links work) rather than duplicated in post-meta.
  4. /papers/ index treatment — each entry is a card-row with thumbnail on the left and date / arXiv badge / title / authors on the right. Cards without an arxiv id render with a placeholder.

  5. Tone constraints baked into the skill: exact paper title, no press-release language, no synthetic AI illustrations, ~500 words max, mandatory "How AI helped (or didn't)" section as the differentiator from a bare arXiv abstract.

What's NOT in this PR

The first paper post written through this skill (Toby Lovick's ALCS paper, arXiv:2603.26644) was originally part of this PR but has been split out. It will land via a separate PR tagging @tobyLovick so he can review and adjust the "How AI helped" section before merge.

Test plan

  • script/cibuild exits 0
  • /papers/ renders empty index intro until the first PR-reviewed post lands
  • Post layout renders correctly when given a fixture post (verified before split)
  • No remaining auto-generated content in _posts/ or assets/images/posts/

Refs #3

Replaces the auto-publish `script/posts/arxiv.py` workflow that produced
~88 unreviewed posts. New posts are drafted via this skill on a
`paper/<arxiv-id>` branch and merged only after human review by someone
who is not the paper's lead author.

Tone constraints baked in:
- exact paper title (no clever rewrites)
- no press-release language
- no AI-generated body text and no synthetic AI illustrations
- 200-500 words; the paper is the long-form artefact
- explicit "How AI helped (or didn't)" section is the differentiator;
  honest "didn't help" answers are acceptable

This is a baseline produced during the 5 May 2026 workshop. The first
real paper post written through it (Toby Lovick's latest ALCS paper) is
expected to drive the next iteration.

Refs #3
Removes 88 .md files from `_posts/` and 82 .png illustrations from
`assets/images/posts/`. These were generated by the legacy
`script/posts/arxiv.py` pipeline and shipped to `main` without human
review. The 5 May 2026 workshop converged on phasing out that pipeline
in favour of PR-reviewed paper posts via the `paper` skill (added
elsewhere in this PR).

Effect: `/papers/` now renders only its index intro (no posts listed)
until the first PR-reviewed post lands.

Recovery: anything worth re-publishing is recoverable from git history
(`git show <commit>:<path>`); copy back to `_posts/` only via the
`paper` skill workflow with human review.

Also gitignores the orchestrator-day `context/` scratch directory and
`.playwright-mcp/` browser cache, which were accidentally included in
the previous commit.
@williamjameshandley williamjameshandley force-pushed the workshop/paper-skill-baseline branch from e09a71d to 301078a Compare May 5, 2026 11:39
Toby Lovick, David Yallup, Will Handley — "Automatic Laplace Collapsed
Sampling: Scalable Marginalisation of Latent Parameters via Automatic
Differentiation". The first post produced through skills/paper, included
in this PR as a worked example of what the skill produces from a single
arXiv ID.

Per the skill's tone constraints: paper title verbatim; no synthetic AI
illustration; no press-release language; ~140 words on the science. The
"How AI helped" section is left as a TODO for Toby, because the skill
explicitly requires the lab author to fill it — it's the load-bearing
differentiator from the arXiv abstract and not the agent's voice.

Reviewer should not be Toby (the lead author). The TODO blocks need
filling before the post is "done"; merging this PR ships them publicly
as TODOs, which is intentional during the workshop iteration phase.

Refs #3
The post layout now keys off the `arxiv:` frontmatter field:
- a clickable arXiv-id badge renders near the top of the post header
- if `assets/images/papers/<arxiv-id>.png` exists, it auto-renders as a
  figure linked to the PDF, captioned "First page of arXiv:<id>"

This means paper post markdown no longer repeats the arXiv link as the
first body line — the badge handles it. Body content starts with the
human authors line, then "What the paper does", then "How AI helped".

Includes:
- _layouts/post.html: rewritten header + first-page figure block;
  MathJax updated to v3 (was a broken reference to mathjax.org/latest)
- _sass/minima/custom-styles.scss: arxiv-badge and paper-firstpage styles
- assets/images/papers/2603.26644.png: rendered first page (pdftoppm at
  150 DPI) of the ALCS paper, ~340 KB
- skills/paper/SKILL.md: documents the load-bearing `arxiv:` field,
  the auto-included first-page screenshot, and the pdftoppm command;
  drops the previous "manual cropped figure" wording
- _posts/2026-03-27-2603.26644.md: drops the redundant first body line
  pointing at the paper (now the badge does that)

Refs #3
/papers/ list: each entry is now a card-row with the first-page
thumbnail on the left, date, arXiv badge, title, authors, and an
optional excerpt on the right. Posts without an arxiv frontmatter
field still render with a placeholder thumbnail box.

Post page: the first-page figure is now a 150px floated thumbnail
beside the header rather than a full-width inline figure. Caption
is hidden (the badge already labels the source). On narrow viewports
it falls back to centered above the body.
Splitting PR #6 into two: this branch keeps the paper skill, the auto-
generated-post purge, the post-layout improvements (arXiv badge +
first-page thumbnail) and the /papers/ index card-row treatment.

Toby's ALCS paper post (`_posts/2026-03-27-2603.26644.md` and
`assets/images/papers/2603.26644.png`) lands in a follow-up PR
authored on his behalf so he can review and adjust the 'How AI helped'
section before merge.
@williamjameshandley williamjameshandley changed the title Baseline paper skill for PR-driven paper posts Paper skill, /papers/ rebrand, and AI-slop purge May 5, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant