@stefaneng
Member

No description provided.

@stefaneng stefaneng marked this pull request as draft November 9, 2025 02:40
@jacksonloper
Collaborator

jacksonloper commented Nov 10, 2025

Looking good! I wish the PDF document from Michigan made it more obvious how the hierarchy of sub-headings and bullets is supposed to go. I think your parser gets it wrong sometimes, e.g.

Screenshot 2025-11-10 at 11 57 22 AM

but honestly, I'm not completely clear on what system these documents even use for defining subheadings.

@jacksonloper
Collaborator

jacksonloper commented Nov 10, 2025

ChatGPT just told me that this PDF is a "war crime against indentation." ChatGPT is off its rocker, but its point may be right. It may be better to include just the text and the history, rather than the sub-bulleting, because the hierarchy is genuinely ambiguous. For example, consider this:

...
(u) Cheeses include
(i) Cheddar
(ii) Gouda
(iii) Blue
(iv) Swiss
(v) The Dutch

Sane people could disagree about whether (v) is meant to be part of the Roman-numeral list or part of the higher-level letter list (coming right after (u)).
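
To make the ambiguity concrete, here is a rough Python sketch of how one might flag markers that fit more than one list level. This is not the parser in this PR: `candidate_readings`, `ROMAN`, and the `prev_letter` / `prev_roman` arguments are all hypothetical names made up for illustration, and it assumes markers are lowercase and that Roman numerals never go past (x).

```python
# Hypothetical sketch: which list levels could a marker continue?
# Assumes lowercase markers and Roman numerals up to "x" only.
ROMAN = ["i", "ii", "iii", "iv", "v", "vi", "vii", "viii", "ix", "x"]


def candidate_readings(marker, prev_letter, prev_roman):
    """Return every list level that `marker` could plausibly belong to.

    prev_letter: last marker seen in the letter list, e.g. "u" (or None)
    prev_roman:  last marker seen in the Roman sub-list, e.g. "iv" (or None)
    """
    readings = []

    # Reading 1: the marker is the next item of the letter list.
    if prev_letter is not None and len(marker) == 1:
        if ord(marker) == ord(prev_letter) + 1:
            readings.append("letter")

    # Reading 2: the marker starts or continues the Roman-numeral sub-list.
    if marker in ROMAN:
        idx = ROMAN.index(marker)
        if prev_roman is None:
            if idx == 0:  # a new sub-list can only start at "i"
                readings.append("roman")
        elif prev_roman in ROMAN and idx == ROMAN.index(prev_roman) + 1:
            readings.append("roman")

    return readings


# The cheese example: after (u) ... (iv), the marker (v) fits both levels.
ambiguous = candidate_readings("v", prev_letter="u", prev_roman="iv")
# (ii) after (i) can only be Roman, so it is unambiguous.
unambiguous = candidate_readings("ii", prev_letter="u", prev_roman="i")
```

Any marker for which this returns two readings cannot be placed from the marker text alone; it would need indentation or some other layout cue from the PDF. The same thing happens with (i) directly after (h).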
