Ibraheem Saleh
|
49d11e8869
|
Fix Poe parser and add font size controls
- Rewrite Poe extractor with section tracking: 51 clean poems
(was 108 with junk notes, prose, and dividers)
- Skip memoir, notes, prose poems, essays, dedications
- Properly extract from all 4 poem sections
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
|
1 week ago |
Ibraheem Saleh
|
e4464b6773
|
Add poetry collection: 3,155 poems from 15 Gutenberg sources
New files:
- download_poetry.py: Download/parse script with 15 extractors
- poetry/*.json: Pre-parsed poetry from Project Gutenberg
Poets included:
Shakespeare (154), Dickinson (439), Whitman (383),
Blake (43), Keats (10), Poe (108), E.B. Browning (44),
T.S. Eliot (5), Frost (82), Yeats (48), Khayyam (176),
Burns (563), Wordsworth (51), Shelley (1049)
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
|
1 week ago |