test(discovery/crawler): capture golden fixtures from five sources

2026-04-18 11:45:53 +02:00
parent 5135f0a3be
commit e359d06d13
9 changed files with 18598 additions and 0 deletions


@@ -10,6 +10,7 @@ repos:
       exclude: ^web/tsconfig\.json$
     - id: check-merge-conflict
     - id: check-added-large-files
+      exclude: ^backend/internal/domain/discovery/crawler/testdata/
     - id: no-commit-to-branch
       args: ['--branch', 'main']
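The added `exclude` pattern exempts the crawler fixtures from the `check-added-large-files` hook. A minimal Go sketch of which paths the pattern covers follows. It assumes the hook applies the pattern as an unanchored regex search against repo-relative paths (the `^` supplies the anchoring). `isExcluded` and the sample paths are hypothetical names used only for illustration.

```go
package main

import (
	"fmt"
	"regexp"
)

// fixtureExclude copies the exclude pattern from the hunk above.
var fixtureExclude = regexp.MustCompile(`^backend/internal/domain/discovery/crawler/testdata/`)

// isExcluded reports whether a repo-relative path matches the pattern,
// i.e. whether the large-file check would skip it (assumed semantics).
func isExcluded(path string) bool {
	return fixtureExclude.MatchString(path)
}

func main() {
	// Sample paths are illustrative; only the testdata prefix comes from the diff.
	paths := []string{
		"backend/internal/domain/discovery/crawler/testdata/festival_alarm.html",
		"backend/internal/domain/discovery/crawler/parser.go",
		"web/backend/internal/domain/discovery/crawler/testdata/x.html",
	}
	for _, p := range paths {
		fmt.Printf("excluded=%-5v %s\n", isExcluded(p), p)
	}
}
```

Only the first path is skipped: the second lies outside `testdata/`, and the third fails the `^` anchor because the prefix does not start at the repository root.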


@@ -0,0 +1,16 @@
# Crawler test fixtures
Captured on 2026-04-18 with curl, sending a default Firefox User-Agent string.
These are the exact bytes each source served at capture time. If a parser starts
failing after a site redesign, re-capture the corresponding file with the curl
commands documented in `docs/superpowers/plans/2026-04-18-dach-discovery-crawler.md`
(Task 2) and update the assertions in that parser's tests.
- `marktkalendarium.html` — https://www.marktkalendarium.de/maerkte2026.php
- `mittelalterkalender.html` — https://www.mittelalterkalender.info/mittelaltermarkt/mittelalterfeste-2026-nach-datum.php
- `festival_alarm.html` — https://www.festival-alarm.com/Kategorien/Mittelalter-Festivals/(year)/2026
- `mittelaltermarkt_online_page1.json` — Tribe REST API page 1
- `mittelaltermarkt_online_page2.json` — Tribe REST API page 2
- `suendenfrei_page1.html` — https://www.suendenfrei.tv/veranstaltungen
- `suendenfrei_page2.html` — https://www.suendenfrei.tv/veranstaltungen/page-2

File diff suppressed because it is too large (2 files)

File diff suppressed because one or more lines are too long (5 files)