test(discovery/crawler): capture golden fixtures from five sources
This commit is contained in:
@@ -10,6 +10,7 @@ repos:
|
||||
exclude: ^web/tsconfig\.json$
|
||||
- id: check-merge-conflict
|
||||
- id: check-added-large-files
|
||||
exclude: ^backend/internal/domain/discovery/crawler/testdata/
|
||||
- id: no-commit-to-branch
|
||||
args: ['--branch', 'main']
|
||||
|
||||
|
||||
16
backend/internal/domain/discovery/crawler/testdata/README.md
vendored
Normal file
16
backend/internal/domain/discovery/crawler/testdata/README.md
vendored
Normal file
@@ -0,0 +1,16 @@
|
||||
# Crawler test fixtures
|
||||
|
||||
Captured with curl using the default Firefox UA on 2026-04-18.
|
||||
|
||||
These are the exact bytes each source served at capture time. If a parser starts
|
||||
failing after a site redesign, re-capture the corresponding file with the curl
|
||||
commands documented in `docs/superpowers/plans/2026-04-18-dach-discovery-crawler.md`
|
||||
(Task 2) and update the parser's expected assertions.
|
||||
|
||||
- `marktkalendarium.html` — https://www.marktkalendarium.de/maerkte2026.php
|
||||
- `mittelalterkalender.html` — https://www.mittelalterkalender.info/mittelaltermarkt/mittelalterfeste-2026-nach-datum.php
|
||||
- `festival_alarm.html` — https://www.festival-alarm.com/Kategorien/Mittelalter-Festivals/(year)/2026
|
||||
- `mittelaltermarkt_online_page1.json` — Tribe REST API page 1
|
||||
- `mittelaltermarkt_online_page2.json` — Tribe REST API page 2
|
||||
- `suendenfrei_page1.html` — https://www.suendenfrei.tv/veranstaltungen
|
||||
- `suendenfrei_page2.html` — https://www.suendenfrei.tv/veranstaltungen/page-2
|
||||
1753
backend/internal/domain/discovery/crawler/testdata/festival_alarm.html
vendored
Normal file
1753
backend/internal/domain/discovery/crawler/testdata/festival_alarm.html
vendored
Normal file
File diff suppressed because it is too large
Load Diff
8329
backend/internal/domain/discovery/crawler/testdata/marktkalendarium.html
vendored
Normal file
8329
backend/internal/domain/discovery/crawler/testdata/marktkalendarium.html
vendored
Normal file
File diff suppressed because it is too large
Load Diff
8035
backend/internal/domain/discovery/crawler/testdata/mittelalterkalender.html
vendored
Normal file
8035
backend/internal/domain/discovery/crawler/testdata/mittelalterkalender.html
vendored
Normal file
File diff suppressed because one or more lines are too long
1
backend/internal/domain/discovery/crawler/testdata/mittelaltermarkt_online_page1.json
vendored
Normal file
1
backend/internal/domain/discovery/crawler/testdata/mittelaltermarkt_online_page1.json
vendored
Normal file
File diff suppressed because one or more lines are too long
1
backend/internal/domain/discovery/crawler/testdata/mittelaltermarkt_online_page2.json
vendored
Normal file
1
backend/internal/domain/discovery/crawler/testdata/mittelaltermarkt_online_page2.json
vendored
Normal file
File diff suppressed because one or more lines are too long
231
backend/internal/domain/discovery/crawler/testdata/suendenfrei_page1.html
vendored
Normal file
231
backend/internal/domain/discovery/crawler/testdata/suendenfrei_page1.html
vendored
Normal file
File diff suppressed because one or more lines are too long
231
backend/internal/domain/discovery/crawler/testdata/suendenfrei_page2.html
vendored
Normal file
231
backend/internal/domain/discovery/crawler/testdata/suendenfrei_page2.html
vendored
Normal file
File diff suppressed because one or more lines are too long
Reference in New Issue
Block a user