073e55c7fc48e3ef04863bc4e9d479fe45a7445c
- Service.Crawl no longer link-verifies Quellen/Website for crawler
  events. Those URLs come from the real HTML of trusted sources and
  have been implicitly verified at parse time. Removing the check lets
  the insert phase complete in well under a minute even for 1500+
  events and stops time-limited processing from being misreported as
  link failures. The LinkCheckFailed counter is retained for JSON shape
  stability.

- Suendenfrei pagination now stops on len(events) == 0. Previously the
  site's footer <h3><a> links kept anchors.Length() > 0 indefinitely,
  sending the crawler to page-90 before the outer ctx timeout fired.

- New similarity helper (SimilarityScore, FindSimilar) and endpoint
  GET /api/v1/admin/discovery/queue/:id/similar. The score is
  multiplicative: the Levenshtein ratio of the normalized names gates
  the city-match and date-proximity bonuses. This prevents events that
  merely share a city and date from being flagged as near-duplicates
  when their names differ. It lets admin review flag near-duplicates
  that slip past exact-match dedup (date typos, city variants,
  trailing-word swaps).
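The multiplicative gating described in the last item could look roughly like the sketch below. It is an illustration under stated assumptions, not the code from this commit: the `Event` and `Match` types, the `day` helper, the bonus weights, the three-day window, and the signatures of `SimilarityScore` and `FindSimilar` are all hypothetical. Only the two function names and the general idea (the name ratio multiplies the bonuses) come from the message above.

```go
package main

import (
	"math"
	"sort"
	"strings"
	"time"
	"unicode"
)

// Event is a hypothetical, trimmed-down record used only for this sketch.
type Event struct {
	Name, City string
	Date       time.Time
}

// Match pairs a candidate event with its score against the target.
type Match struct {
	Event Event
	Score float64
}

// Assumed tuning constants; the real weights are not stated in the commit.
const (
	cityBonus = 0.25
	dateBonus = 0.25
	maxDays   = 3.0
)

// day builds a UTC midnight timestamp (illustrative helper).
func day(y, m, d int) time.Time {
	return time.Date(y, time.Month(m), d, 0, 0, 0, 0, time.UTC)
}

// normalize lowercases, replaces punctuation with spaces and collapses whitespace.
func normalize(s string) string {
	var b strings.Builder
	for _, r := range strings.ToLower(s) {
		if unicode.IsLetter(r) || unicode.IsDigit(r) {
			b.WriteRune(r)
		} else {
			b.WriteRune(' ')
		}
	}
	return strings.Join(strings.Fields(b.String()), " ")
}

// levenshtein returns the edit distance using two rolling rows.
func levenshtein(a, b []rune) int {
	prev, cur := make([]int, len(b)+1), make([]int, len(b)+1)
	for j := range prev {
		prev[j] = j
	}
	for i := 1; i <= len(a); i++ {
		cur[0] = i
		for j := 1; j <= len(b); j++ {
			best := prev[j-1] // substitution, free when the runes match
			if a[i-1] != b[j-1] {
				best++
			}
			if del := prev[j] + 1; del < best {
				best = del
			}
			if ins := cur[j-1] + 1; ins < best {
				best = ins
			}
			cur[j] = best
		}
		prev, cur = cur, prev
	}
	return prev[len(b)]
}

// nameRatio maps the edit distance of the normalized names into [0, 1].
func nameRatio(a, b string) float64 {
	ra, rb := []rune(normalize(a)), []rune(normalize(b))
	longest := math.Max(float64(len(ra)), float64(len(rb)))
	if longest == 0 {
		return 0
	}
	return 1 - float64(levenshtein(ra, rb))/longest
}

// SimilarityScore multiplies the name ratio by (1 + bonuses), scaled to [0, 1].
// A low name ratio therefore suppresses the city and date bonuses.
func SimilarityScore(a, b Event) float64 {
	bonus := 1.0
	if c := normalize(a.City); c != "" && c == normalize(b.City) {
		bonus += cityBonus
	}
	if days := math.Abs(a.Date.Sub(b.Date).Hours()) / 24; days <= maxDays {
		bonus += dateBonus * (1 - days/(maxDays+1))
	}
	return nameRatio(a.Name, b.Name) * bonus / (1 + cityBonus + dateBonus)
}

// FindSimilar returns candidates scoring at or above threshold, best first.
func FindSimilar(target Event, candidates []Event, threshold float64) []Match {
	var out []Match
	for _, c := range candidates {
		if s := SimilarityScore(target, c); s >= threshold {
			out = append(out, Match{Event: c, Score: s})
		}
	}
	sort.Slice(out, func(i, j int) bool { return out[i].Score > out[j].Score })
	return out
}
```

Because the bonuses are multiplied by the name ratio rather than added to it, two events in the same city on the same day can never score higher than their name ratio alone, while a name with a small typo and a date one day off still scores close to the top of the range under these assumed weights.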