vikingowl 073e55c7fc feat(discovery): drop link-check from crawl path, fix suendenfrei pagination, add similarity helper
- Service.Crawl no longer link-verifies Quellen/Website for crawler
  events. Those URLs come from real HTML of trusted sources and have
  been implicitly verified at parse time. Removing this makes the
  insert phase complete in well under a minute even for 1500+ events
  and stops attributing timing-limited processing as link failures.
  LinkCheckFailed counter retained for JSON shape stability.

- Suendenfrei pagination now stops on len(events) == 0. Previously the
  site's footer <h3><a> links kept anchors.Length() > 0 indefinitely,
  sending the crawler to page-90 before the outer ctx timeout.

- New similarity helper (SimilarityScore, FindSimilar) and endpoint
  GET /api/v1/admin/discovery/queue/:id/similar. Multiplicative score
  of normalized-name Levenshtein ratio gating city-match and date-
  proximity bonuses. Prevents coincident-city/date events from being
  incorrectly flagged as near-duplicates when their names differ.
  Lets admin review flag near-duplicates that slip past exact-match
  dedup (date typos, city variants, trailing-word swaps).
2026-04-18 20:05:07 +02:00
2026-02-21 07:10:30 +01:00
2026-04-07 02:53:03 +02:00
Description
No description provided
1.7 MiB
Languages
Go 60.3%
Svelte 20.3%
Dart 11.1%
TypeScript 5%
PLpgSQL 1.1%
Other 2.1%