Add newsletter-mail crawler ingestion slice #34

Merged
florianbeisel merged 1 commit from clawteam/rauchbar-dev/backend-crawlers into main 2026-03-26 11:19:31 +01:00
florianbeisel commented 2026-03-26 11:14:06 +01:00 (Migrated from github.com)

Summary

  • add the first newsletter-mail ingestion slice for the crawler MVP
  • include a concrete Cigarworld mailbox-style HTML source, dedicated scraper, fixture-based parser coverage, and registry wiring
  • extend the worker/runtime ingestion path beyond webshop-only sources

Verification

  • corepack pnpm install --frozen-lockfile
  • corepack pnpm build
  • corepack pnpm typecheck
  • corepack pnpm --filter @rauchbar/worker test
## Summary - add the first newsletter-mail ingestion slice for the crawler MVP - include a concrete Cigarworld mailbox-style HTML source, dedicated scraper, fixture-based parser coverage, and registry wiring - extend the worker/runtime ingestion path beyond webshop-only sources ## Verification - corepack pnpm install --frozen-lockfile - corepack pnpm build - corepack pnpm typecheck - corepack pnpm --filter @rauchbar/worker test
florianbeisel commented 2026-03-26 11:15:10 +01:00 (Migrated from github.com)

One blocking issue on PR #34.

The PR is not currently mergeable: GitHub reports it as CONFLICTING / DIRTY against main, so this branch needs a refresh/rebase before it can be landed safely. I also rechecked the branch CI state and the required GitHub checks are green, but the merge conflict itself is a hard blocker for review-clear handoff.

One blocking issue on PR #34. The PR is not currently mergeable: GitHub reports it as `CONFLICTING` / `DIRTY` against `main`, so this branch needs a refresh/rebase before it can be landed safely. I also rechecked the branch CI state and the required GitHub checks are green, but the merge conflict itself is a hard blocker for review-clear handoff.
florianbeisel commented 2026-03-26 11:19:15 +01:00 (Migrated from github.com)

Re-review at 628277b: no blocking findings. The earlier conflict-only blocker is resolved, GitHub reports the branch as CLEAN/MERGEABLE again, the diff remains limited to the newsletter-mail crawler ingestion slice, and a local targeted worker test pass succeeded with corepack pnpm --dir /tmp/rauchbar-pr34 --filter @rauchbar/worker test -- cigarworld-newsletter-mail.test.ts (21/21 passing).

Re-review at 628277b: no blocking findings. The earlier conflict-only blocker is resolved, GitHub reports the branch as CLEAN/MERGEABLE again, the diff remains limited to the newsletter-mail crawler ingestion slice, and a local targeted worker test pass succeeded with corepack pnpm --dir /tmp/rauchbar-pr34 --filter @rauchbar/worker test -- cigarworld-newsletter-mail.test.ts (21/21 passing).
Sign in to join this conversation.
No description provided.