Xero App Store Intelligence

Last updated: 2026-03-30 23:30 UTC
Status: operational
Canonical sources: repos/xero/tools/intelligence/*

This page is the current front door for the Xero intelligence stack. It reflects the live pain radar for sources outside the App Store, the Postgres search layer, and the ranked solo-dev opportunity outputs.

Current Corpus

  • 475 apps (full catalog)
  • 18,612 reviews (all-time)
  • 5,272 product ideas / feature requests
  • 199 Reddit posts (151 from last 30 days)
  • 14 Xero Central community posts
  • 4,431 derived signals (pain points, feature requests, opportunities)
  • 18 AI synthesis / research entries

Total searchable documents: ~29,000 (29,021 when the counts above are summed)

Data Source Status

🟢 Active (auto-updating)

  • Xero App Store (apps + reviews): Weekly full crawl, stored in SQLite (xero-intel.db). 475 apps, 18,612 reviews.
  • Reddit (r/xero + 90+ subreddits): RSS-based scraper, no auth needed. Last run: 2026-03-30. 199 posts, 18 new this run.
  • Derived signals: Auto-generated from review sentiment, rating drops, keyword extraction.
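
As a rough illustration of the rating-drop signal mentioned above, the sketch below compares an app's recent average rating against its earlier baseline. The function name, the review shape (`{ rating, date }`), the 90-day window, and the 0.5-star threshold are all assumptions made for illustration; they are not taken from build-signals.mjs.

```javascript
// Hypothetical sketch: flag an app whose recent average rating fell
// noticeably below its earlier baseline. The review shape, window, and
// thresholds are illustrative assumptions, not the production logic.
function detectRatingDrop(reviews, now, { windowDays = 90, minDrop = 0.5, minRecent = 5 } = {}) {
  const cutoff = now.getTime() - windowDays * 24 * 60 * 60 * 1000;
  const recent = [];
  const baseline = [];
  for (const review of reviews) {
    const ts = new Date(review.date).getTime();
    (ts >= cutoff ? recent : baseline).push(review.rating);
  }
  // Too few data points on either side: emit no signal rather than a noisy one.
  if (recent.length < minRecent || baseline.length === 0) return null;

  const mean = (xs) => xs.reduce((sum, x) => sum + x, 0) / xs.length;
  const baselineAvg = mean(baseline);
  const recentAvg = mean(recent);
  const drop = baselineAvg - recentAvg;
  if (drop < minDrop) return null;
  return { type: "rating_drop", baselineAvg, recentAvg, drop };
}
```

Requiring a minimum number of recent reviews keeps long-tail apps with only a handful of reviews from generating spurious signals.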

🟡 Restored (manual trigger)

  • Xero Central: Gemini grounding + known URL harvest. 14 items. Limited by community page structure.
  • Product Ideas: 5,272 items. Scraper needs periodic re-runs.

🔴 Not Yet Integrated

  • XeroAPI GitHub Issues: Not scraped yet. Low volume but high signal for integration developers.
  • Xero Blog: Scraped once (Feb 2026), not in regular rotation.
  • Hacker News: One-shot scrape. Low Xero-specific volume.

Architecture

Sources (RSS/HTML/API)
    ↓
Scrapers (Node.js, repos/xero/tools/intelligence/)
    ↓
SQLite (xero-intel.db, 27MB, FTS5 indexes)
    ↓
Signal Layer (pain extraction, opportunity scoring)
    ↓
Postgres (skool_knowledge.friday_memory) for unified cross-domain search
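
To make the last hop of the pipeline concrete, here is a minimal sketch of how source-specific rows might be normalized into one document shape before being synced to the unified search layer. The schema of skool_knowledge.friday_memory is not shown on this page, so every field name below (`domain`, `externalId`, `title`, `body`, `createdAt`) and every input column is a hypothetical placeholder.

```javascript
// Hypothetical normalizer: maps rows from different sources onto a single
// document shape for cross-domain search. All field and column names are
// assumptions; the real SQLite and Postgres schemas may differ.
function toSearchDocument(source, row) {
  const base = { domain: "xero", source };
  switch (source) {
    case "review":
      return { ...base, externalId: `review:${row.id}`, title: row.appName, body: row.text, createdAt: row.date };
    case "reddit":
      return { ...base, externalId: `reddit:${row.id}`, title: row.title, body: row.selftext || "", createdAt: row.date };
    case "product_idea":
      return { ...base, externalId: `idea:${row.id}`, title: row.title, body: row.description || "", createdAt: row.date };
    default:
      // Fail loudly so a new source cannot be synced without a mapping.
      throw new Error(`Unknown source: ${source}`);
  }
}
```

Prefixing `externalId` with the source keeps identifiers unique across sources, which would let a sync job upsert documents instead of inserting duplicates.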

GStack + Superpowers Lens: "So What" Analysis

What the data tells us (applied frameworks)

GStack (Generative Stack) view: The Xero ecosystem has a clear "missing middle" problem. The store lists 475 apps, but the distribution is extreme: a handful of large players (Dext, formerly Receipt Bank; Hubdoc; Spendesk) dominate the top categories, while hundreds of poorly maintained apps with fewer than 10 reviews fill the long tail. The opportunity is in the gap between "enterprise SaaS with sales teams" and "hobby projects that got listed."

Compound/Superpowers view: Each new data source we add creates non-linear value. The App Store reviews alone are useful. Combined with Reddit pain signals, they become powerful. Add Product Ideas voting data, and you can quantify demand before building. This is the compound intelligence moat: no single competitor has all three signals in one searchable corpus.

Top 5 Solo-Dev Opportunities (refreshed 2026-03-30)

  1. Cash flow forecasting with AI narrative (pain: high, competition: medium)
     • Reddit signal: 23 posts asking for "simple cash flow" tools
     • Product Ideas: 847 votes on cash flow visibility requests
     • Gap: existing tools are spreadsheet-heavy; none use LLM-generated plain-English summaries

  2. Receipt/expense auto-categorization (pain: very high, competition: high but fragmented)
     • Reviews: Dext has 2,100+ reviews but 1-star rate is 18%
     • Reddit: consistent complaints about OCR accuracy
     • Gap: a lightweight, Xero-native alternative without Dext's pricing

  3. Multi-currency reconciliation helper (pain: medium-high, competition: low)
     • Product Ideas: 312 votes on multi-currency improvements
     • Reddit: expat/digital nomad segment underserved
     • Gap: no dedicated tool; users build manual workarounds

  4. Invoice payment reminder automation (pain: high, competition: medium)
     • Reviews: Chaser dominates but pricing is £100+/mo
     • Reddit: small businesses want "just the reminders, nothing else"
     • Gap: simple, cheap, Xero-native payment chasing

  5. Bank feed error resolver (pain: high, competition: zero)
     • Xero Central: 4 of 14 posts are about bank feed issues
     • Product Ideas: hundreds of votes on feed reliability
     • Gap: diagnostic tool that explains and suggests fixes for common feed errors
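
The pain and competition labels attached to each opportunity can be turned into a comparable number. The sketch below is one simple way to do that, assuming a plain "pain minus competition" score; the numeric weights and function names are hypothetical. It is not the scoring that produced the ranking on this page, which may weigh other evidence: on labels alone this formula would, for example, put the bank feed error resolver first.

```javascript
// Hypothetical label-to-number scales; the weights are illustrative assumptions.
const PAIN = new Map([["medium", 2], ["medium-high", 2.5], ["high", 3], ["very high", 4]]);
const COMPETITION = new Map([["zero", 0], ["low", 1], ["medium", 2], ["high", 3]]);

// Score = pain weight minus competition weight (higher is more attractive).
function scoreOpportunity({ pain, competition }) {
  if (!PAIN.has(pain) || !COMPETITION.has(competition)) {
    throw new Error(`Unknown label: pain=${pain}, competition=${competition}`);
  }
  return PAIN.get(pain) - COMPETITION.get(competition);
}

// Returns a new array sorted by descending score; the input is not mutated.
function rankOpportunities(opportunities) {
  return opportunities
    .map((o) => ({ ...o, score: scoreOpportunity(o) }))
    .sort((a, b) => b.score - a.score);
}

const ranked = rankOpportunities([
  { name: "Cash flow forecasting", pain: "high", competition: "medium" },
  { name: "Receipt auto-categorization", pain: "very high", competition: "high" },
  { name: "Multi-currency reconciliation", pain: "medium-high", competition: "low" },
  { name: "Payment reminders", pain: "high", competition: "medium" },
  { name: "Bank feed error resolver", pain: "high", competition: "zero" },
]);
```

A fuller version would probably fold in the evidence counts (Reddit posts, Product Ideas votes, 1-star rates) rather than rely on coarse labels.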

Applied to OYN

OYN's positioning in this landscape: the intelligence layer that sits above all these point solutions. Instead of competing on any single feature, OYN watches the user's entire Xero ecosystem and surfaces the most impactful action at any moment. The Xero intelligence stack feeds directly into OYN's competitive positioning:

  • We know which apps have the worst reviews (and why)
  • We know which features users are begging for (Product Ideas votes)
  • We know the real pain points that don't show up in App Store ratings (Reddit, Central)
  • We can position OYN as "the AI that replaces 3-4 single-purpose apps"

Maintenance Schedule

Task                   Frequency          Method
App Store full crawl   Weekly             scrape-appstore.mjs
Review delta scrape    Weekly             scrape-reviews-targeted.mjs
Reddit RSS             Daily (planned)    scrape-reddit.mjs
Signal regeneration    After each scrape  build-signals.mjs
Xero Central           Monthly            scrape-xero-central.mjs
Product Ideas          Monthly            scrape-product-ideas.mjs
Postgres sync          After scrape       Manual (planned: automatic)
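
The time-based rows of this schedule could be checked by a small helper that reports which scrapers are due. The sketch below is one possible shape, assuming last-run timestamps are tracked somewhere (the `lastRuns` object is hypothetical) and approximating "monthly" as 30 days. Signal regeneration and Postgres sync are left out because they are triggered by a scrape rather than by the clock.

```javascript
// Intervals taken from the schedule above; "Monthly" is approximated as 30 days.
const SCHEDULE = [
  { task: "App Store full crawl", script: "scrape-appstore.mjs", everyDays: 7 },
  { task: "Review delta scrape", script: "scrape-reviews-targeted.mjs", everyDays: 7 },
  { task: "Reddit RSS", script: "scrape-reddit.mjs", everyDays: 1 },
  { task: "Xero Central", script: "scrape-xero-central.mjs", everyDays: 30 },
  { task: "Product Ideas", script: "scrape-product-ideas.mjs", everyDays: 30 },
];

const DAY_MS = 24 * 60 * 60 * 1000;

// lastRuns is a hypothetical map of task name -> ISO timestamp of the last run.
// A task with no recorded run is treated as due.
function dueScripts(lastRuns, now) {
  return SCHEDULE
    .filter(({ task, everyDays }) => {
      const last = lastRuns[task];
      if (!last) return true;
      return now.getTime() - new Date(last).getTime() >= everyDays * DAY_MS;
    })
    .map(({ script }) => script);
}
```

A nightly job could call `dueScripts` and run only the returned scripts, which would keep the weekly and monthly scrapers on the same trigger as the daily Reddit run.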

Next Steps

  1. Wire Reddit scraper into a nightly cron job (autonomous)
  2. Build automatic Postgres ingestion pipeline from SQLite after each scrape
  3. Add XeroAPI GitHub Issues as a new source
  4. Create weekly "Xero Ecosystem Pulse" auto-generated report
  5. Cross-reference Product Ideas voting trends with App Store review sentiment