Complete Guide to Getting Data from X (formerly Twitter) — February 2026
Executive Summary
X has undergone massive API restructuring since the Musk acquisition. The old tiered subscription model (Free/Basic/Pro/Enterprise) has been replaced with pay-per-usage pricing (no monthly subscriptions). The platform now offers modern v2 endpoints, a new X Activity API (XAA) for real-time events, a News API powered by Grok, Community Notes API, and webhooks. Scraping is harder than ever but not impossible. Here's every method available today.
1. Official X API — Current Structure
Pricing Model (NEW — Pay-Per-Usage)
X has moved away from the old Free/Basic ($100/mo)/Pro ($5,000/mo) subscription tiers to a pay-per-usage model. You pay only for what you use — no monthly caps or subscriptions.
- Access: Sign up at console.x.com to manage apps, view usage, and purchase credits
- Status: ✅ Working
- Authentication: OAuth 1.0a (user context) and OAuth 2.0 (app-only / PKCE)
Available Endpoint Categories:
| Category | What You Get |
|---|---|
| Posts | Create, read, search, delete posts; full-archive search back to 2006 |
| Users | Profile lookup, follower/following lists |
| Direct Messages | Read/send DMs (with user auth) |
| Likes | Get liked posts, liking users, like/unlike |
| Lists | List management and membership |
| Spaces | Lookup, search, ticket buyers |
| Communities | Community data |
| Community Notes | Search eligible posts, create/evaluate notes (pilot) |
| Trends | Trending topics |
| News | Grok-curated news stories and headlines (NEW) |
| Media | Upload images, videos, GIFs |
| X Activity (XAA) | Real-time event subscriptions (NEW) |
| Filtered Stream | Real-time post delivery with up to 1,000 filter rules |
| Compliance | Compliance streams |
| Webhooks | Filtered stream & account activity via webhook delivery |
| Usage | Monitor your API consumption |
Key Features:
- Full-archive search: Search ALL public posts back to 2006 via
GET /2/tweets/search/all - Filtered stream: Up to 1,000 rules for real-time matching
- Rich data objects: Posts (full text, metrics, entities, annotations, threads), Users (profiles, follower counts, verification), Media, Polls
- Fields & Expansions: Customize responses to get exactly the data you need
Cost: Pay-per-usage (specific per-request pricing not publicly listed — check console.x.com) Gotchas: Must purchase credits upfront; pricing can be opaque
2. X API v2 vs v1.1
| Version | Status | Notes |
|---|---|---|
| v2 | ✅ Current — all new features | Modern endpoints, flexible pricing, recommended |
| v1.1 | ⚠️ Deprecated/Limited | Legacy; some endpoints may still respond but not maintained |
| Enterprise (GNIP 2.0) | ✅ Available | High-volume access, dedicated support |
Recommendation: Use v2 exclusively. v1.1 is on borrowed time.
3. Academic Research Access
- Status: ❌ Discontinued
- The Academic Research product track was eliminated in 2023 when Elon Musk restructured the API.
- No free/discounted academic tier exists anymore.
- Researchers must use the same pay-per-usage API as everyone else.
- Some universities have institutional enterprise agreements — check with your library/IT.
4. Firehose / Full-Archive Search
Full-Archive Search (v2)
- Status: ✅ Available via pay-per-usage
- Endpoint:
GET /2/tweets/search/all - Coverage: All public posts since 2006
- Cost: Pay-per-usage (expensive at scale)
Enterprise Firehose (GNIP 2.0)
- Status: ✅ Available (enterprise contracts only)
- Products:
- PowerTrack API: Complete real-time stream of all public posts, filterable
- Decahose API: Random 10% sample of all public posts + likes
- Historical PowerTrack: Batch access to full archive
- Search API: 30-day and full-archive search
- Compliance Firehose: All compliance events
- Engagement API: Impressions and engagement metrics
- Cost: Enterprise pricing — contact X sales (typically $10K–$100K+/month)
- Access: Managed via Gnip Console, dedicated account managers
5. Third-Party Data Providers
Companies that have enterprise X API access and resell/provide data:
| Provider | What They Offer | Status |
|---|---|---|
| Brandwatch (Cision) | Social listening, historical data, sentiment analysis | ✅ Active |
| Sprout Social | Social media management, analytics, listening | ✅ Active |
| Meltwater | Media monitoring, social listening | ✅ Active |
| Talkwalker (Hootsuite) | Social listening, analytics | ✅ Active |
| Sprinklr | Enterprise social suite, listening | ✅ Active |
| Synthesio (Ipsos) | Social intelligence platform | ✅ Active |
| NewsWhip | Social media content prediction/analytics | ✅ Active |
| Pulsar | Audience intelligence, social listening | ✅ Active |
| Socialbakers (Emplifi) | Social media analytics | ✅ Active |
Cost: $500–$50,000+/month depending on scale Gotchas: Many had to renegotiate after X's API changes; data coverage may have gaps. Some may have lost firehose access. Always verify current X data coverage before signing.
6. Scraping Tools
Nitter
- Status: ⚠️ Alive but requires real Twitter accounts
- Now requires session tokens from real X accounts (guest API was killed)
- Public instances are unreliable; most shut down
- Self-hosting possible but you need accounts to feed it
- Still provides RSS feeds, no-JS browsing
- Risk: Account bans, session expiry
snscrape
- Status: ⚠️ Largely broken for Twitter
- GitHub repo still exists, lists Twitter support
- In practice, X's anti-scraping measures have broken most Twitter functionality
- May work intermittently; requires ongoing maintenance
- Still works for other platforms (Instagram, Reddit, etc.)
Twint
- Status: ❌ Dead
- Unmaintained, completely broken since Twitter API changes in 2023
Other Scrapers
| Tool | Status |
|---|---|
| Twikit | ⚠️ Intermittent — uses unofficial API, needs account auth |
| twitter-scraper (Python) | ❌ Mostly broken |
| Selenium/manual scrapers | ⚠️ Possible but fragile (see browser automation below) |
| yt-dlp | ✅ Can download Twitter/X videos |
Overall scraping verdict: Extremely difficult. X has aggressive bot detection, rate limiting, and legal enforcement. Any scraping approach requires authenticated accounts and is fragile.
7. Browser Automation (Playwright/Puppeteer)
- Status: ⚠️ Possible but extremely fragile
- X requires login for most content viewing
- Heavy anti-bot detection (fingerprinting, CAPTCHAs, behavioral analysis)
- Approaches:
- Login with real account via Playwright
- Use stealth plugins (playwright-extra, puppeteer-extra-plugin-stealth)
- Rotate proxies, randomize behavior
- Data accessible: Anything you can see as a logged-in user
- Rate: Very slow compared to API
- Risk: Account bans, IP blocks, ToS violation
- Cost: Proxy costs ($50-500/mo for residential proxies)
Recommendation: Last resort. Use the API instead.
8. RSS Feeds
Native X RSS
- Status: ❌ X killed native RSS feeds years ago
Nitter RSS
- Status: ⚠️ Works on self-hosted instances (with account tokens)
- Format:
https://your-nitter-instance/username/rss - Provides user timelines, searches, hashtags as RSS
- Requires maintaining Nitter + session tokens
Third-party RSS bridges
- RSSHub (rsshub.app): ⚠️ X routes may be broken/limited
- RSS.app, Feedbin: Some offer X-to-RSS conversion (paid services)
- OpenRSS: ⚠️ Limited X support
Recommendation: Self-hosted Nitter is the most reliable RSS option, but requires maintenance.
9. Google Cache / Wayback Machine / Archives
Google Cache
- Status: ❌ Google removed the cache feature in 2024
- Google no longer provides cached page views
Wayback Machine (web.archive.org)
- Status: ✅ Working for historical tweets
- Many tweets were archived before they were deleted
- Search:
https://web.archive.org/web/*/twitter.com/username/status/* - Limited — only pages that were crawled
- Cost: Free
Other Archives
- Archive.today (archive.ph): ✅ Works, users can save snapshots of tweets
- Politwoops (by ProPublica): ✅ Tracks deleted politician tweets
- Thread Reader App (threadreaderapp.com): ✅ Archives tweet threads
- Community Notes data dump: ✅ Available on GitHub (twitter/communitynotes)
10. X Ads API
- Status: ✅ Working
- Access: Requires approved Ads API access + active ad account
- What you can do:
- Campaign management (create, schedule, manage)
- Custom/Tailored Audiences (upload your own data)
- Creatives management (draft posts, cards, media)
- Advertising analytics — granular campaign insights, impressions, engagements, conversions
- Data available: Campaign performance metrics, audience insights, spend data
- Cost: Requires active ad spend + API access approval
- Tools: Postman collection available,
xurlCLI tool
Gotcha: This is advertising data, not general public tweet data. Useful for marketers, not researchers.
11. X Analytics / Creator Studio
- Status: ✅ Available to account owners
- Access: analytics.x.com (for your own account)
- Data available:
- Tweet impressions, engagement rate, link clicks
- Follower demographics (interests, location, gender)
- Top tweets, mentions
- Video views, completion rates
- Monthly summary
- Export: CSV export available for tweet-level data
- Cost: Free (for your own account data)
- Gotcha: Only YOUR account's data. No access to other accounts' analytics.
X Premium / X Blue Analytics
- Premium subscribers get enhanced analytics including longer history
12. Grok on X
- Status: ✅ Active and integrated into X
- What Grok can access:
- Real-time X posts and conversations
- Trending topics and discussions
- Can search and summarize X content on any topic
- Powers the News API (Grok-curated trending stories)
- News API:
GET /2/news/{id}andGET /2/news/search— returns Grok-curated news with summaries, context entities, related post IDs - How to use for research:
- Ask Grok questions about what's being discussed on X
- Grok can analyze sentiment, summarize threads, identify trends
- Available via X Premium subscription or xAI API
- Limitations: Grok summaries may be inaccurate; limited to what Grok can see; not a structured data export tool
- Cost: X Premium subscription ($8-16/mo) for in-app Grok; xAI API for programmatic access
13. Community Notes API
- Status: ✅ Working (Pilot)
- Endpoints:
GET /2/notes/search/posts_eligible_for_notes— find posts eligible for notesGET /2/notes/search/notes_written— get notes you've writtenPOST /2/notes— create a community noteDELETE /2/notes/{id}— delete a notePOST /2/evaluate_note— evaluate before submitting- Requirements: Must be enrolled as an AI Note Writer in Community Notes
- Rate limits: 90 requests/15 min; 250 creates/day
- Cost: Pay-per-usage API pricing
Community Notes Open Data
- Status: ✅ Available
- All Community Notes data is published openly at communitynotes.x.com
- Downloadable TSV files with notes, ratings, and note status history
- Cost: Free
- This is the best source for Community Notes research
14. X Spaces
- Status: ✅ API available
- Endpoints:
- Lookup by Space ID (single or batch)
- Lookup by creator ID
- Search Spaces by keyword
- Get ticket buyers list
- Data available: Space metadata (title, state, host, speakers, participant count, scheduled start, language)
- Limitations:
- No audio streaming/recording via API
- Spaces are ephemeral — unavailable after ending
- Only live and scheduled Spaces are accessible
- Recording Spaces: No official API. Third-party tools and browser extensions exist (risky, ToS violation)
- X Activity API: Can subscribe to
spaces.startandspaces.endevents for specific users
15. Webhooks / Account Activity API
Webhooks (v2)
- Status: ✅ Working
- Types:
- Filtered Stream Webhooks: Receive matching posts via webhook instead of persistent connection
- Account Activity API: Receive events for subscribed accounts
- Requirements: HTTPS endpoint, CRC validation, respond within 10 seconds
- Security: HMAC-SHA256 signature verification
X Activity API (XAA) — NEW
- Status: ✅ Working
- A major new addition — real-time event subscriptions
- Delivery: Persistent HTTP stream OR webhooks
- Event types:
- Profile updates (bio, picture, banner, username, location, URL, badges)
- Follow/unfollow events
- Spaces start/end
- DM sent/received (private — requires OAuth)
- News events (Grok-curated trends)
- Subscription limits:
- Self-serve: 1,000 subscriptions
- Enterprise: 50,000
- Partner: 100,000
- Latency: Sub-second
Legacy Account Activity API (Enterprise/GNIP)
- Status: ✅ Available via enterprise
- Posts, DMs, likes, follows, and more
16. Legal / Compliance Considerations
Terms of Service
- Scraping is explicitly prohibited by X's ToS
- Automated access without API authorization is prohibited
- X has sued scrapers (e.g., multiple lawsuits in 2023-2024)
- X sends cease-and-desist letters to scraping projects
Rate Limits
- API: Per-endpoint rate limits (varies); usage-based billing
- Aggressive rate limiting on unauthenticated access
- IP-based rate limiting on x.com itself
GDPR / Privacy
- X data may contain personal data subject to GDPR
- Redistribution of X data has restrictions
- Must comply with X's Developer Agreement
IP Bans
- X actively blocks datacenter IPs, VPN ranges, and known scraping IPs
- Residential proxies are harder to block but still detected via behavioral analysis
Research Exceptions
- No formal academic exemption exists anymore
- Some researchers argue fair use; untested in most jurisdictions for API scraping
17. Open Source Alternatives
Bluesky (AT Protocol)
- Status: ✅ Fully open, growing rapidly
- API: Free, open, no authentication required for public data
- Public API:
https://public.api.bsky.app - Endpoints: Profiles, posts, feeds, search, notifications, moderation, lists
- Firehose: Available — can subscribe to all public events
- Rate limits: Generous
- Cost: Free
- Libraries: Python (atproto), TypeScript (@atproto/api), etc.
- Best for: Researchers who need open social media data
Mastodon (ActivityPub/Fediverse)
- Status: ✅ Open, federated
- API: RESTful, well-documented, free
- Data: Public posts, profiles, trends per instance
- Limitations: Federated nature means no single firehose; each instance has its own data
- Cost: Free
Threads (Meta)
- Status: ⚠️ Limited API (via Instagram Graph API)
- Less open than Bluesky
Recommendation: If you need open social media data for research, Bluesky is the best alternative — fully open protocol, free firehose, no authentication barriers for public data.
18. Data Marketplaces
| Source | What's Available | Status |
|---|---|---|
| Kaggle | Historical Twitter datasets (pre-2023 mostly) | ✅ Free |
| Archive.org | Twitter Stream archives (historical) | ✅ Free |
| Pushshift | Limited Twitter data (Reddit-focused) | ⚠️ Limited |
| Bright Data | Web scraping infrastructure + datasets | ✅ Paid |
| Data.world | Some Twitter datasets | ✅ Mixed |
| GDELT | Tracks URLs shared on Twitter | ✅ Free |
| Docnow Catalog | Tweet ID datasets for research | ✅ Free (IDs only — must hydrate) |
Gotcha: Most historical datasets contain only tweet IDs (not full text) due to ToS. You must "hydrate" them via the API, but deleted tweets won't hydrate.
19. OSINT Tools
| Tool | What It Does | Status |
|---|---|---|
| Social Searcher | Social media search engine | ✅ Working (limited X data) |
| Twint | OSINT Twitter scraper | ❌ Dead |
| Maltego | Link analysis with Twitter transforms | ⚠️ Limited (API-dependent) |
| SpiderFoot | OSINT automation | ⚠️ Limited X modules |
| Sherlock | Username search across platforms | ✅ Works for checking X usernames |
| Maigret | Advanced username search | ✅ Works |
| TweetBeaver | Twitter analytics/OSINT | ⚠️ Unreliable |
| Followerwonk (Moz) | Twitter analytics | ⚠️ Limited after API changes |
| Botometer (Indiana U) | Bot detection | ⚠️ Depends on API access |
| Bellingcat tools | Various OSINT | ⚠️ Twitter-specific tools degraded |
Verdict: Most dedicated Twitter OSINT tools are degraded or broken. The API is now the primary path for any reliable OSINT work on X.
20. Embedded Tweets / oEmbed
- Status: ✅ Working
- oEmbed API: Returns embed HTML for any public tweet
- Endpoint:
https://publish.twitter.com/oembed?url=https://x.com/user/status/123 - Embedded Tweets: Copy/paste HTML or use oEmbed API
- Embedded Timelines: Profile or list timeline embeds
- X Publish: publish.x.com — visual embed builder
- Tweet/Follow Buttons: Still available
- WordPress plugin: Available
- Cost: Free
- Gotcha: Embedded tweets load via JavaScript from X's servers; they break if the tweet is deleted
🏆 Recommendations: What's Actually Worth Using Today
For Developers / Businesses
- X API v2 (pay-per-usage) — The primary legitimate path. Full-archive search, streaming, all data types.
- X Activity API (XAA) — Excellent for real-time monitoring of specific accounts/events.
- Webhooks — Better than polling for event-driven architectures.
For Researchers
- X API v2 — Pay-per-usage; budget carefully.
- Community Notes open data — Free, comprehensive, downloadable.
- Bluesky AT Protocol — Free, open alternative with growing user base.
- Wayback Machine / Archive.today — For historical content.
For Social Listening / Marketing
- Third-party providers (Brandwatch, Meltwater, Sprinklr) — Easier than direct API, bundled with analytics.
- X Ads API — If you're running ads anyway.
- Grok / News API — For trending topic analysis.
For OSINT / Investigations
- X API v2 — Legitimate, reliable.
- Nitter (self-hosted) — For privacy-preserving browsing + RSS.
- Archive services — Wayback Machine, archive.today.
- Community Notes data — Publicly available.
❌ Avoid
- snscrape / Twint — Broken or unreliable
- Unauthenticated scraping — Will get you blocked immediately
- Browser automation at scale — Fragile, expensive, risky
💡 Sleeper Pick
- Bluesky's open API + firehose — If your use case can work with Bluesky data instead of X data, it's dramatically easier, cheaper, and more reliable. Growing user base makes it increasingly viable.
Report compiled February 12, 2026. X's API landscape changes frequently — verify current pricing and availability at docs.x.com and console.x.com.