Stack Overflow / Stack Exchange Scraper
Search Stack Overflow and Stack Exchange by keyword or tags. Get structured questions with score, answers, views, tags, author, and link as JSON or CSV.
How it works
- 1Open it on Apify
Hit Run on Apify — it opens the tool in the cloud, no install.
- 2Set the inputs
Adjust
query,site,tags(sensible defaults are pre-filled). - 3Click Run
The tool runs on Apify’s cloud and collects the data for you.
- 4Export the results
Download as JSON, CSV or Excel, or pipe straight into your app, Google Sheets, or an AI agent.
Inputs
| Field | What it does | Type |
|---|---|---|
query | Keywords to search for in question titles and bodies (e.g. "async await", "git rebase conflict"). Can be left empty if you provide one or more Tags instead. | string |
site | Which Stack Exchange network site to search. | string |
tags | Comma-separated tags to filter by, e.g. "javascript,promise" or "python,pandas". Optional. A question must carry ALL listed tags. You can search by tags alone w | string |
sortBy | Ordering of results. "votes" = highest score first, "relevance" = best keyword match, "creation" = newest, "activity" = most recently active. | string |
maxItems | Maximum number of questions to return. The actor paginates the API (100 per page) until this many are collected or there are no more results. | integer |
notionConnector | Optional. Write each question as a page into your Notion when the run finishes. Authorize a Notion connector once in Settings → API & Integrations → MCP connect | string |
notionParentId | Optional. The Notion data source ID of the database to write into (only used if a Notion connector is set). Leave empty to create the pages privately in your wo | string |
What you get
A structured dataset — each result includes fields like:
answerCountbodycreatedAtisAnsweredownerNameownerReputationquestionIdscoretagstitleurlviewCountExport every run as JSON, CSV or Excel, or send it to your app, a database, Google Sheets, or an AI agent.
2 ready-to-run use cases
Top Pandas Questions on Stack Overflow by Votes
The highest-voted Stack Overflow questions for the pandas tag, ranked by score, with title, votes, views, answers and links. Handy for Python docs and FAQs.
Stack Overflow Keyword Search: async/await Questions
Searching Stack Overflow for async/await? Get matching questions ranked by relevance, with score, view counts and answers, exported to JSON or CSV.
Stack Overflow / Stack Exchange Scraper
Search Stack Overflow — or any of the big Stack Exchange sites (Server Fault, Super User, Ask Ubuntu, MathOverflow, Software Engineering) — by keyword and/or tags, and get back clean, structured questions. No API key, no login, no anti-bot.
It talks to the public api.stackexchange.com API directly, so results are fast and reliable. It paginates for you, dedupes by question id, and returns the question body as plain text (HTML stripped).
What you get per question
questionId, title, url, score, answerCount, viewCount, isAnswered, tags, ownerName, ownerReputation, createdAt (ISO), and body (the question text, HTML stripped).
Input
| Field | Notes |
|---|---|
query | Keywords (title + body). Optional if you give tags. |
site | stackoverflow, serverfault, superuser, askubuntu, mathoverflow, softwareengineering. |
tags | Comma-separated, e.g. javascript,promise. A question must carry all of them. |
sortBy | votes (default), relevance, creation, activity. |
maxItems | How many questions to collect (paginates the API in pages of 100). |
You must provide a query or at least one tag.
Output
One dataset row per question. Empty searches return a single diagnostic row and are not charged.
Example
{ "query": "async await", "site": "stackoverflow", "tags": "javascript", "sortBy": "votes", "maxItems": 50 }
Notes
The unauthenticated Stack Exchange API allows ~300 requests/day per IP. The actor respects the API's backoff directive (sleeps when asked) and stops cleanly when the daily quota is exhausted, returning a clear diagnostic instead of failing silently. For very large jobs, split by tag, keyword, or site.