Full Wikipedia Article Text Scraper (Plain Text)
Need the complete body of Wikipedia articles? This task returns full plain-text content by title, ready for NLP, summarization, and text analysis.
How it works
- 1. Open it on Apify
  Hit Run on Apify. The tool opens in the cloud, with nothing to install.
- 2. Set the inputs
  Adjust searchQuery, pageTitles, and fullText (sensible defaults are pre-filled).
- 3. Click Run
  The tool runs on Apify’s cloud and collects the data for you.
- 4. Export the results
  Download as JSON, CSV or Excel, or pipe straight into your app, Google Sheets, or an AI agent.
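The same run-then-export flow can also be driven from code. The following is a minimal sketch, assuming the official `apify-client` Python package is installed; the actor ID and API token are not given on this page and must be supplied by the caller, and the helper names `make_client` and `run_and_collect` are hypothetical.

```python
from typing import Any


def make_client(token: str) -> Any:
    """Create an Apify API client (assumes `pip install apify-client`)."""
    # Imported lazily so run_and_collect() can be exercised without the package.
    from apify_client import ApifyClient

    return ApifyClient(token)


def run_and_collect(client: Any, actor_id: str, run_input: dict) -> list[dict]:
    """Start the actor, wait for it to finish, and return its dataset items."""
    run = client.actor(actor_id).call(run_input=run_input)
    if run is None:
        raise RuntimeError("the actor run did not return run details")
    # Each run writes its results to a default dataset.
    dataset_id = run["defaultDatasetId"]
    return list(client.dataset(dataset_id).iterate_items())
```

A call might look like `run_and_collect(make_client(token), actor_id, {"pageTitles": ["Alan Turing"], "fullText": True})`, where `token` and `actor_id` are your own values and the title is only an illustration.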
Inputs
| Field | What it does | Type |
|---|---|---|
| searchQuery | Keywords to search Wikipedia for (e.g. "machine learning"). Returns matching pages with snippet, word count, and URL. Leave empty if you instead provide exact P… | string |
| pageTitles | Exact Wikipedia article titles to fetch full data for (plain-text extract, thumbnail, categories, URL). Batched 50 at a time. Use this OR a search query. | array |
| fullText | Only applies in Page titles mode. When on, returns the whole article as plain text instead of just the intro paragraph(s). | boolean |
| language | Wikipedia language edition code, e.g. en, fr, de, es, ja. Picks the host {lang}.wikipedia.org. | string |
| maxItems | Maximum number of pages to return. In search mode the actor paginates until it reaches this. In page-titles mode it caps how many titles are fetched. | integer |
| notionConnector | Optional. Write each page as a page into your Notion when the run finishes. Authorize a Notion connector once in Settings → API & Integrations → MCP connectors,… | string |
| notionParentId | Optional. The Notion data source ID of the database to write into (only used if a Notion connector is set). Leave empty to create the pages privately in your wo… | string |
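To make the table concrete, here is a small sketch that assembles an input object from those fields. The field names come from the table; the helper names, the default values (`en`, 50), and the strict "exactly one mode" check are assumptions of this sketch, not documented actor behavior. The batching helper only illustrates what "batched 50 at a time" means; the actor does this itself.

```python
from typing import Optional


def build_run_input(
    search_query: str = "",
    page_titles: Optional[list[str]] = None,
    full_text: bool = False,
    language: str = "en",   # assumed default
    max_items: int = 50,    # assumed default
) -> dict:
    """Assemble an input object using the field names from the Inputs table."""
    query = search_query.strip()
    titles = [t.strip() for t in (page_titles or []) if t.strip()]
    # The table says to use page titles OR a search query; rejecting "both"
    # and "neither" is this sketch's own strict reading.
    if bool(query) == bool(titles):
        raise ValueError("provide exactly one of search_query or page_titles")
    if max_items < 1:
        raise ValueError("max_items must be at least 1")
    run_input: dict = {"language": language, "maxItems": max_items}
    if titles:
        run_input["pageTitles"] = titles
        run_input["fullText"] = full_text  # only applies in page-titles mode
    else:
        run_input["searchQuery"] = query
    return run_input


def wikipedia_host(language: str) -> str:
    """Map a language edition code to its host, e.g. 'fr' -> 'fr.wikipedia.org'."""
    return f"{language}.wikipedia.org"


def title_batches(titles: list[str], size: int = 50) -> list[list[str]]:
    """Split titles into consecutive groups of at most `size` items."""
    return [titles[i:i + size] for i in range(0, len(titles), size)]
```

For example, `build_run_input(page_titles=["Alan Turing"], full_text=True)` yields a page-titles input with `fullText` switched on, while passing only `search_query` yields a search-mode input without `fullText`.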
What you get
A structured dataset — each result includes fields like:
mode, pageid, size, snippet, timestamp, title, url, wordcount
Export every run as JSON, CSV or Excel, or send it to your app, a database, Google Sheets, or an AI agent.
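If you pull the results as JSON and want a flat file for analysis, a sketch like the following converts records to CSV locally. The column names are the fields listed here (mode, pageid, size, snippet, timestamp, title, url, wordcount); the helper name `records_to_csv` is hypothetical, and the value types are not specified on this page, so everything is written as plain text.

```python
import csv
import io

# Column order follows the field list on this page.
FIELDS = ["mode", "pageid", "size", "snippet", "timestamp", "title", "url", "wordcount"]


def records_to_csv(records: list[dict]) -> str:
    """Serialize result records to CSV text with one column per known field."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=FIELDS,
        extrasaction="ignore",  # drop any fields not in FIELDS
        restval="",             # leave missing fields empty
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()
```

Records that lack a field (for instance, a result without a snippet) simply get an empty cell, so rows from different modes can share one file.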
More use cases for Wikipedia Scraper
Wikipedia Search API: Find Pages by Keyword
Search Wikipedia by keyword and rank matching pages by relevance, with snippets and word counts so researchers can decide what to read first.
Bulk Wikipedia Fetch: Get Page Data by Title
Pass up to 50 page titles and get each Wikipedia entry's intro, thumbnail, and categories back as clean JSON for datasets and enrichment.