Best Claude Code MCP Servers for Search Data Extraction

105 MCP servers tagged with “search-data-extraction”

Rippr
YouTube transcript extraction for AI agents. Clean text, timestamps, or structured JSON from any video. No API keys required. Install via `npx rippr-mcp`.
search-data-extraction, api, ai, agent
Job Searchoor
An MCP server for searching job listings with filters for date, keywords, remote work options, and more.
search-data-extraction
Aeo Cli
Audit URLs for AI crawler readiness — checks robots.txt, llms.txt, JSON-LD schema, and content density with 0-100 AEO scoring.
search-data-extraction, ai, llm
Open WebSearch
Web search using free multi-engine search (NO API KEYS REQUIRED) — Supports Bing, Baidu, DuckDuckGo, Brave, Exa, and CSDN.
search-data-extraction, go, api, ai
SerpMCP
Google SERP search including web, images, news, maps, places, videos, and knowledge graph results via Ace Data Cloud API.
search-data-extraction, go, api
Markcrawl
Crawl websites into clean Markdown, search pages, and extract structured data with LLMs. Built-in MCP server for web research and RAG pipelines.
search-data-extraction, llm, rag
Servers Kagi
Kagi search API integration
search-data-extraction, api
Nexus
AI-powered web search server using Perplexity Sonar models with source citations. Zero-install setup via NPX.
search-data-extraction, ai
Webpage Screenshot
An MCP server for taking screenshots of webpages to use as feedback during UI development.
search-data-extraction
Simple Arxiv
🐍 ☁️ MCP for LLM to search and read papers from arXiv
search-data-extraction, llm
Simple Pubmed
🐍 ☁️ MCP to search and read medical / life sciences papers from PubMed.
search-data-extraction
Nyt
Search articles using the NYTimes API
search-data-extraction, api
Server Rag Web Browser
An MCP server for Apify's open-source RAG Web Browser Actor to perform web searches, scrape URLs, and return content in Markdown.
search-data-extraction, browser, api, rag
Argus
Multi-provider search broker with automatic fallback, RRF ranking, content extraction, and budget enforcement.
search-data-extraction
Clojars
Clojars MCP Server for up-to-date dependency information on Clojure libraries
search-data-extraction
Arxiv
Search ArXiv research papers
search-data-extraction
Boikot
Model Context Protocol Server for looking up company ethics information. Learn about the ethical and unethical actions of major companies.
search-data-extraction
Brave Search
Web search capabilities using Brave's Search API
search-data-extraction, api
Activitypub
A comprehensive MCP server that enables LLMs to explore and interact with the Fediverse through ActivityPub protocol. Features WebFinger discovery, timeline fetching, instance exploration, and cross-platform support for Mastodon, Pleroma, Misskey, and other ActivityPub servers.
search-data-extraction, llm
Gopher
Modern, cross-platform MCP server enabling AI assistants to browse and interact with both Gopher protocol and Gemini protocol resources safely and efficiently. Features dual protocol support, TLS security, and structured content extraction.
search-data-extraction, go, security, ai
Unsplash
Unsplash photo search with proper attribution. Returns ready-to-use attribution text and HTML for each photo, making it easy for LLMs to build content pages with properly credited images. Includes search, random photos, and download tracking.
search-data-extraction, llm
Server Google News
Google News integration with automatic topic categorization, multi-language support, and comprehensive search capabilities including headlines, stories, and related topics through [SerpAPI](https://serpapi.com/).
search-data-extraction, go, api
Page Capture
MCP server that captures webpage screenshots, with viewport or full-page options and base64 PNG output.
search-data-extraction
Openai Websearch
This is a Python-based MCP server that provides OpenAI `web_search` built-in tool.
search-data-extraction, python, ai
Crawleo
Crawleo Search & Crawl API
search-data-extraction, api
Kagi Ken
Work with Kagi *without* API access (you'll need to be a customer, tho). Searches and summarizes. Uses Kagi session token for easy authentication.
search-data-extraction, api
Dappier
Enable fast, free real-time web search and access premium data from trusted media brands—news, financial markets, sports, entertainment, weather, and more. Build powerful AI agents with Dappier.
search-data-extraction, rust, ai, agent
Opennutrition
Local MCP server for searching 300,000+ foods, nutrition facts, and barcodes from the OpenNutrition database.
search-data-extraction
DealExpress Server
MCP Server for DealX platform
search-data-extraction
Trieve
Crawl, embed, chunk, search, and retrieve information from datasets through [Trieve](https://trieve.ai)
search-data-extraction, ai
Domain Search
Fast domain availability aggregator with pricing. Checks Porkbun, Namecheap, GoDaddy, RDAP & WHOIS. Includes bulk search, registrar comparison, AI-powered suggestions, and social media handle checking.
search-data-extraction, go, ai
Server Dumplingai
Access data, web scraping, and document conversion APIs by [Dumpling AI](https://www.dumplingai.com/)
search-data-extraction, scraping, api, ai
Melrose
Plays [Melrōse](https://melrōse.org) music expressions as MIDI
search-data-extraction
Decompose
Decompose text into classified semantic units with authority, risk, attention scores, and entity extraction. No LLM. Deterministic. Works as MCP server or CLI.
search-data-extraction, llm
Hn
An MCP server to search Hacker News, get top stories, and more.
search-data-extraction
Exa
A Model Context Protocol (MCP) server that lets AI assistants like Claude use the Exa AI Search API for web searches, giving AI models real-time web information in a safe and controlled way.
search-data-extraction, api, ai
Search1api
Search via search1api (requires paid API key)
search-data-extraction, api, ai
Youtube
MCP server that transcribes YouTube videos to text. Uses yt-dlp to download audio and OpenAI's Whisper-1 for more precise transcription than YouTube captions. Provide a YouTube URL and get back the full transcript, split into chunks for long videos.
search-data-extraction, ai
Multi Agents Research
A Ktor server and MCP server written in Kotlin that applies multi-agent schools in a flexible research system, usable for coding or for general-purpose research.
search-data-extraction, kotlin, agent
Agent Domain Service
AI-powered domain brainstorming, analysis, and availability checking via AgentDomainService.com. Generate creative domain names from descriptions, get AI scoring for brandability/memorability, and check real-time availability with pricing. No API keys required.
search-data-extraction, api, ai, agent
Paperswithcode
🐍 ☁️ MCP to search through PapersWithCode API
search-data-extraction, api
Unsplash
An MCP server for Unsplash image search.
search-data-extraction
Himalayas
Access tens of thousands of remote job listings and company information. This public MCP server provides real-time access to Himalayas' remote jobs database.
search-data-extraction
Searxng
A Model Context Protocol Server for [SearXNG](https://docs.searxng.org)
search-data-extraction
Claude Hackernews
An integration that allows Claude Desktop to interact with Hacker News using the Model Context Protocol (MCP).
search-data-extraction
Domain Availability
A Model Context Protocol (MCP) server that enables Claude Desktop to check domain availability across 50+ TLDs. Features DNS/WHOIS verification, bulk checking, and smart suggestions. Zero-clone installation via uvx.
search-data-extraction, ai
Rss Aggregator
Model Context Protocol Server for aggregating RSS feeds in Claude Desktop.
search-data-extraction
Naver Search
MCP server for Naver Search API integration, supporting blog, news, shopping search and DataLab analytics features.
search-data-extraction, api
Fetcher
MCP server that fetches web page content using a Playwright headless browser, with support for JavaScript rendering and intelligent content extraction, and outputs Markdown or HTML.
search-data-extraction, javascript, browser
G Search
A powerful MCP server for Google search that enables parallel searching with multiple keywords simultaneously.
search-data-extraction, go
Stocky
An MCP server for searching and downloading royalty-free stock photography from Pexels and Unsplash. Features multi-provider search, rich metadata, pagination support, and async performance for AI assistants to find and access high-quality images.
search-data-extraction, performance, ai
Read Website Fast
Fast, token-efficient web content extraction for AI agents - converts websites to clean Markdown while preserving links. Features Mozilla Readability, smart caching, polite crawling with robots.txt support, and concurrent fetching.
search-data-extraction, ai, agent
Screenshot Website Fast
Fast screenshot capture tool optimized for Claude Vision API. Automatically tiles full pages into 1072x1072 chunks for optimal AI processing with configurable viewports and wait strategies for dynamic content.
search-data-extraction, api, ai
Kagimcp
Official Kagi Search MCP Server
search-data-extraction
Anybrowse
Convert any URL to LLM-ready Markdown via real Chrome browsers. 3 tools: scrape, crawl, search. Free via MCP, pay-per-use via x402. Remote MCP endpoint: `https://anybrowse.dev/mcp`
search-data-extraction, browser, llm
Json Mcp Filter
Stop bloating your LLM context. Query & Extract only what you need from your JSON files.
search-data-extraction, llm
Web Analyzer
Extracts clean web content for RAG and provides Q&A about web pages.
search-data-extraction, rag
Tavily.git
Tavily AI search API
search-data-extraction, api, ai
Bing Search
Web search capabilities using Microsoft Bing Search API
search-data-extraction, api
Content Core
Extract content from URLs, documents, videos, and audio files using intelligent auto-engine selection. Supports web pages, PDFs, Word docs, YouTube transcripts, and more with structured JSON responses.
search-data-extraction
Linkedapi
MCP server that lets AI assistants control LinkedIn accounts and retrieve real-time data.
search-data-extraction, api, ai
Mineru
MCP server for MinerU document parsing API. Parse PDFs, images, DOCX, and PPTX with OCR (109 languages), batch processing (200 docs), page ranges, and local file upload. 73% token reduction with structured output.
search-data-extraction, api
Brightdata
Discover, extract, and interact with the web - one interface powering automated access across the public internet.
search-data-extraction
Brave Search
Web, Image, News, Video, and Local Point of Interest search capabilities using Brave's Search API
search-data-extraction, api, rest
Nab
Ultra-fast web fetcher and MCP server with HTTP/3, JS rendering, anti-fingerprinting, browser cookie auth, and 1Password integration. Fetches any URL as clean Markdown for AI context.
search-data-extraction, browser, ai
Webresearch
Search Google and do deep web research on any topic
search-data-extraction, go
Duckduckgo
Web search using DuckDuckGo
search-data-extraction, go
Local Rag
A "primitive" RAG-like web search Model Context Protocol (MCP) server that runs locally. No APIs needed.
search-data-extraction, api, rag
NyxDocs
Specialized MCP server for cryptocurrency project documentation management with multi-blockchain support (Ethereum, BSC, Polygon, Solana).
search-data-extraction, go, ai
Octagon Deep Research
Lightning-Fast, High-Accuracy Deep Research Agent
search-data-extraction, go, agent
Pdfmux
PDF extraction router with built-in MCP server. Classifies each page (digital, scanned, tables) and routes to the best backend (PyMuPDF, Docling, OCR, or optional LLM fallback). Per-page confidence scoring flags low-quality pages and auto-reextracts them — prevents silent RAG fa…
search-data-extraction, ai, llm, rag
Search
Highest Accuracy Web Search for AI
search-data-extraction, ai
Spectrawl
Unified web layer for AI agents. Search (8 engines), stealth browse, cookie auth, and act on 24 platforms. 5,000 free searches/month via Gemini Grounded Search.
search-data-extraction, ai, agent
Task
Highest Accuracy Deep Research and Batch Tasks MCP
search-data-extraction
Pearch
Best people search engine that reduces the time spent on talent discovery
search-data-extraction
Semanticapi
Natural language API discovery — search 700+ API capabilities, get endpoints, auth setup, and code snippets. Supports auto-discovery of new APIs.
search-data-extraction, api
Server Webcrawl
Advanced search and retrieval for web crawler data. Supports WARC, wget, Katana, SiteOne, and InterroBot crawlers.
search-data-extraction
Catalysishub
Unofficial MCP server for searching and retrieving scientific data from the Catalysis Hub database, providing access to computational catalysis research and surface reaction data.
search-data-extraction, react
Opentk
Access Dutch Parliament (Tweede Kamer) information including documents, debates, activities, and legislative cases through structured search capabilities (based on opentk project by Bert Hubert)
search-data-extraction
Server Deep Research
MCP server providing OpenAI/Perplexity-like autonomous deep research, structured query elaboration, and concise reporting.
search-data-extraction, ai
Wolframalpha
An MCP server that lets AI assistants use the Wolfram Alpha API for real-time access to computational knowledge and data.
search-data-extraction, api, ai
Gxtract
GXtract is an MCP server designed to integrate with VS Code and other compatible editors. It provides a suite of tools for interacting with the GroundX platform, bringing its document understanding capabilities directly into your development environment.
search-data-extraction, rag
Scrapeless
The Scrapeless Model Context Protocol service acts as an MCP server connector to the Google SERP API, enabling web search within the MCP ecosystem without leaving it.
search-data-extraction, go, api
Searchcraft
Official MCP server for managing Searchcraft clusters, creating a search index, generating an index dynamically from a data file, and importing data into a search index from a feed or local JSON file.
search-data-extraction
Searxng
An MCP Server to connect to SearXNG instances
search-data-extraction
Driflyte
The Driflyte MCP Server exposes tools that allow AI assistants to query and retrieve topic-specific knowledge from recursively crawled and indexed web pages.
search-data-extraction, ai
Serpapi
SerpApi MCP Server for Google and other search engine results. Provides multi-engine search across Google, Bing, Yahoo, DuckDuckGo, YouTube, eBay, and more with real-time weather data, stock market information, and flexible JSON response modes.
search-data-extraction, go, api
Shopsavvy
Complete product and pricing data solution for AI assistants. Search for products by barcode/ASIN/URL, access detailed product metadata, access comprehensive pricing data from thousands of retailers, view and track price history, and more.
search-data-extraction, ai
Linkmeta Api
Free URL metadata extraction API (Open Graph, Twitter Cards, favicons, JSON-LD). No API key required.
search-data-extraction, api
Rescuedogs
Search and discover rescue dogs from European and UK organizations with AI-powered personality matching and detailed profiles.
search-data-extraction, ai
Arxiv Latex
Get the LaTeX source of arXiv papers to handle mathematical content and equations
search-data-extraction
GeekNews
An MCP Server that retrieves and processes news data from the GeekNews site.
search-data-extraction
Tat
Query articles, verified statistics, wire feed, and social tools from [The Agent Times](https://theagenttimes.com), the AI-native newspaper covering the agent economy. 13 tools including search, comments, citations, and agent leaderboards. No API key required.
search-data-extraction, api, ai, agent
Enrichr
An MCP server that provides gene set enrichment analysis using the Enrichr API
search-data-extraction, api
Agentql
MCP server that provides [AgentQL](https://agentql.com)'s data extraction capabilities.
search-data-extraction, agent
Server Tavily
Tavily AI search API
search-data-extraction, api, ai
Urlbox
📇 🏠 A reliable MCP server for generating and managing screenshots, PDFs, and videos, performing AI-powered screenshot analysis, and extracting web content (Markdown, metadata, and HTML) via the [Urlbox](https://urlbox.com) API.
search-data-extraction, api, ai
Vectorize
[Vectorize](https://vectorize.io) MCP server for advanced retrieval, Private Deep Research, Anything-to-Markdown file extraction and text chunking.
search-data-extraction
Ncbi
Comprehensive NCBI/PubMed literature search server with advanced analytics, caching, MeSH integration, related articles discovery, and batch processing for all life sciences and biomedical research.
search-data-extraction
Webscraping Ai
Interact with [WebScraping.ai](https://webscraping.ai) for web data extraction and scraping.
search-data-extraction, scraping, api, ai
Webpeel
Smart web fetcher for AI agents with auto-escalation from HTTP to headless browser to stealth mode. Includes 9 MCP tools: fetch, search, crawl, map, extract, batch, screenshot, jobs, and agent. Achieved 100% success rate on a 30-URL benchmark.
search-data-extraction, browser, ai, agent
Baseline
MCP server that searches Baseline status using Web Platform API
search-data-extraction, api
Duckduckgo Mpc
This is a TypeScript-based MCP server that provides DuckDuckGo search functionality.
search-data-extraction, typescript, go
Youtube Summarize
MCP server that fetches YouTube video transcripts and optionally summarizes them. Supports multiple transcript formats (text, JSON, SRT, WebVTT), multi-language retrieval, and flexible YouTube URL parsing.
search-data-extraction
Zoomeye
Query network asset information through the ZoomEye MCP Server
search-data-extraction
