Powerful features for effortless data extraction
Everything you need to collect structured data from the web without writing a single line of scraping code.
Flexible Data Sources
Choose between search queries or direct URLs to target exactly the data you need. WebFetch handles the crawling and extraction for you.
- Search Query Mode
Let WebFetch search the web for you. Just provide keywords and we'll find the most relevant sources.
- Direct URL Mode
Specify exact URLs when you know where your data lives. Perfect for monitoring specific pages.
"query": "financial reports 2023 tech companies""urls": [
"https://example.com/reports/q1",
"https://example.com/reports/q2"
]"schema": {
"companyName": "string",
"revenue": "currency",
"growth": "percentage",
"reportDate": "date"
}Structured Data, Your Way
Define exactly what data you want to extract using our simple JSON schema format. Get precisely the data you need, in the format you want.
- Custom JSON Schema
Specify field names and data types to create a consistent structure for your extracted data.
- Data Type Support
Support for strings, numbers, dates, currencies, percentages, and more to ensure proper formatting.
Set It and Forget It
Define how often your jobs should run and for how long. WebFetch handles the scheduling so you don't have to.
- Flexible Intervals
Run jobs hourly, daily, or at custom intervals to match your data freshness requirements.
- Customizable Duration
Set jobs to run for days, weeks, or months depending on your monitoring needs.
"interval": "6h" // Every 6 hours
"interval": "24h" // Daily
"duration": "7d" // Run for 7 days
"duration": "30d" // Run for 30 days
"webhook_url": "https://your-api.example.com/webhook"GET /api/jobs/{job_id}/results
// Access your data anytimeFlexible Data Delivery
Choose how you want to receive your data. Send it directly to your systems via webhook or store it in our hosted database for API access.
- Webhook Delivery
Send data directly to your backend as it's collected. Perfect for real-time processing.
- Hosted Database
We store your data securely and make it available through our API. No backend required.
Built for Developers
A clean, intuitive interface that gets out of your way. Powerful features without the complexity.
- Simple Job Creation
Create new extraction jobs in minutes with our intuitive form-based interface.
- RESTful API
Access all WebFetch functionality programmatically through our comprehensive API.
curl -X POST https://api.webfetch.io/jobs \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d @job-config.jsonFrequently Asked Questions
How does WebFetch extract data from websites?
WebFetch uses advanced AI and pattern recognition to identify and extract the data you need based on your schema definition. You don't need to write any selectors or parsing logic.
Can I extract data from any website?
WebFetch works with most public websites. However, some sites with heavy JavaScript rendering or strict anti-scraping measures may have limitations. Our system is designed to respect robots.txt and site terms of service.
How accurate is the data extraction?
WebFetch achieves high accuracy for most structured data. The accuracy depends on the consistency of the source website's structure and the specificity of your schema. We continuously improve our extraction algorithms.
Is there a limit to how many jobs I can create?
Different plans have different limits on the number of jobs and extraction frequency. Check our pricing page for details on what each plan offers.
How is WebFetch different from writing my own scraper?
WebFetch eliminates the need to write and maintain scraping code, handle proxies, manage scheduling, or worry about site structure changes. We handle all the infrastructure so you can focus on using the data.
How long is data stored?
Data is stored according to your plan limits. Free plans typically store data for 30 days, while paid plans offer longer retention periods. You can always export your data at any time.