API Endpoint
GET /api
Transform a webpage into structured JSON by providing its URL.The URL of the webpage to convert. Must be a valid HTTP or HTTPS URL.
Example Request
Response Structure
Indicates if the request was successful
The structured data extracted from the webpage
The page title extracted from the
<title> tagArray of unique absolute URLs extracted from
<a> tags. Excludes anchor links (#) and javascript: links. All relative URLs are resolved to absolute URLs.Array of unique absolute image URLs extracted from
<img> tags. All relative URLs are resolved to absolute URLs.Example Response
Error Responses
Invalid URL
Fetch Error
Deployment
Use Cases
- Web Scraping: Extract structured data from websites without parsing HTML
- Content Analysis: Analyze page structure and heading hierarchy
- Link Extraction: Build sitemaps or discover related content
- Search Indexing: Extract text and structure for custom search engines
- Content Migration: Extract content when migrating between platforms
- SEO Audits: Analyze heading structure and internal linking
Technical Details
- Built with Hono framework
- Runs on Cloudflare Workers
- Regex-based HTML parsing for fast extraction
- Automatically resolves relative URLs to absolute URLs
- Deduplicates links and images
- Returns clean, structured JSON ready for further processing
Processing Notes
- All HTML tags within headings are stripped, returning clean text
- Anchor links (starting with #) are excluded from the links array
- JavaScript URLs (javascript:) are excluded from the links array
- Duplicate links and images are automatically removed
- Relative URLs are resolved based on the requested page URL