Universal Document Extractor

The Heavstal Universal Extractor is a multi-format parsing engine. It accepts a direct URL to a document or code file and returns the file's content as clean plain text, which makes it well suited to feeding documents into LLMs or search indexes. Supported formats:

  • PDF (.pdf)
  • Word (.docx)
  • Plain Text (.txt)
  • Code (.js, .ts, .py, .java, .html, .css, .json, .md, etc.)
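As a rough illustration of the list above, a client could check a URL's file extension before sending it. This is only a sketch: `LISTED_EXTENSIONS`, `extensionOf`, and `isListedFormat` are hypothetical names, not part of the API. Because the list ends with "etc.", the set is incomplete, and a failed check does not prove the service would reject the file.

```javascript
// Extensions named in the list above. The list ends with "etc.", so this set is
// deliberately incomplete (assumption: the API may accept more than these).
const LISTED_EXTENSIONS = new Set([
  'pdf', 'docx', 'txt', 'js', 'ts', 'py', 'java', 'html', 'css', 'json', 'md'
]);

// Hypothetical helper: returns the lowercase extension of the URL's path,
// or '' when the last path segment has none. Throws TypeError on an invalid URL.
function extensionOf(fileUrl) {
  const { pathname } = new URL(fileUrl);
  const lastSegment = pathname.split('/').pop();
  const dot = lastSegment.lastIndexOf('.');
  return dot > 0 ? lastSegment.slice(dot + 1).toLowerCase() : '';
}

// Hypothetical helper: true when the URL's extension appears in the list above.
function isListedFormat(fileUrl) {
  return LISTED_EXTENSIONS.has(extensionOf(fileUrl));
}
```

Query strings are ignored because only the URL's path is inspected, so a link such as `https://example.com/report.PDF?download=1` is still treated as a PDF.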

POST /doc-extract

Field   Type     Required   Description
url     string   Yes        Direct URL to the file.

Example request:

const res = await fetch('https://heavstal.com.ng/api/v1/doc-extract', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    'x-api-key': 'YOUR_API_KEY'
  },
  body: JSON.stringify({ url: 'https://example.com/contract.docx' })
});

Example response:

{
  "status": "success",
  "creator": "HEAVSTAL TECH",
  "data": {
    "detected_type": "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
    "extension": "docx",
    "content": "Contract Agreement\nThis agreement is made between..."
  }
}
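
The request and response shown above can be wrapped in a small helper that returns only the `data` object. This is a minimal sketch rather than an official client: `buildExtractRequest`, `unwrapExtractResponse`, and `extractDocument` are hypothetical names. The error handling assumes that any `status` other than "success", or a non-2xx HTTP code, means the extraction failed, since the error format is not documented here. It also assumes a runtime with a global `fetch` (modern browsers, Node.js 18+).

```javascript
const ENDPOINT = 'https://heavstal.com.ng/api/v1/doc-extract';

// Builds the fetch options for one extraction request,
// mirroring the example request above.
function buildExtractRequest(fileUrl, apiKey) {
  return {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      'x-api-key': apiKey
    },
    body: JSON.stringify({ url: fileUrl })
  };
}

// Returns the `data` object of a successful response body.
// Assumption: any status other than "success" signals a failure.
function unwrapExtractResponse(body) {
  if (!body || body.status !== 'success' || !body.data) {
    throw new Error('Extraction failed: ' + JSON.stringify(body));
  }
  return body.data;
}

// Hypothetical wrapper: resolves to { detected_type, extension, content }.
async function extractDocument(fileUrl, apiKey) {
  const res = await fetch(ENDPOINT, buildExtractRequest(fileUrl, apiKey));
  if (!res.ok) {
    throw new Error('HTTP ' + res.status + ' from ' + ENDPOINT);
  }
  return unwrapExtractResponse(await res.json());
}
```

A call would then look like `const { content } = await extractDocument('https://example.com/contract.docx', 'YOUR_API_KEY');`. Keeping the request builder and the response check separate from the network call makes both easy to test without contacting the service.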