Doc to Text API API ID: 2677

Unlock the power of data with DocToText API – your ultimate solution for seamless document conversion. From DOC and PDF to images and emails, effortlessly transform diverse formats into plain text and HTML. Whether it's a small task or a large-scale project, experience top-tier OCR and email parsing capabilities. Simplify your data extraction journey today.

Use this API from your AI agent via MCP

Works with OpenClaw, Claude Code/Desktop, Cursor, Windsurf, Cline and any MCP-compatible AI client.

Docs & setup

Create a skill by wrapping this MCP: https://mcp.zylalabs.com/mcp?apikey=YOUR_ZYLA_API_KEY

About the API:

Empower Your Data Journey with DocToText API

DocToText API stands as the cornerstone of efficient data extraction, tailored for both small tasks and large-scale projects. This versatile tool seamlessly converts an extensive array of formats, including DOC, XLS, PPT, PDF, various email formats, and images, into plain text and HTML.

Advanced-Data Extraction Capabilities:

At the heart of DocToText API lies its cutting-edge OCR technology. Whether dealing with scanned documents, images, or complex PDFs, its high-grade, scriptable, and trainable OCR ensures accurate and reliable text extraction. This is complemented by robust email parsing capabilities, allowing seamless processing of EML, PST, OST, and other email formats.

Comprehensive Format Support:

DocToText API supports an impressive range of formats, from common office files like DOCX and XLSX to specialized formats such as iWork (PAGES, NUMBERS, KEYNOTE) and Outlook (PST, OST). Its flexibility extends to image formats like JPG, PNG, and TIFF, enabling extraction from various sources.

Seamless Integration for Every Project:

Whether you're managing a data-intensive enterprise application, conducting research, or automating routine office tasks, DocToText API integrates effortlessly into your workflow. Its adaptability allows for easy incorporation into diverse platforms, ensuring smooth data processing without disrupting your existing systems.

Customizable and Scalable:

DocToText API’s scriptable and trainable OCR capabilities enable customization for specific project requirements. It scales seamlessly, accommodating both small-scale tasks and high-volume data extraction projects. Its robustness ensures accuracy and consistency, even in demanding environments.

Reliable and Future-Ready:

DocToText API not only caters to your current needs but is also future-ready, accommodating emerging formats and technologies. Its continuous updates and enhancements guarantee that you're always equipped with the latest tools for efficient data extraction, making it an indispensable asset for businesses and developers alike. Simplify your data extraction challenges with DocToText API, your key to accurate, reliable, and scalable text extraction solutions.

What this API receives and what your API provides (input/output)?

Pass any document of your choice and receive the recognized text.

Formats: DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP)

What are the most common use cases of this API?

Digital Archiving and Document Management: Businesses and organizations can use the DocToText API to convert large volumes of documents, including scanned images and PDFs, into searchable and editable text. This facilitates efficient digital archiving and document management, enabling easy retrieval and editing of information. Libraries, historical societies, and governmental organizations can digitize historical documents for preservation and research purposes.
Business Intelligence and Data Analysis: Enterprises can employ the DocToText API to extract textual data from various reports, invoices, and financial documents. By converting this data into structured formats, such as CSV or JSON, businesses can perform in-depth data analysis. This use case is particularly valuable for financial institutions, market research firms, and e-commerce platforms, helping them gain valuable insights from textual data.
Content Aggregation and Analysis: Media monitoring companies, news agencies, and content aggregators can utilize the DocToText API to extract text from articles, blogs, and social media posts. By converting this unstructured data into readable text, these organizations can automate the process of content aggregation. Natural Language Processing (NLP) algorithms can then be applied for sentiment analysis, topic modeling, and other forms of content analysis.
Automated Customer Support and Service: Companies with large volumes of customer interactions, such as emails and support tickets, can benefit from the DocToText API. By converting customer queries and feedback into plain text, businesses can employ chatbots and automated systems to provide quick and accurate responses. This not only improves customer satisfaction by providing timely support but also reduces the workload on human customer support agents.
Data Enrichment for Machine Learning Models: Machine learning developers and data scientists can use the DocToText API to preprocess textual data for training machine learning models. By converting documents into plain text, this API ensures that the data is in a consistent format, ready for feature extraction and model training. This use case is crucial in various applications, including sentiment analysis, language translation, and text summarization.

Are there any limitations to your plans?

Besides the number of API calls available for the plan, there are no other limitations.

API Documentation

Endpoints

Extract Text Endpoint ID: 2781

Send file for extraction

Formats include:

DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP),
OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE),
ODFXML (FODP, FODS, FODT), PDF, EML, HTML, Outlook (PST, OST),
Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP)

                                                                            
POST https://pr157-testing.zylalabs.com/api/2677/doc+to+text+api/2781/extract+text

Extract Text - Endpoint Features

Object	Description
`Request Body`	[Required] File Binary

Test Endpoint

API EXAMPLE RESPONSE

       
                                                                                                        
                                                                                                                                                                                                                                                                                                                                        

IP Address Classes Range:

Class                           IP Address Range (Theoretical)  Application / Used for        
A                               0.0.0.0 to 127.255.255.255      Very large networks           
B                               128.0.0.0 to 191.255.255.255    Medium networks               
C                               192.0.0.0 to 223.255.255.255    Small networks                
D                               224.0.0.0 to 239.255.255.255    Multicast

Extract Text - CODE SNIPPETS


    curl --location 'https://zylalabs.com/api/2677/doc+to+text+api/2781/extract+text' \
    --header 'Content-Type: application/json' \ 
    --form 'image=@"FILE_PATH"'

API Access Key & Authentication

After signing up, every developer is assigned a personal API access key, a unique combination of letters and digits provided to access to our API endpoint. To authenticate with the Doc to Text API simply include your bearer token in the Authorization header.

Headers

Header	Description
`Authorization`	[Required] Should be `Bearer access_key`. See "Your API Access Key" above when you are subscribed.

Questions

Simple Transparent Pricing

No long-term commitment. Upgrade, downgrade, or cancel anytime. Free Trial includes up to 50 requests.

Monthly Annually

(Save 2 months with annual billing 🎉)

💫Basic

$99.99/Month

1,000 Requests / Month
Then $0.1299870 per request if limit exceeded.
Rate Limit: 60 reqs per minute
Specialized Customer Support
Real-Time API Monitoring
Unlimited Data Transfer Included

Free 7-day trial

No commitment. Cancel anytime

Popular

⚡Pro

$199.99/Month

2,500 Requests / Month
Then $0.1299870 per request if limit exceeded.
Rate Limit: 60 reqs per minute
Specialized Customer Support
Real-Time API Monitoring
Unlimited Data Transfer Included

Free 7-day trial

No commitment. Cancel anytime

🔥Pro Plus

$499.99/Month

5,000 Requests / Month
Then $0.1299870 per request if limit exceeded.
Rate Limit: 120 reqs per minute
Specialized Customer Support
Real-Time API Monitoring
Unlimited Data Transfer Included

Free 7-day trial

No commitment. Cancel anytime

💫Basic

$83.33/Month

1,000 Requests / Month
Then $0.1299870 per request if limit exceeded.
Rate Limit: 60 reqs per minute
Specialized Customer Support
Real-Time API Monitoring
Unlimited Data Transfer Included

Free 7-day trial

No commitment. Cancel anytime

Popular

⚡Pro

$166.66/Month

2,500 Requests / Month
Then $0.1299870 per request if limit exceeded.
Rate Limit: 60 reqs per minute
Specialized Customer Support
Real-Time API Monitoring
Unlimited Data Transfer Included

Free 7-day trial

No commitment. Cancel anytime

🔥Pro Plus

$416.66/Month

5,000 Requests / Month
Then $0.1299870 per request if limit exceeded.
Rate Limit: 120 reqs per minute
Specialized Customer Support
Real-Time API Monitoring
Unlimited Data Transfer Included

Free 7-day trial

No commitment. Cancel anytime

🚀 Enterprise

Starts at
$ 10,000/Year

Custom Volume
Custom Rate Limit
Specialized Customer Support
Real-Time API Monitoring

Book a Call

Customer favorite features

✔︎ Only Pay for Successful Requests
✔︎ Free 7-Day Trial
✔︎ Multi-Language Support
✔︎ One API Key, All APIs.
✔︎ Intuitive Dashboard

✔︎ Comprehensive Error Handling
✔︎ Developer-Friendly Docs
✔︎ Postman Integration
✔︎ Secure HTTPS Connections
✔︎ Reliable Uptime

Doc to Text API FAQs

What is the DocToText API, and what does it do?

The DocToText API is a data extraction tool that converts a variety of document formats, including DOC, PDF, images, and emails, into plain text and HTML. It utilizes advanced OCR and email parsing capabilities to extract text from scanned documents and emails, making the content easily accessible for further processing.

What document formats are supported by the DocToText API?

The DocToText API supports a wide range of formats, including DOC, XLS, PPT, PDF, various email formats (EML, PST, OST), and image formats (JPG, PNG, TIFF). It also handles specialized formats like iWork (PAGES, NUMBERS, KEYNOTE) and Outlook (PST, OST), ensuring compatibility with diverse data sources.

How accurate is the OCR technology used by the DocToText API?

The OCR technology integrated into the DocToText API is of high-grade quality. It is designed to accurately recognize text from scanned documents, images, and PDFs, ensuring reliable extraction even from complex or low-quality input sources.

Can the API handle large-scale data extraction projects?

Yes, the DocToText API is well-suited for both small tasks and large-scale data extraction projects. Its scalability allows it to efficiently process high volumes of documents, making it ideal for applications requiring extensive data extraction.

Is the API capable of extracting formatted text and images from documents?

The primary functionality of the DocToText API is to extract plain text and HTML from documents. While it focuses on textual content, it may not retain intricate formatting or images during the conversion process.

What type of data does the DocToText API return?

The DocToText API returns extracted text in plain text and HTML formats. This includes recognized text from various document types, such as DOC, PDF, and images, allowing users to easily access and manipulate the content.

What are the key fields in the response data?

The response data primarily includes the extracted text content. Depending on the document type, it may also contain metadata such as the original file name, format, and any relevant processing information.

How is the response data organized?

The response data is structured in a JSON format, typically containing fields for the extracted text, file metadata, and any error messages if applicable. This organization allows for easy parsing and integration into applications.

What parameters can be used with the endpoint?

The endpoint accepts parameters such as the document file (in supported formats), and optional settings for OCR customization, such as language selection or specific extraction options to enhance accuracy.

How can users customize their data requests?

Users can customize requests by specifying parameters like the desired output format (plain text or HTML) and selecting OCR settings, such as language or extraction preferences, to tailor the results to their needs.

What types of information are available through the API?

The API provides access to textual data extracted from documents, including scanned images, emails, and various file formats. This enables users to retrieve information for digital archiving, data analysis, and content aggregation.

How is data accuracy maintained?

Data accuracy is maintained through advanced OCR technology that is scriptable and trainable. Continuous updates and enhancements ensure the API adapts to new formats and improves extraction reliability over time.

What are typical use cases for this API?

Typical use cases include digital archiving of documents, data analysis for business intelligence, content aggregation for media monitoring, and preprocessing text for machine learning applications, enhancing data accessibility and usability.

General FAQs

What is your refund policy?

Please have a look at our Refund Policy: https://zylalabs.com/terms#refund

How do I get an API key?

To obtain your API key, you first need to sign in to your account and subscribe to the API you want to use. Once subscribed, go to your Profile, open the Subscription section, and select the specific API. Your API key will be available there and can be used to authenticate your requests.

Can I switch APIs during the free trial?

You can’t switch APIs during the free trial. If you subscribe to a different API, your trial will end and the new subscription will start as a paid plan.

What happens if I don’t cancel before the trial ends?

If you don’t cancel before the 7th day, your free trial will end automatically and your subscription will switch to a paid plan under the same plan you originally subscribed to, meaning you will be charged and gain access to the API calls included in that plan.

When does the free trial end?

The free trial ends when you reach 50 API requests or after 7 days, whichever comes first.

Can I use the free trial more than once?

No, the free trial is available only once, so we recommend using it on the API that interests you the most. Most of our APIs offer a free trial, but some may not include this option.

Do you offer a free trial?

Yes, we offer a 7-day free trial that allows you to make up to 50 API calls at no cost, so you can test our APIs without any commitment.

What is Zyla API Hub?

Zyla API Hub is like a big store for APIs, where you can find thousands of them all in one place. We also offer dedicated support and real-time monitoring of all APIs. Once you sign up, you can pick and choose which APIs you want to use. Just remember, each API needs its own subscription. But if you subscribe to multiple ones, you'll use the same key for all of them, making things easier for you.

What currencies and payment methods are allowed?

Prices are listed in USD (United States Dollar), EUR (Euro), CAD (Canadian Dollar), AUD (Australian Dollar), and GBP (British Pound). We accept all major debit and credit cards. Our payment system uses the latest security technology and is powered by Stripe, one of the world's most reliable payment companies. If you have any trouble paying by card, just contact us at [email protected]

Additionally, if you already have an active subscription in any of these currencies (USD, EUR, CAD, AUD, GBP), that currency will remain for subsequent subscriptions. You can change the currency at any time as long as you don't have any active subscriptions.

Why can't I pay with my local currency even though I see it on the pricing page?

The local currency shown on the pricing page is based on the country of your IP address and is provided for reference only. The actual prices are in USD (United States Dollar). When you make a payment, the charge will appear on your card statement in USD, even if you see the equivalent amount in your local currency on our website. This means you cannot pay directly with your local currency.

My payment was declined, what should I do?

Occasionally, a bank may decline the charge due to its fraud protection settings. We suggest reaching out to your bank initially to check if they are blocking our charges. Also, you can access the Billing Portal and change the card associated to make the payment. If these does not work and you need further assistance, please contact our team at [email protected]

How will I be charged for my API subscription?

Prices are determined by a recurring monthly or yearly subscription, depending on the chosen plan.

How will my API calls be deducted from my plan?

API calls are deducted from your plan based on successful requests. Each plan comes with a specific number of calls that you can make per month. Only successful calls, indicated by a Status 200 response, will be counted against your total. This ensures that failed or incomplete requests do not impact your monthly quota.

How does your billing cycle work?

Zyla API Hub works on a recurring monthly subscription system. Your billing cycle will start the day you purchase one of the paid plans, and it will renew the same day of the next month. So be aware to cancel your subscription beforehand if you want to avoid future charges.

How do I upgrade my current subscription plan with an API?

To upgrade your current subscription plan, simply go to the pricing page of the API and select the plan you want to upgrade to. The upgrade will be instant, allowing you to immediately enjoy the features of the new plan. Please note that any remaining calls from your previous plan will not be carried over to the new plan, so be aware of this when upgrading. You will be charged the full amount of the new plan.

How can I see the remaining number of API calls I can make this month?

To check how many API calls you have left for the current month, refer to the 'X-Zyla-API-Calls-Monthly-Remaining' field in the response header. For example, if your plan allows 1,000 requests per month and you've used 100, this field in the response header will indicate 900 remaining calls.

How do I find out the maximum number of API requests allowed in my subscription plan?

To see the maximum number of API requests your plan allows, check the 'X-Zyla-RateLimit-Limit' response header. For instance, if your plan includes 1,000 requests per month, this header will display 1,000.

How do I know when my rate limit will reset?

The 'X-Zyla-RateLimit-Reset' header shows the number of seconds until your rate limit resets. This tells you when your request count will start fresh. For example, if it displays 3,600, it means 3,600 seconds are left until the limit resets.

Can I cancel anytime?

Yes, you can cancel your plan anytime by going to your account and selecting the cancellation option on the Billing page. Please note that upgrades, downgrades, and cancellations take effect immediately. Additionally, upon cancellation, you will no longer have access to the service, even if you have remaining calls left in your quota.

What happens if I forget to cancel my free trial?

After 7 days, you will be charged the full amount for the plan you were subscribed to during the trial. Therefore, it's important to cancel before the trial period ends. Refund requests for forgetting to cancel on time are not accepted.

How many calls can I make during the free trial?

When you subscribe to an API free trial, you can make up to 50 API calls. If you wish to make additional API calls beyond this limit, the API will prompt you to perform an "Start Your Paid Plan." You can find the "Start Your Paid Plan" button in your profile under Subscription -> Choose the API you are subscribed to -> Pricing tab.

When are Payout Orders processed?

Payout Orders are processed between the 20th and the 30th of each month. If you submit your request before the 20th, your payment will be processed within this timeframe.

If I have any problems, who I should contact?

You can contact us through our chat channel to receive immediate assistance. We are always online from 8 am to 5 pm (EST). If you reach us after that time, we will get back to you as soon as possible. Additionally, you can contact us via email at [email protected]

Start Free Trial

Service Level

100%

Response Time

0ms

Category:

Data & Analytics

Tags:

#DOC

#Text

#PDF

#OCR Capabilities

#Document Parsing

#Document Transformation

Related APIs

PDF into Text API

The PDF to Text API allows users to effortlessly convert PDF files into text or words. By utiliz...

Tools & Utilities Free 7-Day Trial

Service Level:

100%

Response Time:

0ms

Audio to Text Converter API

The Audio to Text Converter API accurately transforms spoken content into written text, offering...

Voice & Speech Technology Free 7-Day Trial

Service Level:

100%

Response Time:

731ms

PDF Text Extractor API

The PDF to Text API is a simple solution for converting PDF files into text or words. It allows...

Tools & Utilities Free 7-Day Trial

Service Level:

91%

Response Time:

2,513ms

Extract Text from Documents API

Seamlessly convert scanned documents into editable text using the Extract Text from Documents AP...

Visual Recognition & Imaging Free 7-Day Trial

Service Level:

100%

Response Time:

1,945ms

Audio to Text API

The Audio to Text API converts spoken language into written text with high accuracy, enabling re...

Voice & Speech Technology Free 7-Day Trial

Service Level:

100%

Response Time:

0ms

Audio To Text Conversion API

The Audio To Text Conversion API transforms audio into written text with high accuracy, enabling...

Voice & Speech Technology Free 7-Day Trial

Service Level:

100%

Response Time:

0ms

Photo to Text Conversion API

Convert photos to text accurately and fast with our Photo to Text Conversion API.

Visual Recognition & Imaging Free 7-Day Trial

Service Level:

100%

Response Time:

2,450ms

Text Extractor API

The TextExtractor API converts scanned images and documents into editable text, extracting and r...

Visual Recognition & Imaging Free 7-Day Trial

Service Level:

100%

Response Time:

3,168ms

MP3 to Text API

The MP3 to Text API transforms spoken language into written text with exceptional accuracy, allo...

Voice & Speech Technology Free 7-Day Trial

Service Level:

100%

Response Time:

0ms

Retrieve Document Text API

Effortlessly extract and retrieve text from documents with our reliable Retrieve Document Text A...

Visual Recognition & Imaging Free 7-Day Trial

Service Level:

100%

Response Time:

1,429ms

Website Performance Analyzer API

Analyze any web page URL to obtain detailed performance metrics, information about JavaScript ex...

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

18,532ms

Website Data Performance API

Retrieve and analyze website performance data effortlessly for optimized user experience.

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

5,878ms

Website Performance Metrics Retriever API

Fetch comprehensive performance metrics for your website to improve speed and user satisfaction.

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

20,003ms

Analyze Website Content Performance API

Uncover insights into your website's content performance and enhance engagement with our robust...

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

20,003ms

Website Speed Analyzer API

Evaluate your website's speed and performance metrics to achieve faster loads and happier users.

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

20,003ms

Web Traffic Watcher API

The Web Traffic Watcher API provides key performance metrics for any website, including traffic,...

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

947ms

Web Performance Data Capture API

Capture crucial web performance data with our API, enabling deep analysis for your website’s spe...

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

1,151ms

Web Speed Test API

Evaluate web performance by measuring server-client metrics, detect common issues, and generate...

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

5,864ms

Site Status Monitor API

Monitor and verify website statuses with precision using the Site Status Monitor API for reliabl...

Tools & Utilities Free 7-Day Trial

Service Level:

100%

Response Time:

710ms

Capture Website Performance API

Capture essential performance data for your website to identify areas for improvement in real-ti...

Data & Analytics Free 7-Day Trial

Service Level:

100%

Response Time:

173ms

Doc to Text API API ID: 2677

About the API:

What this API receives and what your API provides (input/output)?

What are the most common use cases of this API?

Are there any limitations to your plans?

What would you like to see? See the information or check the documentation?

API Documentation

Endpoints

API EXAMPLE RESPONSE

Extract Text - CODE SNIPPETS

API Access Key & Authentication

Questions

Simple Transparent Pricing

💫Basic

$99.99/Month

⚡Pro

$199.99/Month

🔥Pro Plus

$499.99/Month

💫Basic

$83.33/Month

⚡Pro

$166.66/Month

🔥Pro Plus

$416.66/Month

🚀 Enterprise

Starts at $ 10,000/Year

Customer favorite features

Doc to Text API FAQs

What is the DocToText API, and what does it do?

What document formats are supported by the DocToText API?

How accurate is the OCR technology used by the DocToText API?

Can the API handle large-scale data extraction projects?

Is the API capable of extracting formatted text and images from documents?

What type of data does the DocToText API return?

What are the key fields in the response data?

How is the response data organized?

What parameters can be used with the endpoint?

How can users customize their data requests?

What types of information are available through the API?

How is data accuracy maintained?

What are typical use cases for this API?

General FAQs

What is your refund policy?

How do I get an API key?

Can I switch APIs during the free trial?

What happens if I don’t cancel before the trial ends?

When does the free trial end?

Can I use the free trial more than once?

Do you offer a free trial?

What is Zyla API Hub?

What currencies and payment methods are allowed?

Why can't I pay with my local currency even though I see it on the pricing page?

My payment was declined, what should I do?

How will I be charged for my API subscription?

How will my API calls be deducted from my plan?

How does your billing cycle work?

How do I upgrade my current subscription plan with an API?

How can I see the remaining number of API calls I can make this month?

How do I find out the maximum number of API requests allowed in my subscription plan?

How do I know when my rate limit will reset?

Can I cancel anytime?

What happens if I forget to cancel my free trial?

How many calls can I make during the free trial?

When are Payout Orders processed?

If I have any problems, who I should contact?

Service Level

Response Time

Category:

Tags:

Related APIs

PDF into Text API

Audio to Text Converter API

PDF Text Extractor API

Extract Text from Documents API

Audio to Text API

Audio To Text Conversion API

Photo to Text Conversion API

Text Extractor API

MP3 to Text API

Starts at
$ 10,000/Year