Mistral adds a new application programming interface that converts any PDF document to a ready -made discount file


Thursday, the large French language model (LlmDeveloper mistake A new application programming interface has launched developers who deal with complex PDF documents. Bad OCR It is an optical letters programming interface (OCR) that can convert any PDF into a text file to facilitate the absorption of artificial intelligence models.

LLMS, which supports common Genai tools like Openai’s Chatgpt, works well with raw text. So companies that want to create the workflow of their own Amnesty International know that it is extremely important to store and index data in a clean format so that this data can be reused to process artificial intelligence.

Unlike most OCR applications, OCR Mistral is a multimedia application programming interface, which means that it can be discovered when there are illustrations and interlocking pictures with blocks of the text. API is in the view of OCR creates surrounding boxes around these graphic elements and includes them in the output.

Also, the wrong OCR does not come out a large wall of the text; The output is coordinated in Markdown, which is the construction of a total format that developers use to add links, heads and other formatting elements to an ordinary text file.

LLMS depends greatly on their training data discounts. Likewise, when you use AI, such as Mistral’s Le Chat or Openai’s Chatgpt, it often uses a reduction in creating bullet lists, adding bonds or putting some elements in bold. Applications coordination applications smoothly, the frequency directing in the rich text. For this reason, the raw text – and the reduction – has become more important in recent years as Genai has flourished.

“Over the years, the organizations have accumulated many documents, often in PDF formats or slides, which are not accessible to LLMS, especially RAG systems. With Mistral OCR, our customers can now convert rich and sophisticated documents into readable content in all languages.”

He added: “This is a decisive step towards adoption on a large scale for artificial intelligence aides in companies that need to simplify access to their extensive internal documents.”

Mistral OCR is available on Mistral API or through its cloud partners (AWS, Azure, Google Cloud Vertex, etc.). For companies that work with classified or sensitive data, Mistral provides local publishing.

According to the Paris -based Mistral OCR, Mistral OCR is better than Google, Microsoft and Openai. The company has tested its OCR model with complex documents that include mathematical expressions (latex format), advanced plannings or tables. It is also supposed to be better performance with non -English documents.

Image credits:mistake

Given that Mistral OCR does one thing and just one thing, the company believes it is also faster than it is. This is not a surprise if you compare it with multimedia LLM like GPT-4O, which also has OCR capabilities (between a lot Other features).

Mistral also uses Mistral OCR for his artificial intelligence assistant Cat. When the user downloads a PDF file, the company UCR Mistral uses the background to understand what is in the document before processing the text.

Companies and developers are likely to use the OCR Mistral system with the RAG system (also known as Revival-Augmenty) to use multimedia documents as LLM inputs. There are many possible use. For example, we can imagine the use of law firms that they use to help them quickly through large quantities of documents.

RAG is a technique used to recover data and use it as a context with the AI ​​Model.

Leave a Reply

Your email address will not be published. Required fields are marked *