📄 Document Extraction: Bring Your Files to Life

Imagine your desk piled with files: PDF reports, printed contracts, scanned invoices, and even meeting notes. What if, instead of just reading them, you could have a conversation with them?

Document Extraction is the technology that makes this possible. It works as a "reading and interpretation" superpower for YuIA, allowing it to open, read, and understand the content of virtually any file, transforming static information into interactive knowledge.

📎 How to Upload a Document

Sending a file to YuIA is as simple as attaching a file to an email:

  1. Drag and drop the file directly into the chat window, or click the ➕ button next to the message box and select Upload Files.
  2. The file will appear attached to your message. Now just type your question about the document and send it.
  3. YuIA reads the content and responds based on it.

💡 Tip: You can send multiple files at once and ask questions that cross-reference information between them. E.g.: "Compare the results from the January report with February."

📋 Supported Formats

YuIA can read and extract information from a wide variety of formats:

Type | Formats
Documents | PDF (text and scanned), Word (.docx), PowerPoint (.pptx)
Spreadsheets | Excel (.xlsx), CSV
Text | TXT, Markdown (.md), HTML
Images | JPG, PNG, GIF, WebP (including photos of documents, whiteboards, or notes)
Handwriting | Photos of handwritten notes, drafts, and texts
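
If you prepare files with a script before uploading them, the table above can be turned into a quick pre-check. The sketch below is only an illustration: the names SUPPORTED_FORMATS and classify_file are hypothetical, the mapping is copied from the table, and nothing here reflects how YuIA itself detects file types.

```python
from pathlib import Path
from typing import Optional

# Hypothetical mapping copied from the Supported Formats table.
# This is NOT YuIA's internal logic, only a local pre-check.
SUPPORTED_FORMATS = {
    "Documents": {".pdf", ".docx", ".pptx"},
    "Spreadsheets": {".xlsx", ".csv"},
    "Text": {".txt", ".md", ".html"},
    "Images": {".jpg", ".png", ".gif", ".webp"},
}


def classify_file(filename: str) -> Optional[str]:
    """Return the table category for a filename, or None if its extension is not listed."""
    extension = Path(filename).suffix.lower()  # ".PDF" and ".pdf" are treated alike
    for category, extensions in SUPPORTED_FORMATS.items():
        if extension in extensions:
            return category
    return None
```

A file whose extension is not in the table returns None here, which only means it is not listed above.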

✨ What Can You Do in Practice?

Document extraction unlocks a world of possibilities. Here are some examples:

  • Chat with your PDFs: Upload a technical report and ask: "Summarize the key points in 5 bullet points" or "Find the section about last quarter's marketing budget."
  • Digitize the Physical World: Take a photo of a recipe from an old book, a service invoice, or your class notes. YuIA transforms the image into text you can search, edit, and analyze.
  • Extract Structured Data: Upload a stack of scanned invoices and ask: "Extract the supplier name, due date, and total amount from each one and give me the results as a table."
  • Analyze Spreadsheets: Upload an Excel spreadsheet and ask: "Which product had the most sales?" or "Create a bar chart with this data."
  • Preserve Context: The AI doesn't just read the text: it also understands the document's structure. It knows the difference between a heading, a paragraph, an image caption, and a footnote, which makes responses much more accurate.

🛠️ The Engines Behind the Magic

To handle so many types of files, YuIA uses different "lenses" or extraction engines, automatically choosing the best one for each task.

  • Optical Character Recognition (OCR): Converts images of text (from scanned PDFs or photos) into machine-readable digital text by recognizing the shapes of the letters.
  • Layout Analysis: Understands the visual structure of a page, identifying headings, tables, paragraphs, and lists, and preserving the original context of the information.
  • Entity Extraction: Finds and extracts specific information such as people's names, dates, monetary values, addresses, and phone numbers.
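
To make the idea of entity extraction more concrete, here is a minimal sketch that pulls dates and monetary values out of text that has already been extracted. It assumes ISO-style dates (YYYY-MM-DD) and dollar amounts, and the names extract_entities, DATE_PATTERN, and AMOUNT_PATTERN are hypothetical. It uses simple regular expressions for illustration only; it is not a description of the engine YuIA actually uses.

```python
import re
from typing import Dict, List

# Illustrative patterns only (assumptions, not YuIA's engine):
# dates written as YYYY-MM-DD and amounts written like $1,250.00 or $99.
DATE_PATTERN = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")
AMOUNT_PATTERN = re.compile(r"\$\d+(?:,\d{3})*(?:\.\d{2})?")


def extract_entities(text: str) -> Dict[str, List[str]]:
    """Return every date and dollar amount found in the text, in order of appearance."""
    return {
        "dates": DATE_PATTERN.findall(text),
        "amounts": AMOUNT_PATTERN.findall(text),
    }


if __name__ == "__main__":
    sample = "Invoice from Acme Ltd, due 2024-03-15, total $1,250.00."
    print(extract_entities(sample))
```

Fixed patterns like these miss anything written in another style, such as "15 March 2024" or amounts in other currencies, so treat the sketch as a picture of the concept and not as a working substitute.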

The platform picks the combination of engines for each file automatically, aiming for the most accurate extraction possible.

💾 Saving Documents for Permanent Use

Uploaded a document in the chat and want to reuse it in future conversations? Instead of sending it again every time, save it to a Knowledge Base. You can then use the # command to bring the document into any conversation, at any time.