TITLE:
Document-Centric Automation: A Comprehensive Approach to Word, PowerPoint, and PDF Processing
AUTHORS:
Pullaiah Babu Alla
KEYWORDS:
Robotic Process Automation, UiPath, Document Understanding, AI, NLP, Azure AI Document Intelligence, Intelligent Document Processing (IDP)
JOURNAL NAME:
Journal of Computer and Communications,
Vol.13 No.10,
October
15,
2025
ABSTRACT: The modern enterprise automation strategies now heavily rely on Robotic Process Automation (RPA) because document processing stands as a fundamental use case due to the widespread presence of semi-structured and unstructured data. This research investigates how UiPath RPA integrates document processing functions by demonstrating automation of Word, PowerPoint, and PDF documents. The paper demonstrates how UiPath connects to different document types to extract meaningful information through invoice processing as a real-world example while performing string and Regex operations for effective data utilization. The paper focuses on how UiPath handles structured and unstructured PDFs through its built-in activities, OCR techniques, Document Understanding framework, and Azure AI Document Intelligence integration. The paper delivers operational knowledge and technical instructions to RPA developers who want to establish intelligent document processing workflows in UiPath.