The primary objective of this project was to develop an AI-based system capable of classifying text and extracting specific information in real-time from PDF files. The focus was on handling a diverse set of PDF documents, specifically contract tender filing information related to construction work. The documents were in German language, and the project involved training an AI model to classify and extract relevant data. The project successfully achieved the goal of classifying and extracting information from diverse PDF documents in real-time with 95% accuracy. The solution’s deployment on AWS EC2 with autoscaling ensures scalability, making it suitable for handling varying workloads.