Fusion: Practice and Applications FPA 2692-4048 2770-0070 10.54216/FPA https://www.americaspg.com/journals/show/731 2018 2018 An efficient extraction of information from Indian Government issued documents Aadhar and Pan Card Data Scientist ,Great Learning, India Rachna Rachna Bharati Vidyapeeth's College of Engineering, INDIA Arun K. Dubey Bharati Vidyapeeth's College of Engineering, INDIA Achin Jain Bharati Vidyapeeth's College of Engineering, INDIA Eshika Agarwal Bharati Vidyapeeth's College of Engineering, INDIA Disha Mittal In today's world, everything is getting digitized, and widespread use of data scanning tools and photography. When we have a lot of image data, it becomes important to accumulate data in a form that is useful for the companyorganization. Doing it manually is a tedious task and takes an ample amount of time. Hence to simplify the job, we have developed a FLASK API that takes an image folder as an object and returns an excel sheet of relevant data from the image data. We have used optical character recognition and software like pytesseract to extract data from images. Further in the process, we have used natural language processing, and finally, we have found relevant data using the globe and regex module. This model is helpful in data collection from Registration certificates which helps us store data like chassis number, owner name, car number, etc.,  easily and can be applied to Aadhaar cards and pan cards. 2021 2021 56 61 10.54216/FPA.040201 https://www.americaspg.com/articleinfo/3/show/731