Fusion: Practice and Applications
FPA
2692-4048
2770-0070
10.54216/FPA
https://www.americaspg.com/journals/show/731
2018
2018
An efficient extraction of information from Indian Government issued documents Aadhar and Pan Card
Data Scientist ,Great Learning, India
Rachna
Rachna
Bharati Vidyapeeth's College of Engineering, INDIA
Arun K.
Dubey
Bharati Vidyapeeth's College of Engineering, INDIA
Achin
Jain
Bharati Vidyapeeth's College of Engineering, INDIA
Eshika
Agarwal
Bharati Vidyapeeth's College of Engineering, INDIA
Disha
Mittal
In today's world, everything is getting digitized, and widespread use of data scanning tools and photography. When we have a lot of image data, it becomes important to accumulate data in a form that is useful for the companyorganization. Doing it manually is a tedious task and takes an ample amount of time. Hence to simplify the job, we have developed a FLASK API that takes an image folder as an object and returns an excel sheet of relevant data from the image data. We have used optical character recognition and software like pytesseract to extract data from images. Further in the process, we have used natural language processing, and finally, we have found relevant data using the globe and regex module. This model is helpful in data collection from Registration certificates which helps us store data like chassis number, owner name, car number, etc., easily and can be applied to Aadhaar cards and pan cards.
2021
2021
56
61
10.54216/FPA.040201
https://www.americaspg.com/articleinfo/3/show/731