What is our primary use case?
There was a use case for the healthcare domain where we have to process dental claims and EOB documents. For that document, we have to extract the information in the structured tabular format as well as key-value pairs and checkboxes from the dental claims and the explanation of benefits document. This extracted information will be useful for further processing.
Whenever we pass this document to the OCR, it is helpful for extracting the information from the document. For example, in the dental claim document, we have the patient name, doctor's name, and what procedure was conducted on the patient by the doctors. So all this information we have to extract from the given document in the tabular format, or checkboxes, or the signatures from the document. Amazon Textract helps me to extract tabular forms, signatures, and key-value pairs separately.
What is most valuable?
Amazon Textract is good at extracting the key-value pairs and providing coordinates for the particular key-value or text which you are extracting from the document. It is very useful to get the coordinates, as we can map that coordinate again into the document and showcase where exactly the extracted information is located in the document.
I am mostly using the OCR feature of Amazon Textract.
With the help of Amazon Textract, we are reducing labor and manpower because with just one API call, we can get all the extracted information with its coordinates. Doing that by coding or using other open source OCR solutions is not as effective as using Amazon Textract, which helps reduce manpower.
Extracting the information with Amazon Textract is super easy. With one API call, we can extract the information, and the extracted information is correct along with the properly correct coordinates. I would recommend it to others for sure.
What needs improvement?
The main benefit of using Amazon Textract needs improvement, specifically when we extract the tabular data, it is very complicated to get its coordinate functions. We get the coordinates for child and its parent and its child, which is very difficult to extract for the tabular structure or CSV format. Sometimes the tabular data does not process properly for complex tabular structures or complex tables. It would be helpful if we could improvise the model or the OCR solution to extract the complex tabular data and get its coordinates or bounding boxes for the tables, especially for complex tables.
Apart from this tabular data, there is also no solution for checkbox detection. It would be helpful if we got a solution to detect the checkbox and get the bounding boxes for the checkbox and its value.
For how long have I used the solution?
I have used Amazon Textract for almost two to three months. After that, I am also using Bedrock for a few of the service projects.
What was my experience with deployment of the solution?
I have not integrated Amazon Textract with other AWS services such as Comprehend or Augmented AI.
We have integrated Amazon Textract with Bedrock. Whatever information we extract, we again pass it to the LLM call and we do some preprocessing and post-processing on that, getting useful information.
Which solution did I use previously and why did I switch?
For savings, I cannot provide information because we directly chose the Amazon Textract solution. We mostly work with AWS services, so we directly use Amazon Textract without considering any other services from Google or Azure. Hence, I cannot comment on the cost.
How was the initial setup?
I was not involved in the initial setup of Amazon Textract or the onboarding part.
What about the implementation team?
Most of the setups are done by the DevOps or framework engineers, so I did not face any difficulty while utilizing Amazon Textract at the initial level.
What was our ROI?
Amazon Textract is perfect for auditing because I did not consider it from the perspective of auditing and compliance. I think it performs well in this aspect.
What's my experience with pricing, setup cost, and licensing?
I did not think about the cost that much because we have to fulfill the customer's requirement. Whatever they demand, we do not think about cost at first. Whatever the cost involved, we will pay for that and use the services.
Which other solutions did I evaluate?
Our client requires the solution with the help of AWS premises or services only, that is why we always choose Amazon Textract.
What other advice do I have?
I don't have any specific advice for now, but if I think of anything, I will surely share it. On a scale of 1-10, I rate Amazon Textract a 7 out of 10.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)