Documentation #8
Comments
Adding more integration with textract sounds like a good idea. Let me put together a few more examples of that. Do you have any other feedback?
Hi @datascienceteam01, those are good comments. I do a lot of ingestion myself, and you have a valid point. However, if excelcy tries to tackle those problems as well, the project scope will be all over the place. That is why I rely on other packages for data transformation (for example, image -> text). I think your points are more relevant to the textract package. I am adding the extra documentation as per your suggestion and will release a newer version. Thanks.
Would it be possible for the project to include a full example that goes from extracting text from a PNG or PDF, through training (or using a pre-existing model), to writing the output?
textract is pretty good, but it makes a couple of wrong assumptions, for instance that every PDF file can be consumed in the same way.
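To illustrate the point about PDFs, here is a minimal sketch of one possible workaround: try several extraction strategies in order and keep the first one that yields a usable amount of text. The helper names `textract_extractor` and `extract_with_fallback`, and the `min_chars` threshold, are hypothetical and are not part of excelcy or textract. The only textract call used is `textract.process(path, method=...)`, which returns bytes. The `pdftotext` and `tesseract` methods need their external tools installed.

```python
from typing import Callable, Iterable


def textract_extractor(method: str) -> Callable[[str], str]:
    """Build an extractor that calls textract with the given PDF method."""
    def extract(path: str) -> str:
        import textract  # third-party; imported lazily so the rest runs without it
        raw = textract.process(path, method=method)  # returns bytes
        return raw.decode("utf-8", errors="replace")
    return extract


def extract_with_fallback(
    path: str,
    extractors: Iterable[Callable[[str], str]],
    min_chars: int = 20,  # hypothetical threshold for "usable" text
) -> str:
    """Return the first result with at least min_chars of non-blank text.

    If no extractor reaches the threshold, return the longest result seen
    (an empty string if every attempt failed or produced nothing).
    """
    best = ""
    for extract in extractors:
        try:
            text = extract(path)
        except Exception:
            # This strategy cannot handle the file; try the next one.
            continue
        if len(text.strip()) >= min_chars:
            return text
        if len(text.strip()) > len(best.strip()):
            best = text
    return best
```

A scanned PDF usually has no text layer, so one plausible ordering is the text-based method first and OCR as the fallback, for example `extract_with_fallback("scan.pdf", [textract_extractor("pdftotext"), textract_extractor("tesseract")])`, where `scan.pdf` is a placeholder file name.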