Issues: Unstructured-IO/unstructured
Issues list
CCT measure-table-structure-accuracy-command doesn't drop index
bug
Something isn't working
#2962
opened May 2, 2024 by
mallorih
docs: Add docs showing users can set which OCR agent to use
documentation
Improvements or additions to documentation
#2961
opened May 2, 2024 by
Coniferish
feat: enable users to define retry logic when using partition_via_api
enhancement
New feature or request
ingest
#2948
opened Apr 29, 2024 by
Coniferish
feat/docx-field-codes
docx
Related to Microsoft Word (.docx) file format
enhancement
New feature or request
#2944
opened Apr 27, 2024 by
erik-squared
Text Extraction Issue: Greek Language PDFs Rendered with Incorrect Alphabet
bug
Something isn't working
ocr
Related to optical character recognition (OCR).
#2939
opened Apr 26, 2024 by
DarioBernardo
Clarify orig_elements documentation
documentation
Improvements or additions to documentation
enhancement
New feature or request
#2929
opened Apr 25, 2024 by
Marcell-Balint
chore: Update unstructured-client
bug
Something isn't working
#2924
opened Apr 23, 2024 by
Coniferish
infer_table_structure lead Failed to initialize the model
bug
#2923
opened Apr 23, 2024 by
spongxin
infer_table_structure in partition_pdf function causes CUDA RuntimeError
bug
#2922
opened Apr 22, 2024 by
naity2
bug/Execution speed is very slow in AWS LAMBDA environment
investigating
Issues that require more information before they are actionable
#2916
opened Apr 22, 2024 by
cds-code
Doc/Docx with Checkboxes
docx
Related to Microsoft Word (.docx) file format
enhancement
New feature or request
#2912
opened Apr 19, 2024 by
Rob-Smith-HDT
Documentation for Partitioning table for email has wrong class type
documentation
Improvements or additions to documentation
#2907
opened Apr 19, 2024 by
debasisdwivedy
bug: TesseractError: Estimating resolution as X
bug
Something isn't working
ocr
Related to optical character recognition (OCR).
#2900
opened Apr 17, 2024 by
qued
Documentation for Ingestion of wikipedia
documentation
Improvements or additions to documentation
#2899
opened Apr 17, 2024 by
debasisdwivedy
bug/partition_pdf removes spaces from the text
bug
Something isn't working
pdf
#2896
opened Apr 16, 2024 by
christinestraub
bug/executing partition_doc using concurrent futures
investigating
Issues that require more information before they are actionable
#2891
opened Apr 15, 2024 by
salahaz
Unable to import unstructured.partition.xyz
bug
Something isn't working
#2888
opened Apr 14, 2024 by
flaviobrienza
Chunk overlap prefix is on even word boundary >= overlap character count.
chunking
Related to element chunking.
enhancement
New feature or request
#2886
opened Apr 12, 2024 by
scanny
bug/unexpected kwarg in MongoDB Destination Connector
bug
Something isn't working
#2878
opened Apr 11, 2024 by
ron-unstructured
File Not Found Error nlp/english-words.txt
bug
Something isn't working
#2859
opened Apr 5, 2024 by
taaha3244
bug/partion_pdf import statement not completing execution
bug
Something isn't working
#2847
opened Apr 4, 2024 by
viboognesh
feat/add extract_image_block_output_dir to partition_via_api
enhancement
New feature or request
#2833
opened Apr 2, 2024 by
awalker4