Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HTML格式表格导入,分块无表头 #1450

Open
1 of 5 tasks
Essence9999 opened this issue May 11, 2024 · 3 comments
Open
1 of 5 tasks

HTML格式表格导入,分块无表头 #1450

Essence9999 opened this issue May 11, 2024 · 3 comments

Comments

@Essence9999
Copy link

例行检查

  • 我已确认目前没有类似 features
  • 我已确认我已升级到最新版本
  • 我已完整查看过项目 README,已确定现有版本无法满足需求
  • 我理解并愿意跟进此 features,协助测试和提供反馈
  • 我理解并认可上述内容,并理解项目维护者精力有限,不遵循规则的 features 可能会被无视或直接关闭

功能描述
用HTML格式,导入表格,分块无表头
image

应用场景
合并单元格等场景,无法用MD table表示;可用HTML表示

@Essence9999
Copy link
Author

本来HTML格式能表示出表格真实情况,如合并单元格;但以html录入到知识库中,却成了markdown格式,导致合并单元格部分错误;
建议html格式文件,保持源码,html格式大模型也是可以处理的

@c121914yu
Copy link
Collaborator

c121914yu commented May 12, 2024

html 不转化太大了。
感觉可以单独保留表格 html

@Essence9999
Copy link
Author

html 不转化太大了。 感觉可以单独保留表格 html

嗯嗯,确实存在这种情况,表大的话,html也大,导致复制到知识库被截断;html分块的话,也麻烦,没有了对应的上下文~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants