Official Code of "STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases"
-
Updated
May 13, 2024 - Python
Official Code of "STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases"
Programming language for symbolic computation with unusual combination of pattern matching features: Tree patterns, associative patterns and expressions embedded in patterns.
Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of different layouts in a declarative way.
Implementation of the semi-structured inference model in our ACL 2020 paper, INFOTABS: Inference on Tables as Semi-structured Data.
Repository containing code for the NAACL 2021 paper (Incorporating External Knowledge to Enhance Tabular Reasoning)
Endoscopic and Pathological data extraction for various endo-pathological data extraction
A dataset for extracting information from repair manuals
An ActiveModel extension to model your semi-structured data using embedded associations
This repository contains the official code for the paper : Realistic Data Augmentation Framework for Enhancing Tabular Reasoning.
Urban Dict spelling variant dataset. Source code of How to Evaluate Word Representations of Informal Domain?
Coherent data analysis description language
Schema inference for semistructured data using Formal Concept Analysis
A semi-automatic web-based annotation tool for MyFixit dataset :
Implementation of the semi-structured inference model in our ACL 2023 paper: INFOSYNC: Information Synchronization across Multilingual Semi-structured Tables.
Java Standalone application for querying XML documents with requests with preferences (GTPs requests with preferences)
Endoscopic and Pathological data extraction for various endo-pathological data extraction
Web-based workflow management system that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from the list of candidates. Implemented in Bracmat (~75%) and Java (~25%).
Any2Json Net Classifier Plugin
Eloquent Serialized LOB is a trait for Laravel Eloquent models that allows Serialized LOB pattern
Add a description, image, and links to the semi-structured-data topic page so that developers can more easily learn about it.
To associate your repository with the semi-structured-data topic, visit your repo's landing page and select "manage topics."