taesiri/ArXivQA

Automated Question Answering with ArXiv Papers

Latest 25 Papers

  • View Selection for 3D Captioning via Diffusion Ranking - [Arxiv] [QA]
  • Language Imbalance Can Boost Cross-lingual Generalisation - [Arxiv] [QA]
  • OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments - [Arxiv] [QA]
  • Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models - [Arxiv] [QA]
  • Rho-1: Not All Tokens Are What You Need - [Arxiv] [QA]
  • Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation for Efficient Synthesis and Verification - [Arxiv] [QA]
  • On Unified Prompt Tuning for Request Quality Assurance in Public Code Review - [Arxiv] [QA]
  • AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs - [Arxiv] [QA]
  • High-Dimension Human Value Representation in Large Language Models - [Arxiv] [QA]
  • Overparameterized Multiple Linear Regression as Hyper-Curve Fitting - [Arxiv] [QA]
  • Fuss-Free Network: A Simplified and Efficient Neural Network for Crowd Counting - [Arxiv] [QA]
  • Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese - [Arxiv] [QA]
  • Sparse Laneformer - [Arxiv] [QA]
  • From the Lab to the Theater: An Unconventional Field Robotics Journey - [Arxiv] [QA]
  • Discourse-Aware In-Context Learning for Temporal Expression Normalization - [Arxiv] [QA]
  • An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization - [Arxiv] [QA]
  • Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations - [Arxiv] [QA]
  • RMAFF-PSN: A Residual Multi-Scale Attention Feature Fusion Photometric Stereo Network - [Arxiv] [QA]
  • Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models -- Technical Challenges and Implications for Monitoring and Verification - [Arxiv] [QA]
  • Mitigating Vulnerable Road Users Occlusion Risk Via Collective Perception: An Empirical Analysis - [Arxiv] [QA]
  • Reframing the Mind-Body Picture: Applying Formal Systems to the Relationship of Mind and Matter - [Arxiv] [QA]
  • Reflectance Estimation for Proximity Sensing by Vision-Language Models: Utilizing Distributional Semantics for Low-Level Cognition in Robotics - [Arxiv] [QA]
  • Chaos in Motion: Unveiling Robustness in Remote Heart Rate Measurement through Brain-Inspired Skin Tracking - [Arxiv] [QA]
  • Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns - [Arxiv] [QA]
  • Finding Dino: A plug-and-play framework for unsupervised detection of out-of-distribution objects using prototypes - [Arxiv] [QA]

List of Papers by Year

Acknowledgements

This project is made possible through the generous support of Anthropic, which provided free access to the Claude-2.1 API.