Oliver Kennedy edited this page Jul 6, 2015 · 4 revisions

Things to Explore

Bootstrapped File Import

A nifty way to show off the power of Mimir would be to drop a CSV file onto it and start asking questions right away. As it stands, there are a few challenges in doing so:

  1. Is there a header row? Should row 1 define the attribute names?
  2. If there is no header row, how are the attribute names assigned?
  3. What are the types of each attribute?
  4. What happens to attribute values that don't conform to the proposed spec?

Questions 3 and 4 seem like the easiest to answer up front through a new type-inference lens.
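As a rough illustration of what such a lens might do, here is a minimal Python sketch (not Mimir code). It assumes a column arrives as a list of raw strings, picks the most specific type that a chosen fraction of the non-empty values satisfy, and replaces non-conforming values with `None` while recording their positions. The names `infer_type` and `cast_column`, the candidate type list, and the 0.75 threshold are all hypothetical.

```python
def _parses_as(caster):
    """Build a predicate that reports whether `caster` accepts a string."""
    def check(s):
        try:
            caster(s)
            return True
        except ValueError:
            return False
    return check

# Candidate types, most specific first (an assumed, deliberately tiny list).
CANDIDATES = [("int", int), ("float", float), ("string", str)]

def infer_type(values, threshold=0.75):
    """Return the most specific type that at least `threshold` of the
    non-empty values conform to; fall back to "string"."""
    non_empty = [v for v in values if v.strip() != ""]
    if not non_empty:
        return "string"
    for name, caster in CANDIDATES:
        check = _parses_as(caster)
        hits = sum(1 for v in non_empty if check(v))
        if hits / len(non_empty) >= threshold:
            return name
    return "string"

def cast_column(values, type_name):
    """Cast every value to `type_name`. Empty values become None; values
    that fail to cast also become None and their row index is flagged."""
    caster = dict(CANDIDATES)[type_name]
    cast, flagged = [], []
    for i, v in enumerate(values):
        if v.strip() == "":
            cast.append(None)
            continue
        try:
            cast.append(caster(v))
        except ValueError:
            cast.append(None)
            flagged.append(i)  # non-conforming value (question 4)
    return cast, flagged

column = ["1", "2", "oops", "4", "5"]
inferred = infer_type(column)
cast, flagged = cast_column(column, inferred)
```

In a real lens the flagged rows would presumably become uncertain values rather than plain `None`s; the sketch only shows where they would be detected.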

Design an End-to-end test case

Come up with a question, then figure out how it would be solved end-to-end in Mimir.

Log Parsing Lens

A lens for log parsing: basically a list of pairs of regular expressions and projection terms that build a relation.
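One way to picture this, as a minimal Python sketch rather than an actual Mimir interface: each rule pairs a compiled regular expression with projection terms, modeled here as functions from the regex match to an output attribute value. The first matching rule produces a row, and lines that match no rule are reported separately. The log format, the schema, and the names `RULES` and `parse_log` are all made up for illustration.

```python
import re

# Output schema of the relation being built (assumed for this example).
SCHEMA = ["time", "event", "detail"]

# Each rule: (regular expression, projection terms).
# A projection term maps the regex match to one output attribute.
RULES = [
    (re.compile(r"^(?P<ts>\S+) ERROR (?P<msg>.*)$"),
     {"time": lambda m: m.group("ts"),
      "event": lambda m: "error",
      "detail": lambda m: m.group("msg")}),
    (re.compile(r"^(?P<ts>\S+) login user=(?P<user>\w+)$"),
     {"time": lambda m: m.group("ts"),
      "event": lambda m: "login",
      "detail": lambda m: m.group("user")}),
]

def parse_log(lines, rules, schema):
    """Apply the first matching rule to each line. Returns the rows built
    and the indices of lines that no rule matched."""
    rows, unmatched = [], []
    for i, line in enumerate(lines):
        for pattern, projection in rules:
            m = pattern.match(line)
            if m:
                rows.append({attr: projection[attr](m) if attr in projection else None
                             for attr in schema})
                break
        else:
            unmatched.append(i)  # no rule applied to this line
    return rows, unmatched

log = [
    "2015-07-06T10:00 ERROR disk full",
    "2015-07-06T10:01 login user=alice",
    "garbage",
]
rows, unmatched = parse_log(log, RULES, SCHEMA)
```

Unmatched lines are a natural place for the lens to surface uncertainty instead of silently dropping input.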

JSON Parsing Lens

A generalization of the schema matching lens that coerces a table of JSON objects into a fixed schema.
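A minimal Python sketch of the coercion step, assuming each input row is a string holding one JSON object. The target schema, the alias table (standing in for whatever the schema matching step would produce), and the function name `coerce_record` are hypothetical. Attributes that are missing or cannot be cast are set to `None` and reported.

```python
import json

# Fixed target schema: attribute name -> Python type (assumed).
SCHEMA = {"id": int, "name": str, "score": float}

# Hypothetical result of schema matching: source key -> target attribute.
ALIASES = {"identifier": "id", "full_name": "name"}

def coerce_record(raw, schema, aliases):
    """Parse one JSON object and coerce it into `schema`.
    Returns (row, problems), where problems lists (attribute, reason)."""
    obj = json.loads(raw)
    renamed = {aliases.get(k, k): v for k, v in obj.items()}
    row, problems = {}, []
    for attr, typ in schema.items():
        value = renamed.get(attr)
        if value is None:
            row[attr] = None
            problems.append((attr, "missing"))
            continue
        try:
            row[attr] = typ(value)
        except (TypeError, ValueError):
            row[attr] = None
            problems.append((attr, "uncoercible"))
    return row, problems

good = coerce_record('{"identifier": "7", "full_name": "Ada", "score": 3}', SCHEMA, ALIASES)
bad = coerce_record('{"id": "abc", "name": "Bob"}', SCHEMA, ALIASES)
```

Keys that appear in the object but not in the schema are dropped here; a real lens might keep them around as evidence for alternative matchings.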

State Machine Lens

Encode a state machine (somehow?) that operates over sequences of records (e.g., for record linkage). Error states or invalid final states trigger new variables.
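How to encode the machine is an open question. One simple possibility, sketched in Python under the assumption that each record carries an event label, is a transition table keyed by (state, event). Records with no valid transition and an invalid final state are collected as flags, standing in for the points where the lens would introduce new variables. The table, the state names, and `run_machine` are illustrative only.

```python
# Transition table: (current state, event) -> next state (assumed example).
TRANSITIONS = {
    ("start", "open"): "opened",
    ("opened", "write"): "opened",
    ("opened", "close"): "closed",
}
FINAL_STATES = {"closed"}

def run_machine(records, transitions, start, finals, key=lambda r: r["event"]):
    """Run the machine over a sequence of records.
    Returns (last state, flags); each flag is (record index or None, reason)."""
    state = start
    flags = []
    for i, rec in enumerate(records):
        event = key(rec)
        nxt = transitions.get((state, event))
        if nxt is None:
            # Error transition: flag the record and stay in the current state.
            flags.append((i, f"no transition from {state!r} on {event!r}"))
            continue
        state = nxt
    if state not in finals:
        # Invalid final state: flag the sequence as a whole.
        flags.append((None, f"ended in non-final state {state!r}"))
    return state, flags

ok = run_machine([{"event": "open"}, {"event": "write"}, {"event": "close"}],
                 TRANSITIONS, "start", FINAL_STATES)
late_write = run_machine([{"event": "open"}, {"event": "close"}, {"event": "write"}],
                         TRANSITIONS, "start", FINAL_STATES)
unfinished = run_machine([{"event": "open"}], TRANSITIONS, "start", FINAL_STATES)
```

Staying in the current state after a bad record is one choice among several; moving to a dedicated error state would work equally well for the sketch.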