[RFC] Input AST of the SAT API #1018

Halbaroth · 2023-12-22T11:37:15Z

I open this issue because I was trying to solve properly the issue #778 yesterday and I got stuck with the SAT solver API. We know that Alt-Ergo produces partial models because of preprocessing and optimizations in the SAT solvers. In #778, we was agreed that a good solution is to collect in d_cnf all the declared symbols and feed the get_model method with them. This solution will work, and actually I wrote the PR already but I wasn't very pleased with it. This solution doesn't solve the issue if you use the SAT API directly. A library's user should expect to get a complete model if he/she calls the get_model method of the SAT API.

After thinking about it, I think that the core issue is the fact we used a preprocessed AST for the input expressions of the SAT API.
The current Expr AST assumes three roles:

The parsed (and typed) AST.
A preprocessed AST.
A CNF AST for the formulas.

I think we should use three different ASTs here:

A subset of the Dolmen AST as input of the SAT API.
For the preprocessed AST, we may use again the Dolmen one but we have to perform the preprocessing phase in the SAT solver at the very beginning of assume. So basically, we could create a new module Preprocess that takes a Dolmen AST and returned a preprocessed one. We may add some memoization here.
We have already a CNF AST for SatML and actually we translate the CNF AST of FunSAT to the CNF AST of SatML. We should push the CNF AST from Expr into FunSAT and use directly the CNF AST of SatML into SatML. Again the transformation of formulas into CNF formulas has to be performed in assume.

A possible roadmap to do this work could be:

Remove the legacy parser and perform some clean-ups.
Move the CNF AST from Expr into FunSAT and remove the translation performed in satml_frontend. The CNF transformation is performed in the SAT solvers.
Create a preprocess module which transforms a Expr AST into a Expr AST.
Use this preprocess at the very beginning of the assume (before the CNF transformation).
Replace Expr by a subset of the Dolmen AST.

The text was updated successfully, but these errors were encountered:

bclement-ocp · 2024-01-02T10:03:19Z

Assuming that by "the SAT API" you mean the Frontend / Sat_solver_sig API, they don't currently have a concept of symbols being declared; switching to using the Dolmen AST instead will not change that.

Long-term pushing the Dolmen AST downwards is probably the move (although we do need to be careful about hash-consing), but I'd say we should wait until we remove the legacy frontend to do so. Could we instead have an API to explicitly declare symbols to the solver, which would be used by d_cnf? The user would then only be guaranteed to have values in the models for the symbols that have been explicitly declared.

We can then revisit that design and come up with a better solution (probably centered around the Dolmen AST as you suggest) once we drop the legacy frontend and the related constraints.

@A

This PR fixes the issue OCamlPro#778. The suggested solution by this PR is as follows: - Collect user-declared symbols in `D_cnf`; - As the function `make` of the module `D_cnf` returns a list of commands, we add a new command `Decl` in `Frontend` to declare symbols; - The declared symbols are given to the SAT solver through a new function `declaration` in `sat_solver_sig`. - These symbols are collected in vectors and we keep indexes in another PR in order to manage properly the push/pop mechanism. Notice that we have to use vectors too in the environment of FunSAT because the current implementation of push/pop commands in this SAT solver doesn't use the persistency of its environment to restore previous environments while processing `pop` command. - Finally the declared symbols are given to `Uf` and we create the empty model with them. Another annoying point comes from the current datastructure used in `ModelMap`. The type of `ModelMap` saves model values in the form of graphs. A graph is a finite set of constraints but there is no appropriate representation for the graph without constraints. Let's imagine we have the input: ```smt2 (set-option :produce-models true) (set-logic ALL) (declare-fun (Int) Int) (check-sat) (get-model) ``` We expect an output as follows: ```smt2 ( (define-fun f (_x Int) (@A as Int)) ) ``` But `Graph.empty` cannot hold the abstract value `@a`. A naive solution consists in using an ADT: ```ocaml type graph = | Empty of Id.typed | G of Graph.t ``` But we'll add an extra indirection. Probably the best solution would be to use a custom datastructure instead of a map to store the constraints. In this PR, we detect in `ModelMap.add` if the only constraint in the graph is of the form `something -> abstract value`. In this case we remove this constraint. This solution isn't perfect as explained in issue OCamlPro#1018. The biggest drawback of this solution is the fact we have to declare symbols in the SAT API, otherwise models returned by `get_model` could be incomplete.

@A

This PR fixes the issue OCamlPro#778. The suggested solution by this PR is as follows: - Collect user-declared symbols in `D_cnf`; - As the function `make` of the module `D_cnf` returns a list of commands, we add a new command `Decl` in `Frontend` to declare symbols; - The declared symbols are given to the SAT solver through a new function `declaration` in `sat_solver_sig`. - These symbols are collected in vectors and we keep indexes in another PR in order to manage properly the push/pop mechanism. Notice that we have to use vectors too in the environment of FunSAT because the current implementation of push/pop commands in this SAT solver doesn't use the persistency of its environment to restore previous environments while processing `pop` command. - Finally the declared symbols are given to `Uf` and we create the empty model with them. Another annoying point comes from the current datastructure used in `ModelMap`. The type of `ModelMap` saves model values in the form of graphs. A graph is a finite set of constraints but there is no appropriate representation for the graph without constraints. Let's imagine we have the input: ```smt2 (set-option :produce-models true) (set-logic ALL) (declare-fun (Int) Int) (check-sat) (get-model) ``` We expect an output as follows: ```smt2 ( (define-fun f (_x Int) (@A as Int)) ) ``` But `Graph.empty` cannot hold the abstract value `@a`. A naive solution consists in using an ADT: ```ocaml type graph = | Empty of Id.typed | G of Graph.t ``` But we'll add an extra indirection. Probably the best solution would be to use a custom datastructure instead of a map to store the constraints. In this PR, we detect in `ModelMap.add` if the only constraint in the graph is of the form `something -> abstract value`. In this case we remove this constraint. This solution isn't perfect as explained in issue OCamlPro#1018. The biggest drawback of this solution is the fact we have to declare symbols in the SAT API, otherwise models returned by `get_model` could be incomplete.

Halbaroth · 2024-01-02T11:07:50Z

Ok, I reword what I mean in this issue. The main problem is the fact some preprocessing computations are performed in the Expr AST before feeding the SAT solver with these expressions.

If we don't do that before giving a formula to assume, it's almost solved the issue with incomplete models because we can scan the formula in assume to collect declared symbols and we don't need to add an ugly declaration primitive to our SAT API. There is only one case where the model is still incomplete with this solution: if you don't use at all the symbol in any asserted formulas. For instance:

(set-option :produce-models true)
(set-logic ALL)
(declare-const x Int)
(check-sat)
(get-model)

I suggest to use a subset of the Dolmen AST because we could simplify a lot D_cnf. Nevertheless, there are some stumbling blocks as the purification or the skolemization of formulas.

Of course, this refactoring requires a lot of work and I don't plan do it for a specific release.

@A

This PR fixes the issue OCamlPro#778. The suggested solution by this PR is as follows: - Collect user-declared symbols in `D_cnf`; - As the function `make` of the module `D_cnf` returns a list of commands, we add a new command `Decl` in `Frontend` to declare symbols; - The declared symbols are given to the SAT solver through a new function `declaration` in `sat_solver_sig`. - These symbols are collected in vectors and we keep indexes in another PR in order to manage properly the push/pop mechanism. - Finally the declared symbols are given to `Uf` and we create the empty model with them. Another annoying point comes from the current datastructure used in `ModelMap`. The type of `ModelMap` saves model values in the form of graphs. A graph is a finite set of constraints but there is no appropriate representation for the graph without constraints. Let's imagine we have the input: ```smt2 (set-option :produce-models true) (set-logic ALL) (declare-fun (Int) Int) (check-sat) (get-model) ``` We expect an output as follows: ```smt2 ( (define-fun f (_x Int) (@A as Int)) ) ``` But `Graph.empty` cannot hold the abstract value `@a`. A naive solution consists in using an ADT: ```ocaml type graph = | Empty of Id.typed | G of Graph.t ``` But we'll add an extra indirection. Probably the best solution would be to use a custom datastructure instead of a map to store the constraints. In this PR, we detect in `ModelMap.add` if the only constraint in the graph is of the form `something -> abstract value`. In this case we remove this constraint. This solution isn't perfect as explained in issue OCamlPro#1018. The biggest drawback of this solution is the fact we have to declare symbols in the SAT API, otherwise models returned by `get_model` could be incomplete.

bclement-ocp · 2024-01-02T11:24:10Z

I am not sure we should be inspecting the formulas for this; it'd be better to have some sort of context object (see e.g. Z3's API) to keep track of declared symbols (to be clear, this context object should have an equivalent of Expr.mk_name that tracks the created identifiers with type context -> string -> symbol, not a context -> symbol -> unit function to use after the fact). This would avoid needing a separate mechanism for the declared-but-unused symbols (which we do need to support for SMT-LIB compliance).

@A

This PR fixes the issue OCamlPro#778. The suggested solution by this PR is as follows: - Collect user-declared symbols in `D_cnf`; - As the function `make` of the module `D_cnf` returns a list of commands, we add a new command `Decl` in `Frontend` to declare symbols; - The declared symbols are given to the SAT solver through a new function `declaration` in `sat_solver_sig`. - These symbols are collected in vectors and we keep indexes in another PR in order to manage properly the push/pop mechanism. - Finally the declared symbols are given to `Uf` and we create the empty model with them. Another annoying point comes from the current datastructure used in `ModelMap`. The type of `ModelMap` saves model values in the form of graphs. A graph is a finite set of constraints but there is no appropriate representation for the graph without constraints. Let's imagine we have the input: ```smt2 (set-option :produce-models true) (set-logic ALL) (declare-fun (Int) Int) (check-sat) (get-model) ``` We expect an output as follows: ```smt2 ( (define-fun f (_x Int) (@A as Int)) ) ``` But `Graph.empty` cannot hold the abstract value `@a`. A naive solution consists in using an ADT: ```ocaml type graph = | Empty of Id.typed | G of Graph.t ``` But we'll add an extra indirection. Probably the best solution would be to use a custom datastructure instead of a map to store the constraints. In this PR, we detect in `ModelMap.add` if the only constraint in the graph is of the form `something -> abstract value`. In this case we remove this constraint. This solution isn't perfect as explained in issue OCamlPro#1018. The biggest drawback of this solution is the fact we have to declare symbols in the SAT API, otherwise models returned by `get_model` could be incomplete.

@A

This PR fixes the issue OCamlPro#778. The suggested solution by this PR is as follows: - Collect user-declared symbols in `D_cnf`; - As the function `make` of the module `D_cnf` returns a list of commands, we add a new command `Decl` in `Frontend` to declare symbols; - The declared symbols are given to the SAT solver through a new function `declaration` in `sat_solver_sig`. - These symbols are collected in vectors and we keep indexes in another PR in order to manage properly the push/pop mechanism. - Finally the declared symbols are given to `Uf` and we create the empty model with them. Another annoying point comes from the current datastructure used in `ModelMap`. The type of `ModelMap` saves model values in the form of graphs. A graph is a finite set of constraints but there is no appropriate representation for the graph without constraints. Let's imagine we have the input: ```smt2 (set-option :produce-models true) (set-logic ALL) (declare-fun (Int) Int) (check-sat) (get-model) ``` We expect an output as follows: ```smt2 ( (define-fun f (_x Int) (@A as Int)) ) ``` But `Graph.empty` cannot hold the abstract value `@a`. A naive solution consists in using an ADT: ```ocaml type graph = | Empty of Id.typed | G of Graph.t ``` But we'll add an extra indirection. Probably the best solution would be to use a custom datastructure instead of a map to store the constraints. In this PR, we detect in `ModelMap.add` if the only constraint in the graph is of the form `something -> abstract value`. In this case we remove this constraint. This solution isn't perfect as explained in issue OCamlPro#1018. The biggest drawback of this solution is the fact we have to declare symbols in the SAT API, otherwise models returned by `get_model` could be incomplete.

@A

* Move typed symbols from ModelMap to Id * Generate complete model This PR fixes the issue #778. The suggested solution by this PR is as follows: - Collect user-declared symbols in `D_cnf`; - As the function `make` of the module `D_cnf` returns a list of commands, we add a new command `Decl` in `Frontend` to declare symbols; - The declared symbols are given to the SAT solver through a new function `declaration` in `sat_solver_sig`. - These symbols are collected in vectors and we keep indexes in another PR in order to manage properly the push/pop mechanism. - Finally the declared symbols are given to `Uf` and we create the empty model with them. Another annoying point comes from the current datastructure used in `ModelMap`. The type of `ModelMap` saves model values in the form of graphs. A graph is a finite set of constraints but there is no appropriate representation for the graph without constraints. Let's imagine we have the input: ```smt2 (set-option :produce-models true) (set-logic ALL) (declare-fun (Int) Int) (check-sat) (get-model) ``` We expect an output as follows: ```smt2 ( (define-fun f (_x Int) (@A as Int)) ) ``` But `Graph.empty` cannot hold the abstract value `@a`. A naive solution consists in using an ADT: ```ocaml type graph = | Empty of Id.typed | G of Graph.t ``` But we'll add an extra indirection. Probably the best solution would be to use a custom datastructure instead of a map to store the constraints. In this PR, we detect in `ModelMap.add` if the only constraint in the graph is of the form `something -> abstract value`. In this case we remove this constraint. This solution isn't perfect as explained in issue #1018. The biggest drawback of this solution is the fact we have to declare symbols in the SAT API, otherwise models returned by `get_model` could be incomplete. * Move comment * Remove useless option * Remove useless DeclSet module * We aren't in C++ :) * Split the test * Use a stack of the stdlib for `declare_ids` * Use a stack of the stdlib for declared symbols in SatML * Use an ADT to discriminate free graph and constant graph * Avoid list concatenation * Add a new error in case of stack underflow * Don't perform substitution into free graphes

Halbaroth added backlog clean-up labels Dec 22, 2023

Halbaroth mentioned this issue Jan 2, 2024

Complete model #1019

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Input AST of the SAT API #1018

[RFC] Input AST of the SAT API #1018

Halbaroth commented Dec 22, 2023 •

edited

bclement-ocp commented Jan 2, 2024

Halbaroth commented Jan 2, 2024 •

edited

bclement-ocp commented Jan 2, 2024

[RFC] Input AST of the SAT API #1018

[RFC] Input AST of the SAT API #1018

Comments

Halbaroth commented Dec 22, 2023 • edited

bclement-ocp commented Jan 2, 2024

Halbaroth commented Jan 2, 2024 • edited

bclement-ocp commented Jan 2, 2024

Halbaroth commented Dec 22, 2023 •

edited

Halbaroth commented Jan 2, 2024 •

edited