Tableaux 2000 benchmarks, logic K

January 2000


These benchmarks are described at the Tableaux 2000 competition homepage. They consist of modal formulas obtained by translating randomly generated quantified boolean formulas. For each test:
  • C are the of 4-literal clauses in the original QBF
  • D are the alternations of quantifiers
  • V is the maximum number of variables under the scope of each quantifier
  • mod denotes that propositional variables in the test are encoded into modal formulas in such a way that only one propositional variable survives.
Three different versions of *SAT are under evaluation:
  • *SAT this is the plain SAT-based algorithm for logic K.
  • *SAT+USB this adds bit-matrix based caching; its distinguishing features are:
    • it can store sets of formulas encoded as sets of integers
    • it can query for subsets (supersets)
    • it can be bounded in size and, if bounded, it allows to use a FIFO politic to discharge obsolete reults
  • *SAT+USH this adds hash-table based caching; its distinguishing features are:
    • it can store sets of formulas encoded as sets of integers
    • features a simple and fast exact-matching access
    • it never forgets results
For each algorithm we show the geometric mean of the Cpu time taken to solve some samples of each problem. A timelimit of 1200 seconds is enforced. >1200 in the table, means that all the samples exceeded the timelimit. Also the geometric mean of the number of Consistency checks is shown. For the samples that exceeed the timelimit we take into account the number of checks performed so far.

Test  *SAT *SAT+USB *SAT+USH
Cpu  Cons  Cpu  Cons  Cpu  Cons 
C10-V4-D4 >1200 736535 102.05 26663 133.82 41898
C20-V4-D4 >1200 654802 321.15 75967 493.66 131996
C30-V4-D4 157.71 98681 18.08 3953 29.36 8253
C10-V4-D6 >1200 533239 1103.23 181517 >1200 202489
C20-V4-D6 >1200 507785 >1200 152006 >1200 32282
C10-V8-D4 >1200 379344 >1200 109023 >1200 125439
C20-V8-D4 >1200 396821 >1200 66377 >1200 78798
C30-V8-D4 >1200 339884 >1200 55121 >1200 64837
C10-V4-D4-mod >1200 4355018 30.20 20398 101.46 98330
C20-V4-D4-mod >1200 4014061 28.99 12796 176.98 134319
C30-V4-D4-mod >1200 3902022 48.49 28928 456.25 258761
C10-V4-D6-mod >1200 3707478 133.06 36251 155.37 140415
C20-V4-D6-mod >1200 3252054 787.94 487360 1038.93 671376
C10-V8-D4-mod >1200 3147085 795.03 157570 >1200 500019
C20-V8-D4-mod >1200 2548269 >1200 262289 >1200 313329
C30-V8-D4-mod >1200 2383854 1021.58 183388 >1200 218947

As can be observed from the table, *SAT without caching is not able to solve most of the problem instances within the time limit. Caching produces dramatic speedups (up to two orders of magnitude) and, particularly, caching with a bit matrix yields the best results reducing consistency checks as well as cpu time.



Back to the state of the art page