Add New Rule Builder: Tactic Generator #70

yangky11 · 2023-10-07T21:24:44Z

This PR adds a new rule builder, allowing the user to tag an MVarID -> MetaM (Array (String × Float)) as a "tactic generator". For example,

@[aesop unsafe 80% tactic]
def x : MVarId → MetaM (Array (String × Float)) := fun _ => do
  return #[("rw [Nat.add_comm b]", 0.5), ("rw [Nat.add_assoc]", 0.9), ("rw [Nat.add_comm]", 0.8)]

Here the scores (0.5, 0.9, 0.8) are "probability modifiers". They are multiplied to the success probabilities during proof search.

Please see the new AesopTest/TacGen.lean file for a complete use case.

The user is free to make the tactic generator arbitrarily complex (e.g., using LLMs).

…od, along with up/downstream parts

Kaiyu lean infer dev add detection for hidden sorry/admit

…ang-LeanInfer-dev

minor fix

Peiyang lean infer dev

Refactor

JLimperg · 2023-10-09T15:09:36Z

Hey Kaiyu! Thanks for the nice PR. I've made various changes (hopefully improvements), but didn't want to clobber your master branch, so I've pushed them here:

https://github.com/JLimperg/aesop/tree/aesop-llm

Could you look at these to check whether anything seems wrong to you?

yangky11 · 2023-10-09T16:04:27Z

Hi @JLimperg, thank you for the changes; they look great to me. I'll try to integrate it with LLMs and do more testing. I'll post an update later.

yangky11 · 2023-10-09T16:20:12Z

Hi Jannis, I see you're also working on the script builder (not in this PR). How far are we from always being able to produce a proof script whenever the aesop tactic succeeds? The script builder is important for LLM-aesop to become useful, because LLMs may not generate the same set of tactic suggestions every time. It's important for the user to be able to replace the aesop tactic with the actual tactics found during proof search.

yangky11 · 2023-10-09T16:42:28Z

I have tested https://github.com/JLimperg/aesop/tree/aesop-llm in the dev branch of LeanInfer: https://github.com/lean-dojo/LeanInfer/blob/dev/LeanInferTests/Aesop.lean. It works great!

I plan to integrate this feature into the next release of LeanInfer, which will probably come out later this month (together with ongoing improvements in the speed and quality of tactic generation).

A minor question: The 100% here doesn't really play any role (since the probabilities are produced by the model)? If so, can we get rid of it?

JLimperg · 2023-10-09T16:50:30Z

How far are we from always being able to produce a proof script whenever the aesop tactic succeeds?

Producing a functioning script is not so difficult and should already work, apart from small bugs. E.g. in AesopTest.List, there's only one Aesop call that doesn't produce a functioning script. If you do set_option aesop.check.script true, Aesop will check whether the generated script would solve the goal.

What I'm currently working on is making these scripts more idiomatic (no weird tactics, proper structure, etc.). This is, as it turns out, quite annoying.

I have tested https://github.com/JLimperg/aesop/tree/aesop-llm in the dev branch of LeanInfer: https://github.com/lean-dojo/LeanInfer/blob/dev/LeanInferTests/Aesop.lean. It works great!

Nice, happy to hear! I'll merge this into Aesop master then.

A minor question: The 100% here doesn't really play any role (since the probabilities are produced by the model)? If so, can we get rid of it?

I could maybe special-case TacGen rules. Will see.

JLimperg · 2023-10-10T11:54:07Z

Closing this PR since the aesop-llm changes are now in master.

Peiyang-Song and others added 30 commits September 12, 2023 08:56

Finish a RuleTac and a RuleBuilder to run neural network under the ho…

35dd673

…od, along with up/downstream parts

Bump to LeanInfer 2.0.0, simplify additional installation steps

5919da4

Bump to LeanInfer v0.0.4

03fd2aa

Add script builder for neural

8d1adba

Bump to LeanInfer v0.0.7

e34adc4

Add Aesop-LeanInfer readme

3bc8350

Separate Aesop-LeanInfer readme from original Aesop

ad2b3e8

Create Aesop-README.md

9d410b5

Add build check example

2cdbed4

Link update

1804f21

Add exception handling and fix specific-tactic bug

2671f5e

Rename package

abe4b7b

Remove or suggested by the model: fake proofs

2e9134f

Detect when part of tactic is sorry or admit

1ef6a4a

Increase number of candidate sequences to 64

100fcd3

Add reminder of LeanInfer cloud build not working

de4d756

detect sorries

002ceca

lake update

a86552e

restore states

1abe960

bump to leanprover/lean4:v4.2.0-rc1

5ea54ad

rollback to v4.0.0

5ddb1a3

Merge pull request #1 from Peiyang-Song/Kaiyu-LeanInfer-dev

874d274

Kaiyu lean infer dev add detection for hidden sorry/admit

Disable timeout

a25763f

Merge remote-tracking branch 'origin/peiyang-LeanInfer-dev' into peiy…

4dfb047

…ang-LeanInfer-dev

minor fix

0dc1f4a

Minor fix to get rid of PANIC

4fb28bc

Merge pull request #2 from Peiyang-Song/Kaiyu-LeanInfer-dev

3442b8e

minor fix

Merge pull request #3 from Peiyang-Song/peiyang-LeanInfer-dev

6fce70e

Peiyang lean infer dev

Merge branch 'peiyang-LeanInfer'

73186b1

simplify

ee63cbe

Kaiyu Yang and others added 7 commits October 7, 2023 13:40

use LLM-generated scores

f1977a4

use the tactic rule builder

7669ebc

further simplify

744bae6

clean

76375b4

leanprover/lean4:v4.2.0-rc1

c238fd6

clean

76de841

Merge pull request #4 from Peiyang-Song/refactor

4bc99ff

Refactor

JLimperg closed this Oct 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add New Rule Builder: Tactic Generator #70

Add New Rule Builder: Tactic Generator #70

yangky11 commented Oct 7, 2023 •

edited

Loading

JLimperg commented Oct 9, 2023

yangky11 commented Oct 9, 2023

yangky11 commented Oct 9, 2023 •

edited

Loading

yangky11 commented Oct 9, 2023 •

edited

Loading

JLimperg commented Oct 9, 2023

JLimperg commented Oct 10, 2023

Add New Rule Builder: Tactic Generator #70

Add New Rule Builder: Tactic Generator #70

Conversation

yangky11 commented Oct 7, 2023 • edited Loading

JLimperg commented Oct 9, 2023

yangky11 commented Oct 9, 2023

yangky11 commented Oct 9, 2023 • edited Loading

yangky11 commented Oct 9, 2023 • edited Loading

JLimperg commented Oct 9, 2023

JLimperg commented Oct 10, 2023

yangky11 commented Oct 7, 2023 •

edited

Loading

yangky11 commented Oct 9, 2023 •

edited

Loading

yangky11 commented Oct 9, 2023 •

edited

Loading