Update VCLLVM (now Pallas) to LLVM 17, update to newest VerCors version, and convert more instructions to COL #1159

superaxander · 2024-02-28T09:01:20Z

Summary

This PR updates VCLLVM to LLVM 17 and updates it to work with the current VerCors. Additionally more instructions converted into COL, there is support for loops, pointers, structs, and more. This also includes changes to the C and PVL frontends for pointers and ByValueClass encoding. (i.e. structs)

Detailed list of changes

General

Ignore a quantifier in SimplifyNestedQuantifiers if it contains a non-inline trigger (it was erroneously rewriting a generated trigger)
Make the ParBlockEncoder treat a unary minus as a constant operator (this was needed for one of the HaliVer examples to verify)
Update to silicon commit viperproject/silicon@2030e3e to improve verification performance for quantified heap chunks (PR: Consolidating quantified field and predicate chunks viperproject/silicon#860)
Add clang-format to the commit hooks to format the C++ code before every commit

Pallas

Rename VCLLVM to Pallas
Update Pallas to LLVM 17 from LLVM 15
Switch from Pallas as a standalone application to a plugin that can be loaded by the LLVM framework
Add support for more LLVM-IR instructions
- AllocA (allocating variables on the stack eliminating the need for the mem2reg pass before verification)
- GetElementPtr (indexing structs arrays and vectors, currently only with statically known offsets)
- Load/Store (loading and storing values behind pointers)
- SExt, ZExt, Trunc (since we only deal with mathematical integers these are translated as no-ops for now)
Add support for more LLVM-IR types
- Structs
- Pointers (including some very basic function pointer support)
- Arrays/Vectors (these are currently treated as the same)
Add support for more LLVM-IR elements:
- Global variables
- Loops (Branch instructions are transformed into ifs + goto and then the goto's are replaced with actual (while) loops. This uses LLVM's loop analyses to find the header and body of the loop)
- More constant types
- Implicit pointer casts (based on C support described below, but with automatic type inference since LLVM omits pointer types)
Pass in debug locations from the LLVM-IR to VerCors so that error locations in the original source files can be pointed out

C (changes superceding #1172 and #1227)

Change permission syntax such that it always takes a pointer except when unfolding a struct permission
- Permission to write a field f of struct value a becomes Perm(&a.f, write) instead of Perm(a.f, write)
- Permission to write all fields of a struct value a becomes Perm(a, write) (or if a is a pointer to a struct you'd have Perm(a, write) ** Perm(*a, write))
Replace struct encoding with a new one based on ADTs (called ByValueClass in the code) containing pointers (related: Encoding structs (and pointers to structs) #1194, Split Class into ByReferenceClass and ByValueClass #1227)
- Allow casting pointers to structs to pointers to their first element (possibly multiple layers deep), also allow casting these pointers back up
- Allow passing around void pointers to have pointers that might not always point to the same type
- Allow getting a pointer to a struct field
- Ensure struct values are always treated as values instead of references (this was already true in most cases but could not be guaranteed because unrelated passes were allowed to treat the struct as a Java/PVL class)
Enable getting the address of local and global variables (related: Allow the use of pointers to things that are not a reference type #1172)
- Variables that have their address taken are automatically upgraded to pointer types
- Currently no automatic permission annotations are generated for local variables so this may be confusing if you have a local variable that you need to declare access for in a loop invariant. (however in every case where this would occur you would not have been able to verify the program previously because of the address-of operator) Generating permission would likely be feasible
Add a new pointer type that can never be null (also replaces some of the changes in Encode that malloc may return NULL in C (weaken postcondition) #1239)
Improve some of the names (which used to be unknownN in the resulting viper file to help with debugging
Remove Viper field access from triggers where DerefPointer or PointerSubscript are the top-level expression (this is generally better for verification performance)

PVL

Add the * and & operators to PVL to work with pointer values (which could also already be used through the pointer<T> ADT)
Add VerCors option --contract-import-file which allows importing method contracts and types from a PVL file which will replace the function signature/contract of the LLVM function with the same name (this will likely be removed later once Robert's specification stuff gets merged in but is necessary now to test the LLVM frontend)

To do:

Update the wiki

pieter-bos · 2024-02-29T11:55:59Z

Hey nice work, just a quick general comment while you work on this: I've realized that we're starting to do too much logic in the Lang{_}ToCol passes (i.e. the "unskippable" part), things should be in the transformation stage. The solution is to just adopt more nodes into the "core" of col: I think all reasonable language features should live there, so col-core is a ~superset of all our input languages. I don't mind duplicates (so by all means make a col equivalent of LLVM nodes so the initial translation is simple), but I'd encourage you to think about language features we should add into col.

superaxander · 2024-02-29T12:10:45Z

Hey nice work, just a quick general comment while you work on this: I've realized that we're starting to do too much logic in the Lang{_}ToCol passes (i.e. the "unskippable" part), things should be in the transformation stage. The solution is to just adopt more nodes into the "core" of col: I think all reasonable language features should live there, so col-core is a ~superset of all our input languages. I don't mind duplicates (so by all means make a col equivalent of LLVM nodes so the initial translation is simple), but I'd encourage you to think about language features we should add into col.

Yeah that makes sense I've so far not touched any of the transformation stuff so I don't quite know what goes in rewrite/, resolve/, etc. But indeed I already felt like some of the things I'm adding will be relevant more broadly.

formatting hook with clang-format

superaxander · 2024-05-31T11:40:04Z

My current plan is to remove the changes from #1172 such that this can be merged without affecting any other parts of the code. I had some issues with the new Ubuntu 24.02 image (which I'm using to get LLVM-17) but the last few runs seem to have worked instead of being suddenly cancelled.

… me and which aren't

in the transformations stages

…d clean up ClassToRef

…n classToRef

superaxander · 2024-10-01T12:17:32Z

Marking this as ready for review since I basically just want to add some tests now before merging this

…oding

bobismijnnaam · 2024-10-08T12:36:31Z

src/col/vct/col/ast/declaration/global/ClassImpl.scala

  def transSupportArrowsHelper(
      seen: Set[TClass[G]]
  ): Seq[(TClass[G], TClass[G])] = {
-    val t: TClass[G] = TClass(this.ref, typeArgs.map(v => TVar(v.ref)))
+    // TODO: Does this break things if we have a ByValueClass with supers?


Why would it? Do you want to disallow plain assignment for byref classes?

The only thing I can come up with that's relevant is that in C, casting a MyStruct* to FirstMemberOfMyStruct* is well-defined. So in a way, a CStruct ast node does not have any supertypes, but the LangCToCol pass could compile it into a byvalue class, with the only supertype being equal to the first member of the struct. You'd have to account for that in assignment, though.

In addition, since you're targeting C, assigning different struct types makes no sense. So maybe struct types with supers should be disallowed? Then maybe byval classes should be separate from byref classes (not saying you have to do this, just thinking out loud)

I agree it's probably best to not support ByValueClasses with super types for now

bobismijnnaam · 2024-10-08T12:40:07Z

src/col/vct/col/ast/declaration/singular/EndpointImpl.scala

@@ -12,7 +12,7 @@ trait EndpointImpl[G]
  override def layout(implicit ctx: Ctx): Doc =
    Group(Text("endpoint") <+> ctx.name(this) <+> "=" <+> init)

-  def t: TClass[G] = TClass(cls, typeArgs)
+  def t: TClass[G] = TByReferenceClass(cls, typeArgs)


I guess this might be a bit of a code smell: while this is the right choice for now as it reflects what was already the case, I did not have in mind to restrict endpoints of choreographies to be either byref or byval. I only expect fields and methods.

Maybe we can discuss in-office if it's possible or not to leave it open for both here? I thought the whole point of this PR was to leave it open in rewriting code whether a class is byref or byval, but I'm sure I'm missing a lot of (c++) context.

src/col/vct/col/ast/lang/llvm/LLVMGlobalVariableImpl.scala

bobismijnnaam · 2024-10-08T13:24:41Z

src/col/vct/col/origin/Origin.scala

@@ -109,6 +109,9 @@ case class SourceName(name: String) extends NameStrategy {
    Some(SourceName.stringToName(name))
 }

+// Used to disambiguate whether to show a ByValueClass as a class or a struct
+case class TypeName(name: String) extends OriginContent


If you focus on the C frontend, why distinguish between ByValueClass that require a "class" prefix? In addition, there's no PVL syntax, right? So might aswell settle on that ByValueClasses are structs, and that ByRef is class...?

bobismijnnaam · 2024-10-08T13:25:23Z

src/col/vct/col/resolve/Resolve.scala

+          Spec.findClass(name, ctx)
+            .getOrElse(throw NoSuchNameError("class", name, t))
+        )
+      case t @ TByValueClass(ref, _) =>


Just in case: if you make class/struct distinction, maybe you can change the "class" error message below to "struct" as well.

bobismijnnaam · 2024-10-08T14:56:33Z

src/rewrite/vct/rewrite/EncodeAutoValue.scala

+                )) { dispatch(main) }
+              if (mMap.isEmpty) { Let(b, v, m) }
+              else {
+                mMap.foreach(postM =>


I'm probably just slow, maybe we can have a look at this code later

src/rewrite/vct/rewrite/EncodeForkJoin.scala

bobismijnnaam · 2024-10-08T14:59:31Z

src/rewrite/vct/rewrite/EncodeIntrinsicLock.scala

@@ -143,7 +143,7 @@ case class EncodeIntrinsicLock[Pre <: Generation]() extends Rewriter[Pre] {

  override def dispatch(decl: Declaration[Pre]): Unit =
    decl match {
-      case cls: Class[Pre] =>
+      case cls: ByReferenceClass[Pre] =>


Unrelated: should intrinsicLockInvariant for ByValueClass throw an exception or something? Or maybe should we remove it from the Class trait?

For now I assumed it would be fine to just make the lock invariant a fixed true

src/rewrite/vct/rewrite/EncodeResourceValues.scala

bobismijnnaam

There you go! Good work!

bobismijnnaam · 2024-10-09T07:10:12Z

src/rewrite/vct/rewrite/ExtractInlineQuantifierPatterns.scala

-          val (patternsHere, body) = patterns.collect {
-            // We only want to inline lets that are defined inside the quantifier
-            letBindings.having(ScopedStack()) { dispatch(f.body) }
+          localHeapVariables.scope {


Nitpick: maybe the concept of variables and heap variables can be unified behind one scope/trait? They both have reasonable get/set implementations, and heap variables further extend that with an address. Just a thought, might not be worth the effort either?

bobismijnnaam · 2024-10-09T07:10:55Z

src/rewrite/vct/rewrite/GenerateSingleOwnerPermissions.scala

@@ -82,7 +82,7 @@ case class GenerateSingleOwnerPermissions[Pre <: Generation](
          ),
        )

-      case cls: Class[Pre] if enabled =>
+      case cls: ByReferenceClass[Pre] if enabled =>


As an exercise you might consider integrating heap variables into this pass, though maybe you already do so somewhere else?

src/rewrite/vct/rewrite/GenerateSingleOwnerPermissions.scala

src/rewrite/vct/rewrite/LowerLocalHeapVariables.scala

bobismijnnaam · 2024-10-09T07:13:34Z

src/rewrite/vct/rewrite/LowerLocalHeapVariables.scala

+    node match {
+      // Same logic as CollectLocalDeclarations
+      case Scope(vars, impl) =>
+        val (newVars, newImpl) = variables.collect {


Same here, scope.rewrite()?

I don't see the difference, maybe it's even rewriteDefault in this case?

bobismijnnaam · 2024-10-09T11:41:00Z

src/rewrite/vct/rewrite/lang/LangLLVMToCol.scala

+        (a, b) match {
+          case (None, _) | (_, None) => None
+          case (Some(a), Some(b))
+              if a == b || rw.dispatch(a) == rw.dispatch(b) ||


Comparing things in Post is highly dubious! Might trigger comparison of some uninitialized refs. I know sycl used to do this every once in a while and it's a pain.

Is it maybe possible to make a more explicit ordering for a fragment of types? As this is for llvm, you only need to do this for say ints, structs, and the spec types (seq/map etc.). And for the spec types you can maybe just say to not let them interleave with llvm types maybe, to simplify things? Relying on the behaviour of another rewriter is fragile. It's also fine if your ordering is very limited and needs to be extended every once in a while. What about only using moreSpecific, and maybe making moreSpecific more complete for the identity cases?

Yeah I knew this was dubious when I wrote it. I do it because I'm mixing LLVM and PVL in places and their types will only match after the LangSpecificToCol pass has finished. This was mostly a cop-out for not having to write out every LLVM type that might match a PVL type and vice-versa.

bobismijnnaam · 2024-10-09T11:41:38Z

src/rewrite/vct/rewrite/lang/LangLLVMToCol.scala

+      }
+      if (subMap.isEmpty) { value }
+      else {
+        // TODO: Support multiple guesses?


Only if there's an actual use case, I don't imagine any sane compiler does this (I'm probably wrong! :D)

src/rewrite/vct/rewrite/lang/LangSpecificToCol.scala

src/rewrite/vct/rewrite/lang/LangTypesToCol.scala

src/rewrite/vct/rewrite/lang/NoSupportSelfLoop.scala

superaxander force-pushed the pallas branch from 6cf3297 to 24b2e2d Compare March 14, 2024 10:52

superaxander force-pushed the pallas branch from 489697e to e7482b8 Compare May 8, 2024 08:55

superaxander force-pushed the pallas branch from e7482b8 to 685d5c6 Compare May 29, 2024 11:05

Rename VCLLVM to Pallas, Update to LLVM 17, Implement Stack Allocation

27a0383

superaxander force-pushed the pallas branch from 685d5c6 to 27a0383 Compare May 30, 2024 07:50

superaxander added 3 commits May 30, 2024 17:10

Fixed the LLVM tests, add a blame for LLVM generated nodes, add

5cb4a2e

formatting hook with clang-format

Use absolute path for finding Pallas shared library

0088121

Remove pallas binary from output jar

7b884e6

superaxander added 19 commits May 31, 2024 15:31

Create separate classes for by reference and by value classes

d52394f

Use ADT encoding for ByValueClasses

b01df4d

Add some axioms that speed up pointer verification

1666b35

Do no copy in expressions which do not yield TByValueClass

0dfde7c

Enable use of methods on by-value classes

be64b27

Set --useOldAxiomatization to test which test failures are because of…

aed3ba6

… me and which aren't

Improve struct encoding, rewrite tests for new permission syntax

53f19a8

Make the pointer for struct fields implicit simplifying most locations

7b50ef3

in the transformations stages

Merge remote-tracking branch 'origin/dev' into class-by-value

28eaee6

Fix the type numbers

1a31edd

Replaced type numbers with constants for ByValueClass

3f9f02b

Temporarily set a fork of silicon in build.sc to test in CI

bcd96b5

Update silver, clean up unused ByValueClass axioms

24197b8

First working version pointer casts

653cd5f

Also get rid of casts from Object to another class

88305b8

Ignore quantifier in SimplifyNestedQuantifiers if it has a trigger an…

d736fdd

…d clean up ClassToRef

Fix compilation error

2615361

Reduce code duplication in adtPointer, remove all non-pointer casts i…

6c8be0a

…n classToRef

Fix duplicate OptGet and add asType function to primitive pointer arrays

e141bcf

superaxander added 12 commits August 22, 2024 15:23

Add type checking for pointer casts

800f1dd

Add back blame erroneously removed by the previous commit

c7a723e

Fix unsupported cast test

f025706

Merge branch 'class-by-value' into pallas

05eec17

Make the LLVM file verify again

ca363da

Implement basic pointer type inference for LLVM

845dac4

Pass-through debug locations from LLVM

0e9751a

Improved pointer type inference

c8548db

Fix crash when transforming fib.ll

05be047

Convert LLVM loops into COL

6ebf84b

Fix broken test with LLVM pure functions

55b69e3

Fix unsoundness in pointer cast encoding

72cb421

superaxander mentioned this pull request Sep 17, 2024

Remove unused nlohmann/json dep for vcllvm.origin module #1238

Closed

Allow casting back up to "greater" type

bede2fc

superaxander force-pushed the pallas branch from df9ebc4 to bede2fc Compare September 20, 2024 14:21

superaxander added 3 commits September 30, 2024 16:55

Move PointerAdd logic for PointerLocations to ImportPointer

6cda75c

Merge remote-tracking branch 'origin/dev' into pallas

2434996

Add pointer post condition in attempt to fix injectivity issue

82b53fe

superaxander marked this pull request as ready for review October 1, 2024 12:17

superaxander added 3 commits October 4, 2024 13:27

Make all but one HaliVer example verify again with the new struct enc…

e2ce374

…oding

Merge remote-tracking branch 'origin/dev' into pallas

5f32f5b

Merge PointerArray fallibility and nullability

dee46c0

superaxander requested a review from bobismijnnaam October 4, 2024 14:49

Clean up the PR

8566c4b

superaxander force-pushed the pallas branch from a4991e2 to 8566c4b Compare October 7, 2024 12:11

bobismijnnaam reviewed Oct 8, 2024

View reviewed changes

bobismijnnaam reviewed Oct 9, 2024

View reviewed changes

superaxander added 2 commits October 9, 2024 17:00

Integrate Bob's feedback

60b419d

Merge remote-tracking branch 'origin/dev' into pallas

283fb6f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update VCLLVM (now Pallas) to LLVM 17, update to newest VerCors version, and convert more instructions to COL #1159

Update VCLLVM (now Pallas) to LLVM 17, update to newest VerCors version, and convert more instructions to COL #1159

superaxander commented Feb 28, 2024 •

edited

Loading

pieter-bos commented Feb 29, 2024

superaxander commented Feb 29, 2024

superaxander commented May 31, 2024

superaxander commented Oct 1, 2024

bobismijnnaam Oct 8, 2024

bobismijnnaam Oct 8, 2024

superaxander Oct 9, 2024

bobismijnnaam Oct 8, 2024

bobismijnnaam Oct 8, 2024

bobismijnnaam Oct 8, 2024

bobismijnnaam Oct 8, 2024

bobismijnnaam Oct 8, 2024

superaxander Oct 9, 2024

bobismijnnaam left a comment

bobismijnnaam Oct 9, 2024

bobismijnnaam Oct 9, 2024

bobismijnnaam Oct 9, 2024

bobismijnnaam Oct 9, 2024

bobismijnnaam Oct 9, 2024

superaxander Oct 9, 2024

bobismijnnaam Oct 9, 2024

Update VCLLVM (now Pallas) to LLVM 17, update to newest VerCors version, and convert more instructions to COL #1159

Are you sure you want to change the base?

Update VCLLVM (now Pallas) to LLVM 17, update to newest VerCors version, and convert more instructions to COL #1159

Conversation

superaxander commented Feb 28, 2024 • edited Loading

Summary

Detailed list of changes

General

Pallas

C (changes superceding #1172 and #1227)

PVL

To do:

pieter-bos commented Feb 29, 2024

superaxander commented Feb 29, 2024

superaxander commented May 31, 2024

superaxander commented Oct 1, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bobismijnnaam left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

superaxander commented Feb 28, 2024 •

edited

Loading