Using a PPX to Modify the AST of Another Module

ComanderP · August 9, 2024, 5:15pm

Hello OCaml community,

I’m currently exploring the capabilities of pps in a personal project and was wondering if it is possible to use a PPX to modify the AST of a different module within the same library. Specifically, I’d like to achieve something along these lines:

In one file, I define a type, for example:
```
type foo = Foo
```
Then, using a PPX on the previous definition, I want to add a constructor to a different type (let’s call it Bar) in another module within my library. For example:
```
type bar = Bar1 | Bar2
```
After running the PPX, I want the bar type to be automatically updated to include a new constructor FooBar, resulting in:
```
type bar = Bar1 | Bar2 | Foo
```

I’m interested in understanding whether it’s feasible to implement such behavior using a PPX, and if so, what the general approach would be. Specifically:

Is it possible for a PPX to modify the AST of a different module?
What would be the main challenges or limitations associated with this approach? I know extensible variant types are a thing for example, but I would like to refrain from using them.
Are there any existing PPX examples or projects that achieve something similar?

Any guidance or examples would be greatly appreciated!
Thank you in advance for your help.

sim642 · August 10, 2024, 8:05am

The only somewhat similar thing I can think of is ppx_import, but I don’t think it has what you need out of the box.
AFAIK ppx_import is quite special as to how it works, but maybe you could borrow from it to do some cross-module preprocessing.

Besides extensible variant types, there are also polymorphic variants which easily allow one type to be included in another, which maybe is what you need?

ComanderP · August 10, 2024, 3:30pm

If I understand correctly, using polymorphic variants would require me to change the definition of the type in the 2nd module every time I want to add a new type in the 1st module, right? If so that defeats the purpose of what I’m trying to do.
Thanks for the info on ppx_import though! Maybe the easy way for now is to use an extensible variant type…

sim642 · August 10, 2024, 3:37pm

No, for example:

type a = [`Foo | `Bar]
type b = [a | `Baz]

ComanderP · August 10, 2024, 7:00pm

I see. I don’t think that’s exactly what I want, but thank you anyway!

davesnx · August 12, 2024, 7:31am

Are you trying to make & type operator from TypeScript into OCaml, by any chance?

ComanderP · August 13, 2024, 10:52pm

Not really, just a personal project where I would like the user to be able to “register” some types in the library.

Chet_Murthy · August 16, 2024, 2:42am

I’ve been meaning to reply to you but I never remember to do so when I’m at my laptop (only at my phone). So this is late. What you’re trying to do might not be very hard, actually.

I don’t know how PPXlib works, but in principle, a PPX rewriter could easily add information to a “global context” which could be passed from one PPX rewriter to the next, and could affect the subsequent PPX rewriter. Again, I don’t know how PPXlib does it, but in the pa_ppx family of PPX rewriters (based on Camlp5, so not compatible with PPXlib), the ppx_deriving rewriter works as two passes.

If you recall, in ppx_deriving, there’s a rule about how names are scoped, so that if two derivers (let’s say d1 and d2) both want to use an attribute (like [@name ...], hence a name-clash) they can arrange so that to use @name via @d1.name and @d2.name). It’s been a long time since I wrote the code, so I don’t remember how exactly the rules worked, but I remember that I wrote the ppx_deriving rewriter thus:

scan the entire module to which ppx_deriving is being applied, looking for instances of derivers. From all those derivers, get the list of attributes that can be applied to types, and find the attributes that name-clash and name a list of those.
in a second pass, scan thru the module, applying each deriver, and for each, using the information from step #1, you can decide whether (e.g.) @name is an attribute for deriver d1, or you need to see @d1.name.

I fear this isn’t so clear, but my point is, the first pass is a PPX rewriter that accumulates a context, that it passes to the second pass PPX rewriter, that uses it to rewrite code in a manner -driven- by that context.

Topic		Replies	Views
Creating/using extensions that are not possible to impement with Ppxlib Ecosystem	2	552	March 1, 2021
Creating TYPES from PPX Learning	2	1151	February 22, 2019
How to write ppx package that work with multiple OCaml AST version Learning ppx , ppxlib	7	1228	February 21, 2022
Using Ppxlib with Module Signatures Ecosystem ppx , ppx_deriving , ppxlib	13	683	April 4, 2023
A question about meta-quotation (via PPX) Ecosystem ppx	5	770	March 12, 2023

Using a PPX to Modify the AST of Another Module

Related topics