Does `functor` have the right semantics for Flux? #49

CarloLucibello · 2021-10-11T20:21:17Z

Due to the default fallback

functor(T, x) = (), _ -> x

in Functors.jl, every custom type is considered a leaf (i.e. it has no children) and we have to sprinkle @functor MyType everywhere in Flux and in user code.
We could remove all this boilerplate by having by default what @functor MyType currently does. Then 99% of people could live their life completely unaware of @functor/functor (historically poorly documented and poorly understood) and only use the much clearer trainable(x::MyType) in case they need to customize the parameter collection.

Besides the transition, which I think could be made rather smooth, does anyone see any counterindication in changing the default?

The text was updated successfully, but these errors were encountered:

DhairyaLGandhi · 2021-10-11T20:40:34Z

The approach in Functors was chosen to make conservative estimates about the expected behavior of Julia code. In many cases, custom types use multiple dispatch and overload definitions exported by base or other packages to suit their needs. It includes overloading functions that may be used for different reasons but show up in ML frequently - like getindex, LinAlg operations and the like which requires us to use those definitions.

If we remove the consideration of when to stop recursing, we would not be dispatching to the right methods by default. This can produce very hard to debug cases.

Also, @functor already supports giving a tuple of fields to collect parameters from and it would leave everything else untouched. Something like @functor MyType (a,b,...)

To be flexible, we certainly need to be able to distinguish leaves from non leaves, and replacing that need with a different API would need something close to the current definition. It is possible to do with assuming that every object can be recursed into - it requires us to mark things differently, see https://blog.ploeh.dk/2018/08/06/a-tree-functor/ for examples of how reading collections and enumerations and so on require different implementations.

ToucheSir · 2021-10-12T00:46:30Z

I hope you're ready, Carlo, because this is (both technically and philosophically) one heck of a rabbit hole ;)

To start, let me say that many of the questions around the design of Functors are reminiscient of those encountered during the design of ChainRules. For example, how to represent the functored form of a value is almost the same question as how to represent the tangent. You may be able to get more out of the wonderful documentation there than my ramblings below.

In short, "functors" in Functors.jl really ought to be base functors of the values they represent. Why do we need base functors? Because not all custom types in Julia are fully generically parameterized and many algorithms for working with functor tree (or DAG, in our case) traversal require more type fluidity. You can see that similar libraries like Flatten.jl require fully generic types for this reason.

However, making a proxy type for everything is both inefficient and unnecessary. Just like many primitive and array types are perfect natural tangent representations of themselves, so too are many of the same types already valid functors. Hence we can make the distinction between structural and natural functors, i.e. some hypothetical Functor{T} and just T. I'll note with some smugness here that Haskell, by virtue of its type system/stdlib, can't do this as well! Just like a Tangent{T}, a Functor is mostly a smarter (Named)Tuple that can be used as a flexible proxy/internal representation for dispatch and other functions such as traversal.

All that said, how do we create Functor{T}s from Ts? Here we have two options:

Opt-in code generation via @functor like we have now
(Opt-out) automatic synthesis a la ChainRulesCore

Which brings us back to the topic of this issue. Originally, I too thought that doing anything but option 1 would be too risky. However, the success and relative lack of fires popping up with ecosystem-wide adoption of ChainRules seems to counter that idea. Here I would highly recommend skimming the epic, multi-issue discussion around natural vs structural tangents in ChainRulesCore, culminating in @willtebbutt's proposal in JuliaDiff/ChainRulesCore.jl#449.

CarloLucibello · 2021-10-12T02:13:27Z

Thanks Brian, that was highly informative

CarloLucibello · 2022-11-15T08:32:43Z

@mcabbott @darsnack and I are generally in favor of this, although it should be carefully tested before release. @ToucheSir?

ToucheSir · 2022-11-15T23:28:14Z

I'd be in favour of automatic synthesis as well. One worry I had at the time was structured arrays, but with #33 we appear to have a plan for those now.

mcabbott · 2022-11-16T00:21:59Z

Structured arrays do seem like a concern, as #33 required a hand-written inverse, impossible for some types. We could easily exclude them from a traverse-anything scheme.

CarloLucibello mentioned this issue Oct 16, 2021

functor RefValue #26

Merged

CarloLucibello mentioned this issue Apr 24, 2022

@functor is a bad name and nobody knows to use it FluxML/Flux.jl#1946

Open

CarloLucibello transferred this issue from FluxML/Flux.jl Nov 15, 2022

CarloLucibello added this to the v0.5 milestone Nov 15, 2022

CarloLucibello added breaking and removed breaking labels Nov 15, 2022

CarloLucibello mentioned this issue Nov 25, 2022

functor by default #51

Merged

3 tasks

CarloLucibello mentioned this issue Jun 16, 2023

Should everything be a functor by default? FluxML/Flux.jl#2269

Closed

CarloLucibello mentioned this issue Mar 11, 2024

add getkeypath and haskeypath #76

Merged

2 tasks

CarloLucibello closed this as completed in #51 Nov 1, 2024

github-project-automation bot moved this to Done in Explicit Parameter Transition Nov 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does `functor` have the right semantics for Flux? #49

Does `functor` have the right semantics for Flux? #49

CarloLucibello commented Oct 11, 2021 •

edited

Loading

DhairyaLGandhi commented Oct 11, 2021 •

edited

Loading

ToucheSir commented Oct 12, 2021 •

edited

Loading

CarloLucibello commented Oct 12, 2021

CarloLucibello commented Nov 15, 2022

ToucheSir commented Nov 15, 2022

mcabbott commented Nov 16, 2022

Does functor have the right semantics for Flux? #49

Does functor have the right semantics for Flux? #49

Comments

CarloLucibello commented Oct 11, 2021 • edited Loading

DhairyaLGandhi commented Oct 11, 2021 • edited Loading

ToucheSir commented Oct 12, 2021 • edited Loading

CarloLucibello commented Oct 12, 2021

CarloLucibello commented Nov 15, 2022

ToucheSir commented Nov 15, 2022

mcabbott commented Nov 16, 2022

Does `functor` have the right semantics for Flux? #49

Does `functor` have the right semantics for Flux? #49

CarloLucibello commented Oct 11, 2021 •

edited

Loading

DhairyaLGandhi commented Oct 11, 2021 •

edited

Loading

ToucheSir commented Oct 12, 2021 •

edited

Loading