Skip to content

Latest commit

 

History

History
236 lines (143 loc) · 14.4 KB

FST-1027-fsi-references.md

File metadata and controls

236 lines (143 loc) · 14.4 KB

F# RFC FST-1027 - FSI Reference Model and extending #r

Summary

The following extensions to #r are proposed:

  • #r "[dependency manager]: <dependency manager command>" like
    • #r "paket: <paket command>"
    • #r "nuget: <package-name>, <package-version>"

This extends the "language" of #r to support NuGet packages, Paket dependencies, and potentially more.

Motivation

There is a strong desire to reference packages via #r instead of assemblies. There has also been experimental work to support Paket in this way.

Detailed design

To begin, existing behavior with #r will remain unchanged for FSI. That is, if you reference an assembly today via #r, it will always continue to work in the same way that it always has.

The introduction of #r "nuget: name, version" and of the extension mechanism underlying it will simplify referencing dependencies in F# script files in many use cases.

Design-Time

A few things should happen at design time when using F# Scripting in an editor (VS, VS for Mac, Ionide) which supports IntelliSense.

In the case of referencing a package via a specific tool (e.g., #r "nuget:..." or #r "paket:..."), F# interpreter will follow such process:

  1. Look for the extension assemblies in a blessed location which is not specified, and is tooling dependent, but can also be passed to the --compilertool: flag to fsi or FCS enabled tooling. If it can't find it, error message point at the locations scanned and the currently loaded extensions and abort, otherwise proceed.
  2. Call out to the tool to fetch packages and resolve dependencies.
  3. Reference the resolved .dlls via #r in an ephemeral script.
  4. Implicitly #load that ephemeral script to the current script.

Note that in the following examples, the implementation details are only indicative and may not match actual implementation of such extensions.

Example - Paket:

#r "paket: nuget Newtonsoft.Json ~> 9.0.1" // Example. All paket features are allowed

After the above is submitted as an evaluation, the editor will:

  1. Check if it can find a registered dependency manager which reacts to the prefix "paket:"
  2. If the paket FSI extension is found it will call a ResolveDependencies method on it a. Paket will internally check if it needs to restore the dependency or if everything is already in place. Any error from Paket will be shown. b. Paket will create an ephemeral script that #r all direct and indirect libraries for the specified dependency.
  3. Implicitly #load the generated script in the current script.

After everything is fetched, resolved, and loaded, types from Newtonsoft.Json will be available after opening the namespace.

Example - NuGet:

#r "nuget: Newtonsoft.Json, 9.0.1" // Example version, could be anything

After the above is typed, the editor will:

  1. Check if it can find a registered dependency manager which reacts to the prefix "nuget:"
  2. If the nuget FSI extension is found, check to see if the specified dependency is already in the nuget cache folder on the given machine. a. If the exist, create an ephemeral script which loads the .dll with #r, and then implicitly #load that script into the current one.
  3. If there is no such folder or dependency, will look for NuGet on the PATH (or some other blessed location).
  4. Call out to NuGet to fetch and resolve the dependency.
  5. Create an ephemeral script and #r the assemblies that NuGet just resolved. Any error from NuGet will be shown.
  6. Implicitly #load the generated script into the current script.

Example - Projects:

#r "project: MyProject"

After the above is typed, the editor will:

  1. Check to see if the containing project already has a reference to MyProject.dll.

    a. If it does not find it, it will call out MSBuild (from the PATH or other blessed location - if it can't find it, error) for MSBuild to build that project. b. Once it has MyProject.dll, it will reference it via #r in an ephemeral script.

  2. Implicitly #load the ephemeral script into the current script in the editor.

Runtime

Behavior should not be so different at runtime. When executing code interactive from a design-time scripting session, things are already resolved, so there is no extra consideration here - FSI has all the information it needs to execute code.

Starting FSI with the script

When launching FSI with a script using one of the new #r references, FSI will do the following:

  1. Check which tools is registered for a given #r prefix. Error out if no tool can be found.
  2. Let the tool check if each dependency exists in the known place for the specified tool or otherwise the tool resolve all dependencies.
  3. #r the resolved dependencies in an ephemeral script (or the script generated by Paket if using Paket).
  4. #load that script into the FSI session, thus giving it the references it needs to execute.

Running a script in an active FSI session

This scenario is as follows: I have launched FSI at some point, and now I wish to execute a script which may or may not have references that I already have loaded into FSI. The behavior should actually be the same as if FSI were launched with the script:

  1. Check which tools is registered for a given #r prefix. Error out if no tool can be found.
  2. Call out to the tool to resolve dependencies.
  3. #r the resolved dependencies in an ephemeral script (or the script generated by Paket if using Paket). Check version numbers against what we already have loaded in the FSI session. If any .dll names and versions match, don't #r them into the script.
  4. #load that script into the FSI session, thus giving it the references it needs to execute.

This is because:

  1. A reference may already be loaded for the current FSI session, but a different version was specified in that script. We need to allow that to error out, because the script code may depend on a higher or lower version than what is already loaded. That could cause a runtime error if we just allowed it to run on whatever we had already loaded.
  2. It's too complicated to attempt to only partially load stuff if something is already referenced.

Referencing a new dependency in an active FSI session

This is the interactive scenario. Interactively in an active FSI session, someone types, for example, #r "paket: nuget Newtonsoft.Json". Behavior should be identical to when running a script in an active FSI session:

  1. Check if each dependency to be loaded exists in the known place for the specified tool. If it does, #r that dependency when executing FSI.
  2. If a dependency does not exist, search for the tool in a known/blessed area. Error out if it can't be found.
  3. Call out to the tool to resolve dependencies.
  4. #r the resolved dependencies in an ephemeral script (or the script generated by Paket if using Paket). Check version numbers against what we already have loaded in the FSI session. If any .dll names and versions match, don't #r them into the script.
  5. #load that script into the FSI session, thus giving it the references it needs to execute.

As above, if they somehow specify something which is already loaded, we should just let FSI attempt to load it and generate the error.

Referencing packages containing native DLLs

Referencing packages containing native DLLs is supported with some limitations.

The support is achieved by adding an event handler to the .NET Core event AssemblyLoadContext.Default.ResolvingUnmanagedDll, which is triggered when resolving an unmanaged assembly in the context of a .NET assembly (e.g. a DllImport).

This handler consults current architecture and platform settings plus resolved package metadata and files across all dynamically referenced packages to look for a matching native DLL and then dynamically loads that DLL using an internal NativeAssemblyLoadContext that implements LoadNativeLibrary via LoadUnmanagedDllFromPath.

This process is not triggered for transitive native-to-native references, which are resolved with respect to the native DLL using standard rules of the operating system. Normally this means any transitive native dependencies must sit next to the native DLL at time of load.

Errors

FSI will have to display errors from the underlying assembly resolver. For example, if someone specifies #r "paket: <paket-command>" and Paket generates an error, FSI must display that.

There is also another scenario to consider, which we will explicitly not handle. For example, given the following:

#r "nuget: Foo.Bar" // Depends on System.Whatever version 4.1.0
#r "nuget: Baz.Qux" // Depends on System.Whatever version 4.2.0

// some scripting code

This will generate an error because System.Whatever version 4.1.0 will already be loaded. It's not possible to load two versions of the same assembly in the same session without doing so within an AssemblyLoadContext, which is not something we will do at this time. Thus, a meaningful error must be generated stating that the nuget: Baz.Qux requires System.Whatever version 4.2.0, but version 4.1.0 was already loaded.

We may consider using AssemblyLoadContext in some clever way in the future, but it's very difficult to get right and well outside the scope of what we need to support for a .NET Core 2.0 timeframe.

Handler resolution

It works in all FCS driven tool (or even fsc itself) through :

  • having the extension assembly next to the process one
  • the --compilertool: flag, which can point to a directory containing the extensions

By convention, tools that load extensions in other locations, should also load extensions that are sitting in same folder, unless there are good reasons not doing so.

By convention, tools that load extensions, should never fail to do so for binding redirect matters related to FSharp.Core version the extensions must have been compiled against.

Note: additional stable locations were considered but not implemented in earlier drafts of this RFC.

proposed design in initial draft (this is NOT implemented)

FSI/Design time support will look at the following places in order:

  • if the script is a physical file, check current folder and browse all parent folders looking for .fsharp/fsx-extensions folder in each one
  • look into ~/.fsharp/fsx-extensions
  • look into .fsharp/fsx-extensions folder next to fsi.exe

gather all the distinct dll names, order of precedence favoring those in the same order shown above, and load them in the process if their assembly contains an arbitrary attribute (resolved by name rather than dependency on external library) and types marked with same attributes.

On .NET Framework, the dll are loaded through Assembly.LoadFrom.

On .NET Core the loading mechanism is yet to be determined (TBD), although the extensions will need to be targeting .NET Standard 1.6 or higher.

The fact that those dll will be loadable for both .NET Core and .NET Framework compilers is yet to be determined (TBD).

In context of tooling, the location are scanned initially once per interactive session, but as evaluation of additional script occurs, it might be necessary to scan additional locations (same rules apply, but it will only scan the additional places from first point in the list above, and won't scan again those folders already scanned).

If a handler key (such as nuget or paket) is found several times, report a warning showing location of assemblies and showing which one was picked (we apply same order of precedence as for finding the assemblies).

Tooling and FSharp.Compiler.Service

Language service and FCS tooling must be efficient given incremental editing of files.

Design-time resolution of scripts is orthogonal to runtime and carries some concerns:

  • Editor performance

  • A delay in various tooling due to nuget resolution happening in the background (and the user not knowing why this is happening)

  • Ensuring that we don't re-resolve things unnecessarily

  • Future extensions to indicate background work is happening for script editing, like how project integration tooling does today

Drawbacks

A potential for increased complexity in Scripting. This also has downstream effects in documentation and the general mindshare on how to reference assemblies for F# scripting.

Alternatives

Another alternative is to add more directives, such as the following:

  • #r-impl "<path-to-impl-assembly>
  • #r-nuget "<package-name>, <package-version>
  • #r-paket "<paket-command>: <package-or-dependency>
  • #r-project "<projectname>"

A drawback here is that this is not something that C# would likely do in C# scripting. Although the goal isn't to align with whatever C# is doing, differentiating on directives doesn't seem worth it unless a new directive is added for each type of operation we wish to perform. Having a new directive for one type of operation (e.g., #r-impl), but no new directive for the rest would be strange and inconsistent.

Unresolved questions

  • One technical concern about this mechanism is that the F# scripting model implementation currently has a set of default references to a bunch of DLLs. For facade assemblies these can resolve to, say, System.IO 4.1.1.0. But later packages may need later versions of facade DLLs such as System.IO. It seems that the package manage should be given the opportunity to decide what to do with these default references so that a more up-to-date set of references can be determined at scripting engine startup

  • The discussion here is relevant: dotnet/fsharp#3307 (comment)

  • Should we be adding a #r-typeprovider facility for package managers to decide to reference type providers independent of any specific runtime DLL

  • Comment here:

At the high level I'm just wondering if we can use .targets/.props in the nuget package to compute the relevant type provider references, and incorporate this into the #r "nuget: Foo" and/or #r "paket: Foo" mechanism for incrementally added references during scripting.