[ocaml-platform] on the need and design of OCaml namespaces

Thu Feb 28 12:44:59 GMT 2013

I think the whole discussion of "compilation argument vs. information
in source header" is misguided. The two places are essentially
isomorphic, and a robust system could make them equi-expressive. In
particular, the idea that what putting namespace information outside
the source code is a regression is incorrect, as the semantics of an
OCaml program *already* depends on the state of the filesystem on
which it is compiled and the search path set at compilation time.
Namespaces by themselves introduce no regression or improvement on
this aspect.

There are practical reasons why one would favor one or the approach in
a particular case, but it is not possible to satisfy all criterions
simultaneously.

1. People are very sensitive about *source language* changes, not so
much about changes in compilation interface (perceived as anecdotical
tooling); this makes it easier to suggest new flags than new language
features. Arguably both should treated with the same care, and in
particular design decisions regarding tooling should be inspected with
scientific rigor. This doesn't happen in practice (which of the
proposed tool changes during this discussion was accompanied with a
rigorous specification of the semantics?), so tooling changes are
easier and faster to get in a language release, and easier to get
half-specified and half-baked.

2. Such header information may need to be shared by several files of a
project and may become a source of painful redundancy. Compilation
flags are an easy way to factorize these redundancies away. In
principle this could also be done in the source code (by having
separate files with this source information, and a semantic
#include-like language construct), but this means yet another language
change that has to be carefully designed.

3. People naturally share source snippets without thinking of sharing
the surrounding contextual information (the state of the compilation
environment), so in practice having information in the source helps to
be more robust wrt. environmental change. In mails, Q&A sites, etc.,
sharing source text is better than a tarball with a _tags file or
whatnot.

It's well and good to hope that any semantic features associated to
namespaces will eventually end up in the source language as well. But
that may be significantly harder than agreeing on those features in
the first place. Because picking a good syntax is really hard, and
because this may require more change to other tools that are not part
of the OCaml compiler distribution, and is therefore more costly for
everyone involved (this is Alain's understandable reason for shunning
any such change). So I would like to encourage people to discuss which
semantics they need and want, what a good design would be, rather than
whether to implement this inside or outside the source language.

On Thu, Feb 28, 2013 at 10:01 AM, Erkki Seppala
<flux-ocaml-platform at inside.org> wrote:
> TLDR; I agree with the parts of proposal suggesting statement "namespace
> X" to be introduced to .ml and .mli files and then statements "open
> namespace X", "let namespace Y = X" and their expression forms. And that
> this all should be controlled from the source code, not the build
> system.
>
> Yaron Minsky <yminsky at janestreet.com> writes:
>
>>   My biggest objection to having opens be at the build system level is
>>   that it makes your code more ambiguous.  When you do namespace
>>   manipulations, you very much want to see what's happening by
>>   inspecting the source.  We have a vigorous code review system here,
>>   and I don't want to start adding code review of the build rules to
>>   it, and this change would require that.
>
> I agree that namespaces should be defined and accessed through the
> source code. Not the least because it depends on the project's chosen
> build system on how I should go on changing those definitions. While
> currently the situation is the same regarding findlib package
> management, I think it is worth considering that perhaps this kind of
> namespace system could automatically lead into the build system
> (ocamldep?)  inferring which packages you need to use and eliminate that
> piece of configuration from the build system. Only special cases would
> need more configuration.
>
> While C++ might not be the golden standard to look upon on language
> development, perhaps it can serve as an inspiration. Its namespace
> system seems to work pretty nicely without complaints from the
> developers. It has the following constructs:
>
> namespace X {
>   /* introduce values for the namespace X */
>   int a = 42;
>   namespace Z {
>     /* introduce namespace X::Z */
>     int b = a;
>   }
> }
>
> namespace X {
>   /* introduce more values for the namespace X */
>   int z = 5;
> }
>
> namespace Y = X; /* alias a namespace */
>
> int b = X::a; /* access a value in a namespace */
>
> using namespace X; /* bring X to current scope (works in function scope
>                       as well */
>
> using X::a; /* bring one value from a namespace to current scope */
>
> In implementation side it works by simply relabeling symbols in the
> generated object files which makes the namespaces open.
>
> C++'s compilation system with include files is of course very different
> from OCaml, but a similar solution to OCaml would mean having a hidden
> internal name (such as namespace_modulename or possibly even something
> that like namespace#modulename) for the module created from that
> compilation unit. It would need to be limited to a single top-level
> namespace specification per compilation unit, so it would work sort of
> in reverse compared to C++. For example if x.ml contained statement
> "namespace Foo", a module Foo#X would be generated instead of plain X.
>
> This solution would not allow the same module names from different
> namespaces to coexist in a single directory, but I don't see this as an
> essential feature. The source file names already would be the same, and
> OCaml compiler produces object files to the same directory as the
> source. If you have a build system copying files around, it could (and
> perhaps should!) preserve the directory hierarchy.
>
>> - NAMESPACES WITH VALUES.  I have argued for allowing the opening of a
>>   namespace to also implicitly open some modules, this essentially
>
> The previosuly described system would also eliminate the C++-like
> ability of introducing values to a namespace, unless special module
> names or other changes, such as an automatic opening of some modules,
> are implemented. It would be nice to be able to define such top-level
> values from multiple compilation units, but I'm uncertain how that would
> be implemented.
>
> This approach would need the compiler to find many similarly named .cmi
> files and decide the proper one to use based on their contents, as
> mentioned in a previous proposal.
>
> Everyone seems to agree that # is a good operator for accessing modules
> (or values) from namespaces. Or maybe we could go with \ ;-).
>
>>   But what Alain is proposing is to make opening a namespace silent at
>>   the source level.  This strikes me as a grave error.
>
> I agree. At worst this could mean that in large projects developers let
> the build system recompile their files so they can see what namespaces
> they are using. It seems much better to me to just read it from the
> source code, which is the same regardless of the project.
>
> --
>   _____________________________________________________________________
>      / __// /__ ____  __               http://www.modeemi.fi/~flux/\   \
>     / /_ / // // /\ \/ /                                            \  /
>    /_/  /_/ \___/ /_/\_\@modeemi.fi                                  \/
> _______________________________________________
> Platform mailing list
> Platform at lists.ocaml.org
> http://lists.ocaml.org/listinfo/platform