Once you've got return type polymorphism, you really start to miss it in other l...

louthy · on Feb 5, 2018

Not sure if I exactly follow, but this is an implementation of Monoid in C#. The interface can be seen as the type-class definition. The structs are the equivalent of class instances.

If you look at the `static class Monoid` then you can see a general implementation of mconcat which returns an A and works with Empty and Append.

The Program at the end shows it in use with List and String types.

    public interface Monoid<A>
    {
        A Empty();
        A Append(A x, A y);
    }

    public struct MString : Monoid<string>
    {
        public string Append(string x, string y) => x + y;
        public string Empty() => "";
    }

    public struct MList<A> : Monoid<List<A>>
    {
        public List<A> Append(List<A> x, List<A> y) => x.Concat(y).ToList();
        public List<A> Empty() => new List<A>();
    }

    public static class List
    {
        public static S Fold<S, A>(this IEnumerable<A> ma, S state, Func<S, A, S> f)
        {
            foreach(var a in ma)
            {
                state = f(state, a);
            }
            return state;
        }

        public static List<A> New<A>(params A[] items) => new List<A>(items);
    }

    public static class Monoid
    {
        public static A Concat<MA, A>(IEnumerable<A> ma) where MA : struct, Monoid<A> =>
            ma.Fold(default(MA).Empty(), default(MA).Append);

    }

    class Program
    {
        static void Main(string[] args)
        {
            var strs = new[] { "Hello", ",", " ", "World" };
            var lists = new[] { List.New(1, 2, 3), List.New(4, 5, 6) };

            var str = Monoid.Concat<MString, string>(strs);
            var list = Monoid.Concat<MList<int>, List<int>>(lists);
        }
    }

Obviously it's not as elegant as Haskell, but does this not fit your requirement?

marcosdumay · on Feb 5, 2018

Can you see how that small change on the interface makes it awful to write code that is generic on Monoid?

It does not fit the requirements. It makes the kind of code people write in Haskell absolutely not viable.

louthy · on Feb 5, 2018

Why don't you deal with specifics? I am not talking about its attractiveness, I have already made clear that I think Haskell is more elegant. But, why is this "not viable" or "does not fit the requirements"? I have already shown how a totally generic version of mconcat can be implemented - so how is this not writing "code that is generic on Monoid"?

marcosdumay · on Feb 5, 2018

One very specific. I create a logging module on Haskell. There I put some high order functions like this:

    module Log where

        -- | Runs the code inside a catch, logs any exception
        exceptions :: IO a -> IO a
        
        -- | Logs every code execution
        access :: IO a -> IO a

        -- And so on, for several different kinds of logging

Then on the main code I do:

    import qualified Log as Log

    main = do
        -- Lots of stuff
        Log.errors . Log.access $ readSomeData
        Log.errors $ whateverThatCanFail
        Log.access $ shoudlntFailButICareAboutRunning

On C# each use of those logging functions will be more verbose than copying the entire function body in place. And almost as brittle.

louthy · on Feb 6, 2018

Sorry, where did I say C# is less verbose than Haskell? And what has that got to do with implementing polymorphic return values?

moomin · on Feb 5, 2018

You're right, that does work, but you're having to declare every type everywhere. Each time I add another operation to your example, the type declarations get worse and worse. My point is, return type polymorphism and ad-hoc polymorphism make a whole bunch of things ergonomic. The fact that C# can do the same thing in a way that no-one would use is kind of the point.

louthy · on Feb 5, 2018

> declare every type everywhere

What does that mean? The types are declared exactly once.

If you mean C# doesn’t do type inference, well yeah.

But, I still don’t see how this changes my original point, which was merely to demonstrate how to do return type polymorphism in a language other than the sacred Haskell. As the original post gives the implication that it’s not achievable when not working in Haskell.

moomin · on Feb 5, 2018

Ah, I see the confusion. Shouldn’t have used “declare” in that context. My point was that you needed to specify the type every time you used it. So the more operations you add the more your code looks like generic soup.

I think you’re confused as to what return type polymorphism is, though. It’s the ability to have the compiler infer the type of something from its site of use. So your example doesn’t exhibit it because the types need to be specified.

So, the following code works in Haskell

x = [1,2,read “3”]

y = [“1”,”2”, read ”3”]

In the first case read returns an Int, in the second a String. This is a useless toy example but it turns out to be really useful and to make ergonomic a huge number of things that are just painful in the languages we normally use.

Everything’s achievable in every language, but what’s convenient changes massively.

louthy · on Feb 6, 2018

> I think you’re confused as to what return type polymorphism is, though. It’s the ability to have the compiler infer the type of something from its site of use.

I think you may be confused about what return type polymorphism is tbh. Type inference != polymorphism. Polymorphism is a type-system feature. Type inference is a separate process to ascertain concrete types at compile time.

For example, the only type inference that C# really does is `var`:

    var x = foo();

That is no different to me writing:

    int x = foo();

(assuming that foo returns an int that is).

The compiled version of both of those code snippets are the same. The fact that I specified the `int` directly doesn't make those chunks of code any different. Or, make one less valid.

You're right it's absolutely the case that to do this I'd need to specify the types I want to work with, and that's because C# is shit at inferring anything. It doesn't change the fact that the return type is polymorphic for Concat in my example. It's _ad-hoc polymorphic_, and no lack of type-inference will change that.

But this still misses the entire point of what I wrote. Which was to counter the point made in your original post:

> Once you've got return type polymorphism, you really start to miss it in other languages.

Initially I felt I was helping by pointing out that if you're missing it in other (mainstream) languages, then there's absolutely a way of doing it.

> The simplest example possible is "mempty" mempty :: a > gets the "default" value of a. Which makes no sense in a language where you need an instance to have polymorphism.

And secondly I felt it important to use your example of `mempty` to show how to do it.

lmm · on Feb 5, 2018

This uses an unsafe "default(MA)" construct to hack around the type system, right? There's no way to write code like this and not have your code fail with NPEs at runtime except for manually inspecting every "default(...)" call to check that it's called on the right kind of type.

louthy · on Feb 5, 2018

Wrong, MA is constrained to struct; and structs can’t be null

lmm · on Feb 5, 2018

Ok, but someone has to manually check that, since someone could write "default(MA)" with MA not being a struct and this wouldn't be obvious at the call site. And even if we do find a way to automatically enforce that it is a struct, default won't necessarily put it in a valid state, right? (e.g. if the struct contains reference types then we've just moved the problem one step down: the struct can't be null but the things inside the struct can be null).

Edit: Also does this "default" mechanism extend to allowing us to compose typeclass instances out of smaller typeclass instances? E.g. the monad instance for Writer is defined as:

    instance (Monoid w) => Monad (Writer w) where 
        return a             = Writer (a,mempty) 
        (Writer (a,w)) >>= f = let (a',w') = runWriter $ f a in Writer (a',w `mappend` w')

i.e. we can obtain a Monad<Writer<W, ?>> for any W for which we have a Monoid<W>.

louthy · on Feb 5, 2018

Yes, it’s possible for programmers to write bugs.

lmm · on Feb 5, 2018

Well if you don't care about type safety then there's no point caring about any typesystem features, since you can emulate them by replacing all of your types with "any".

louthy · on Feb 5, 2018

Sorry, where on earth did I say I don’t care about type safety? Why do you need to take this point to a total extreme? I simply gave an example of why the comment about mempty was wrong; but now I have to defend C#’s type system?

Clearly C#’s lack of type inference, sanctioned ad-hoc polymorphism (even though it can’t be achieved in the way I have shown), and higher kinds makes it less expressive as a language. I’m not going to argue that point.

But this kind of language holy war is frankly pathetic. Attacking every detail of an implementation (that works) is unnecessary.

Yes, it’s easier to get null reference exceptions in C# compared to Haskell. That is the result of poor decisions made when the language was designed. So, yes, today I will use ad hoc polymorphic techniques and yes I will have to make sure I constrain to structs, that’s life.

lmm · on Feb 5, 2018

> Sorry, where on earth did I say I don’t care about type safety? Why do you need to take this point to a total extreme? I simply gave an example of why the comment about mempty was wrong; but now I have to defend C#’s type system?

If you're going to dismiss safety issues in your approach with "Yes, it’s possible for programmers to write bugs." then there's no point having the conversation, because that's an equally good argument for not having a type system at all.

> But this kind of language holy war is frankly pathetic. Attacking every detail of an implementation (that works) is unnecessary.

It's not a "detail", if you can't do it safely then that undermines the point of doing it at all. If we were willing to be unsafe we could just cast to the desired type.

> Yes, it’s easier to get null reference exceptions in C# compared to Haskell. That is the result of poor decisions made when the language was designed. So, yes, today I will use ad hoc polymorphic techniques and yes I will have to make sure I constrain to structs, that’s life.

I'd sooner pass the module dictionary explicitly, like one does in ML, than adopt a technique that would normalize having "default(...)" in my codebase.

louthy · on Feb 6, 2018

> If you're going to dismiss safety issues in your approach with "Yes, it’s possible for programmers to write bugs." then there's no point having the conversation, because that's an equally good argument for not having a type system at all.

Absolute nonsense. I didn't dismiss safety issues at all. I dismissed your claim that having to specify a `struct` constraint somehow makes the feature unworthy.

C# has null, that's a fact of life, it's not dismissive to realise that a (granted, very annoying) part of the job of writing C# is dealing with null. So, using this doesn't make this technique any less safe than any other way of writing code in C#. So, yes, programmers will occasionally write null-dereference bugs in C# - that's the price we pay for bad language implementation decisions.

Stating "that's an equally good argument for not having a type system at all." is clearly hyperbolic nonsense.

> If we were willing to be unsafe we could just cast to the desired type.

But it isn't unsafe! Not specifying a `struct` constraint is a bug. If you provide the constraint then it's safe. Trying to compare that to a dynamic cast where you have no type-system enforcement to one where you do is just idiotic.

> I'd sooner pass the module dictionary explicitly, like one does in ML, than adopt a technique that would normalize having "default(...)" in my codebase.

At no point was this trying to force you to use this technique. It was a reply to "Once you've got return type polymorphism, you really start to miss it in other languages. The simplest example possible is mempty".

I use this technique very successfully a lot, and the exact mechanism (of using `default`) is in the process of being wrapped up into a new type-classes grammar for C# [1]. So, I guess you'd probably prefer to wait for that...

[1] https://github.com/MattWindsor91/roslyn/blob/master/concepts...

lmm · on Feb 6, 2018

> C# has null, that's a fact of life, it's not dismissive to realise that a (granted, very annoying) part of the job of writing C# is dealing with null. So, using this doesn't make this technique any less safe than any other way of writing code in C#.

If using this technique requires breaking one of the rules that you have to follow to avoid getting nulls in C# then the technique is a safety problem.

> Not specifying a `struct` constraint is a bug. If you provide the constraint then it's safe.

Ok, but how do you enforce that? If you've got a technique that requires manual review and reasoning at a distance to use safely, then again we're no better off than we would be using dynamic casts.

> At no point was this trying to force you to use this technique. It was a reply to "Once you've got return type polymorphism, you really start to miss it in other languages. The simplest example possible is mempty".

If you don't have a typesystem feature in a safe way, you don't have it.

louthy · on Feb 6, 2018

> Ok, but how do you enforce that? If you've got a technique that requires manual review and reasoning at a distance to use safely, then again we're no better off than we would be using dynamic casts.

More hyperbole. Failing to constrain may lead to a null reference exception. Just like passing a reference to any method anywhere in C#. It is no better and no worse than any other C# code. However it does allow for ad-hoc polymorphic return values. Which is the entire point. That is not the same as returning a dynamic value, which is a type that propagates dynamic dispatch wherever it's passed. A failure to capture a null reference bug means on first usage it will blow up - so you fix the code and everything is type safe.

> If you don't have a typesystem feature in a safe way, you don't have it.

The feature is safe. Your argument is the same as saying C# doesn't have classes because a reference can be null, or C# doesn't have fields because a field can be null. All throughout this frankly tedious discussion you have somehow conflated having a bug in an application with having no type system at all. C#'s type system is obviously nowhere near as impressive as Haskell, but C# is actually used in the real world much more, and so if someone wants polymorphic return values then they can. I mean they can anyway through inheritance, never mind the ad-hoc approach I demonstrated - but whatever yeah?

lmm · on Feb 6, 2018

> Failing to constrain may lead to a null reference exception. Just like passing a reference to any method anywhere in C#.

But you can adopt a small set of rules that are locally enforceable (and practical to use in an automatic linter) to prevent this happening (just as Haskell is safe even though unsafePerformIO exists, because you can adopt a small set of locally enforceable rules like "never use unsafePerformIO"). Unfortunately one of those rules has to be to never use default().

> That is not the same as returning a dynamic value, which is a type that propagates dynamic dispatch wherever it's passed. A failure to capture a null reference bug means on first usage it will blow up - so you fix the code and everything is type safe.

Unfortunately default() isn't fail-fast in all cases - when used with e.g. a struct type containing a reference type, it will create the value in an invalid state (containing a null reference) but you won't necessarily notice until you come to use the value, arbitrarily many compilation units away. So it's just as dangerous as a dynamic value.

> All throughout this frankly tedious discussion you have somehow conflated having a bug in an application with having no type system at all.

In almost any language you can have polymorphic return values without complete type safety. The feature that Haskell has here isn't that you can have polymorphic return values - it's that you can have polymorphic return values safely. Showing an unsafe implementation of polymorphic return values in some other language is pointless and irrelevant.

louthy · on Feb 6, 2018

> Unfortunately default() isn't fail-fast in all cases

It's purely a means of dispatch, if someone wants to put member variables in that are never used - good luck to them. For some reason you think that because C# doesn't protect you from being an idiot you can't do return type polymorphism. Well that's completely incorrect and you know it. The reference of default(A) isn't something that's passed around - yes the method you dispatch to has access to `this`, but what's the point of A: declaring a variable in a 'class instance' and B: using it when it's in an invalid state? It's what a moron would do. I don't call `((string)null).ToString()` because it's fucking stupid. But I assume in your world that means C# can't do method dispatch by reference?

Just because somebody can do something stupid doesn't devalue any particular technique that requires you to not do the stupid thing. Otherwise, you may as well delete C# as a language - because it's trivially easy to do stupid things. In fact software engineering wouldn't even have gotten off the ground if that was a pre-requisite.

But clearly people do produce software in it - which proves your arguments wrong.

> Showing an unsafe implementation of polymorphic return values in some other language is pointless and irrelevant.

Show me where it was mentioned in the original comment about safety? Not there is it. You just jumped in with inaccurate claims and went on some tangent about type-system safety, like C# would ever win any type-system safety contests.

Leaving asside the fact that all of your arguments about safety are nonsense for the moment... let's do it another way ...

    public interface Monoid<MA, A> where MA : struct, Monoid<MA, A>
    {
        A Empty();
        A Append(A x, A y);
    }

    public struct MString : Monoid<MString, string>
    {
        public string Append(string x, string y) => x + y;
        public string Empty() => "";
    }

    public struct MList<A> : Monoid<MList<A>, List<A>>
    {
        public List<A> Append(List<A> x, List<A> y) => x.Concat(y).ToList();
        public List<A> Empty() => new List<A>();
    }

    public static class Monoid
    {
        public static A Concat<MA, A>(IEnumerable<A> ma) where MA : struct, Monoid<MA, A> =>
            ma.Fold(default(MA).Empty(), default(MA).Append);
    }

    class Program
    {
        static void Main(string[] args)
        {
            var strs = new[] { "Hello", ",", " ", "World" };
            var lists = new[] { List.New(1, 2, 3), List.New(4, 5, 6) };

            var str = Monoid.Concat<MString, string>(strs);
            var list = Monoid.Concat<MList<int>, List<int>>(lists);
        }
    }

That is now safe in that `Concat` can't be implemented without the `struct` constraint, the code will fail to compile. Also the types that implement `Monoid<MA, A>` must be structs.

I'm out of this discussion now - because if you're still going to claim this is unsafe then you're clearly trolling and I haven't really got the motivation to keep feeding you.

lmm · on Feb 6, 2018

> The reference of default(A) isn't something that's passed around - yes the method you dispatch to has access to `this`, but what's the point of A: declaring a variable in a 'class instance' and B: using it when it's in an invalid state?

It's not something you'd deliberately do, but in any decent-sized codebase, everything the language permits will happen. If it's possible to exclude a given pitfall with a simple, local lint rule then you might be able to avoid it, but manual review of anything that can happen at a distance is doomed to failure.

> I don't call `((string)null).ToString()` because it's fucking stupid. But I assume in your world that means C# can't do method dispatch by reference?

Unless you can use a very simple set of local rules to avoid having that happen, yes. Fortunately, there is such a set of rules you can follow (namely never writing null, never using constructs that return null, and checking the return values of library calls for null immediately) and so null (barely) doesn't destroy the language completely.

> Just because somebody can do something stupid doesn't devalue any particular technique that requires you to not do the stupid thing.

If your technique makes it impossible to use simple rules to avoid doing the stupid thing, then yes, that does devalue the technique. Because at that point having the stupid thing happen in your codebase is just inevitable.

> Otherwise, you may as well delete C# as a language - because it's trivially easy to do stupid things.

I already did, thanks.

> In fact software engineering wouldn't even have gotten off the ground if that was a pre-requisite.

Nonsense. Typed lambda calculi predate mechanical computers and don't allow you to do anything stupid. We could've built software engineering on them.

> But clearly people do produce software in it - which proves your arguments wrong.

People produce software in C#, but it takes more effort and has higher defect rates than doing so in Haskell-like languages.

> Show me where it was mentioned in the original comment about safety? Not there is it.

It's implicit because a) Haskell is a safe-by-default language b) return type polymorphism without safety is completely trivial. In e.g. Python you can just have Concat return "", [], or something else; likewise you can do the same in C# if you're happy to cast. So clearly moomin can't miss just being able to have a function that returns "" or [], because what language could they possibly be working in where that would be impossible or even at all difficult?

> That is now safe in that `Concat` can't be implemented without the `struct` constraint, the code will fail to compile. Also the types that implement `Monoid<MA, A>` must be structs.

But a) I have to allow "default(MA)" expressions in my program, which means I have no way to ban the unsafe use of default() b) nothing stops an implementation of Monoid<MA, A> being a struct that contains a reference, in which case that reference will be null when the struct is initialized with default(). It doesn't solve the problem at all.

moomin · on Feb 5, 2018

That's why be put a struct constraint in. We'll quietly ignore the fact that non-primitive structs are as rare as hens teeth in most C# code-bases...

lmm · on Feb 5, 2018

So you might be able to use some kind of linter to enforce that you only ever call default(MA) where MA: struct, but even then, is it safe to assume that default(x) instantiates x in a valid state for all structs? Wouldn't that then mean that e.g. you couldn't ever use a struct containing a reference type anywhere in your codebase, since if you do then default(x) initialises that struct to contain a null reference, right?

louthy · on Feb 5, 2018

The structs used as ‘class instaces’ aren’t statefull.

The compiler will actually optimise out the ‘default’ also, so it’s as efficient as calling a static method.

Anyway, nobody is arguing that this is some perfect system, merely that return type polymorphism can be achieved _relatively_ painlessly in a language other than Haskell.

_pvxk · on Feb 5, 2018

OTOH, I feel it's taken too far in the regex packages. They feel extremely daunting for someone just getting started with Haskell. E.g. the wiki says that Text.Regex.TDFA is the thing to use. First try:

    λ> "abc" =~ "(a|b).*"

    <interactive>:1558:1-18:
        Non type-variable argument
        in the constraint: RegexContext
                            Text.Regex.TDFA.Regex source1 target
        (Use FlexibleContexts to permit this)
        When checking that ‘it’ has the inferred type
        it :: forall source1 target.
                (Data.String.IsString source1,
                RegexContext Text.Regex.TDFA.Regex source1 target) =>
                target

    <interactive>:1558:7-8:
        Could not deduce (RegexMaker
                            Text.Regex.TDFA.Regex CompOption ExecOption source0)
        arising from a use of ‘=~’
        from the context (Data.String.IsString source1,
                        RegexContext Text.Regex.TDFA.Regex source1 target)
        bound by the inferred type of
                it :: (Data.String.IsString source1,
                        RegexContext Text.Regex.TDFA.Regex source1 target) =>
                        target
        at <interactive>:1558:1-18
        The type variable ‘source0’ is ambiguous
        Note: there are several potential instances:
        instance RegexMaker
                    Text.Regex.TDFA.Regex
                    CompOption
                    ExecOption
                    Data.ByteString.Internal.ByteString
            -- Defined in ‘Text.Regex.TDFA.ByteString’
        instance RegexMaker
                    Text.Regex.TDFA.Regex
                    CompOption
                    ExecOption
                    Data.ByteString.Lazy.Internal.ByteString
            -- Defined in ‘Text.Regex.TDFA.ByteString.Lazy’
        instance RegexMaker
                    Text.Regex.TDFA.Regex
                    CompOption
                    ExecOption
                    (Data.Sequence.Seq Char)
            -- Defined in ‘Text.Regex.TDFA.Sequence’
        ...plus one other
        In the expression: "abc" =~ "(a|b).*"
        In an equation for ‘it’: it = "abc" =~ "(a|b).*"

    <interactive>:1558:10-18:
        Could not deduce (Data.String.IsString source0)
        arising from the literal ‘"(a|b).*"’
        from the context (Data.String.IsString source1,
                        RegexContext Text.Regex.TDFA.Regex source1 target)
        bound by the inferred type of
                it :: (Data.String.IsString source1,
                        RegexContext Text.Regex.TDFA.Regex source1 target) =>
                        target
        at <interactive>:1558:1-18
        The type variable ‘source0’ is ambiguous
        Note: there are several potential instances:
        instance Data.String.IsString
                    aeson-1.2.1.0:Data.Aeson.Types.Internal.Value
            -- Defined in ‘aeson-1.2.1.0:Data.Aeson.Types.Internal’
        instance Data.String.IsString
                    Data.ByteString.Builder.Internal.Builder
            -- Defined in ‘Data.ByteString.Builder’
        instance Data.String.IsString Data.ByteString.Internal.ByteString
            -- Defined in ‘Data.ByteString.Internal’
        ...plus 9 others
        In the second argument of ‘(=~)’, namely ‘"(a|b).*"’
        In the expression: "abc" =~ "(a|b).*"
        In an equation for ‘it’: it = "abc" =~ "(a|b).*"

Scary. Next most promising hit on ddg is bos' tutorial at http://www.serpentine.com/blog/2007/02/27/a-haskell-regular-... which says I should be able to get a list of results by specifying the context [String]:

    λ> "abc" =~ "(a|b).*" :: [String]

    <interactive>:1497:1-5:
        No instance for (Data.String.IsString source10)
        arising from the literal ‘"abc"’
        The type variable ‘source10’ is ambiguous
        Note: there are several potential instances:
        instance Data.String.IsString
                    aeson-1.2.1.0:Data.Aeson.Types.Internal.Value
            -- Defined in ‘aeson-1.2.1.0:Data.Aeson.Types.Internal’
        instance Data.String.IsString
                    Data.ByteString.Builder.Internal.Builder
            -- Defined in ‘Data.ByteString.Builder’
        instance Data.String.IsString Data.ByteString.Internal.ByteString
            -- Defined in ‘Data.ByteString.Internal’
        ...plus 7 others
        In the first argument of ‘(=~)’, namely ‘"abc"’
        In the expression: "abc" =~ "(a|b).*" :: [String]
        In an equation for ‘it’: it = "abc" =~ "(a|b).*" :: [String]

    <interactive>:1497:7-8:
        No instance for (RegexContext
                        Text.Regex.TDFA.Regex source10 [String])
        arising from a use of ‘=~’
        The type variable ‘source10’ is ambiguous
        Note: there is a potential instance available:
        instance RegexLike a b => RegexContext a b [[b]]
            -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
        In the expression: "abc" =~ "(a|b).*" :: [String]
        In an equation for ‘it’: it = "abc" =~ "(a|b).*" :: [String]

    <interactive>:1497:10-18:
        No instance for (Data.String.IsString source0)
        arising from the literal ‘"(a|b).*"’
        The type variable ‘source0’ is ambiguous
        Note: there are several potential instances:
        instance Data.String.IsString
                    aeson-1.2.1.0:Data.Aeson.Types.Internal.Value
            -- Defined in ‘aeson-1.2.1.0:Data.Aeson.Types.Internal’
        instance Data.String.IsString
                    Data.ByteString.Builder.Internal.Builder
            -- Defined in ‘Data.ByteString.Builder’
        instance Data.String.IsString Data.ByteString.Internal.ByteString
            -- Defined in ‘Data.ByteString.Internal’
        ...plus 7 others
        In the second argument of ‘(=~)’, namely ‘"(a|b).*"’
        In the expression: "abc" =~ "(a|b).*" :: [String]
        In an equation for ‘it’: it = "abc" =~ "(a|b).*" :: [String]

… so something about overloaded strings is confusing the type checker probably. Since I'm not a complete newbie I know I can specify string types explicitly:

    λ> ("abc"::String) =~ ("(a|b).*"::String) :: [String]

    <interactive>:1504:17-18:
        No instance for (RegexContext
                        Text.Regex.TDFA.Regex String [String])
        arising from a use of ‘=~’
        In the expression:
            ("abc" :: String) =~ ("(a|b).*" :: String) :: [String]
        In an equation for ‘it’:
            it = ("abc" :: String) =~ ("(a|b).*" :: String) :: [String]

That's … shorter, but still daunting. If I'm a bit more experienced, I know that I "just" have to do

    λ> :i RegexContext
    class RegexLike regex source =>
        RegexContext regex source target where
    match :: regex -> source -> target
    matchM :: Monad m => regex -> source -> m target
            -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.RegexLike’
    instance RegexContext Text.Regex.TDFA.Regex String String
    -- Defined in ‘Text.Regex.TDFA.String’
    instance RegexContext Text.Regex.Regex String String
    -- Defined in ‘Text.Regex.Posix.String’
    instance RegexLike a b => RegexContext a b [[b]]
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b (MatchResult b)
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b Int
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b Bool
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b =>
            RegexContext
            a b (AllTextSubmatches [] (b, (MatchOffset, MatchLength)))
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b (AllTextSubmatches [] b)
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b (AllTextMatches [] b)
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b =>
            RegexContext a b (AllSubmatches [] (MatchOffset, MatchLength))
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b =>
            RegexContext a b (AllMatches [] (MatchOffset, MatchLength))
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b (b, b, b, [b])
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b (b, b, b)
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b =>
            RegexContext a b (MatchOffset, MatchLength)
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’
    instance RegexLike a b => RegexContext a b ()
    -- Defined in ‘regex-base-0.93.2:Text.Regex.Base.Context’

and somewhere in that long list of instances I see that although [String] is not an instance, [[String]] is, so finally I see I can do

    λ> ("abc"::String) =~ ("(a|b).*"::String) :: [[String]]
    [ [ "abc" , "a" ] ]

Of course, the end result looks nice in the code, but it takes forever to discover the API when you're new to the system (and the docs have next to no examples).

tikhonj · on Feb 5, 2018

Most Haskellers I know think that library is too complicated—it's trying too hard to make Haskell look like Perl. I haven't seen it used in serious code and I never use it myself; chances are, if I want to parse anything even remotely complicated, a library like Parsec is a better bet.

marcosdumay · on Feb 5, 2018

It is daunting and those errors are very hard to follow even after you have some experience with them.

That said, there's a component of "you are using it wrong" on your problem. On the middle of some code, when your data is already in well typed variables, you just write:

    if text =~ "(a|b).*" then trueVal else falseVal

and it just works. It doesn't even matter if text is a String, Text, ByteString, or whatever. Also you just write:

    putStr . concat $ text =~ "(a|b).*

and again, it just works. It does not matter that this is a completely different usage.

tonyg · on Feb 5, 2018

The trick there is to use an explicitly polymorphic instance, which can "decide what it should have been all along" later on. I've written about the idea: https://eighty-twenty.org/2015/01/25/monads-in-dynamically-t.... (OO folklore has long had some form of this idea; in limited forms, you see it in Smalltalk images stretching back decades.)