In case you've wondered what a monoid is, that's a monoid. Something with an ass...

dllthomas · on Aug 12, 2020

Every monoid is a semigroup, but it's only a monoid if there is also a value that serves as an identity.

lliamander · on Aug 12, 2020

Yep. And if what you have is an Abelian Group, then you also get distributed computation as well (thanks to commutativity).

dllthomas · on Aug 12, 2020

While true, that's too strict. An Abelian group (like any group) needs inverses. You get distributed computation if you've got an Abelian semigroup.

lliamander · on Aug 12, 2020

Thanks for the correction. I think that in Avi Bryant's talk (that I linked to above) Stripe ended up using Abelian groups for some reason, rather than Abelian semigroups, though if so I forget the reason why.

dllthomas · on Aug 12, 2020

Inverses don't show up as much as I'd (aesthetically?) like in computing. There was an interesting application here: https://www.reddit.com/r/haskell/comments/9x684a/edward_kmet...

Koshkin · on Aug 12, 2020

To be fair, Abel did not know (or care) about semigroups.

dllthomas · on Aug 18, 2020

That matches my understanding, but the terminology is still (IME) common.

gnulinux · on Aug 12, 2020

You can distribute the computation on just a monoid as well but it needs more bookkeeping. In particular, your reduce function should know

* lhs is before rhs

* There is no data between lhs and rhs

dllthomas · on Aug 12, 2020

One way of looking at it is that equipping our data with that bookkeeping gives us something that commutes.

gnulinux · on Aug 12, 2020

Hmm sure, but it is not a requirement that your underlying algebraic structure should commute, so I think original phrasing was misleading. The bookkeeping allows you to commute a specific list of objects, even though the underlying operation is anti-commutative (i.e. exists a,b a.b != b.a).

At the moment of computation, you can build a new structure that commutes by enumerating the data. I guess it's true that you need a commuting intermediate data structure to be able to distribute.

dllthomas · on Aug 13, 2020

Yeah, I think it's informative that you "need commutativity" but important that you can build it yourself. It's nice (mostly from an efficiency standpoint, sometimes from a complexity stnadpoint) when you can get it "for free" because the underlying type is commutative, and the fact that you're shooting for commutativity can inform how you build and test the bookkeeping.

As an interesting nit, "anticommutative" specifically means that a.b = -(b.a), which is different than simply not being commutative.

A group might be commutative, anticommutative, neither, or even both (trivially true of the empty group and the group with one element, but I think it can be true of larger groups).

gnulinux · on Aug 13, 2020

Ah, my terminology is rusty as it's been years since my math classes. I guess I need to review my algebra books again. I do have some personal code that called "exists a,b a.b != b.a" anticommutative, didn't even realize that's wrong terminology!

lliamander · on Aug 12, 2020

One of the ideas in the talk I link is how you can represent typically non-commutative data (like averages) in a data structure that does support commutative operations (numerator/denominator pair) and to take advantage of generic analytics infrastructure.