Haskell (programming language)
Paradigm | Functional |
---|---|
Designed by | Lennart Augustsson, Dave Barton, Brian Boutel, Warren Burton, Joseph Fasel, Kevin Hammond, Ralf Hinze, Paul Hudak, John Hughes, Thomas Johnsson, Mark Jones, Simon Peyton Jones, John Launchbury, Erik Meijer, John Peterson, Alastair Reid, Colin Runciman, Philip Wadler |
First appeared | 1990[1] |
Stable release | |
Preview release | |
Typing discipline | Inferred, static, strong |
OS | Cross-platform |
Filename extensions | .hs, .lhs |
Website | |
Major implementations | GHC, Hugs, NHC, JHC, Yhc, UHC |
Dialects | Helium, Gofer |
Influenced by | Clean,[4] FP,[4] Gofer,[4] Hope and Hope+,[4] Id,[4] ISWIM,[4] KRC,[4] Lisp,[4] Miranda,[4] ML and Standard ML,[4] Orwell, SASL,[4] Scheme,[4] SISAL[4] |
Influenced | Agda,[5] Bluespec,[6] C++11/Concepts,[7] C#/LINQ,[8][9][10][11] CAL, Cayenne,[8] Clean,[8] Clojure,[12] CoffeeScript,[13] Curry,[8] Elm, Epigram, Escher,[14] F#,[15] Frege,[16] Hack,[17] Idris,[18] Isabelle,[8] Java/Generics,[8] LiveScript,[19] Mercury,[8] Ωmega, Perl 6,[20] PureScript,[21] Python,[8][22] Rust,[23] Scala,[8][24] Swift,[25] Timber,[26] Visual Basic 9.0[8][9] |
Haskell /ˈhæskəl/[27] is a statically typed, purely functional programming language with type inference and lazy evaluation.[28][29] Type classes, which enable type-safe operator overloading, were first proposed by Philip Wadler and Stephen Blott for Standard ML and implemented later in Haskell.[30][31] Its main implementation is the Glasgow Haskell Compiler. It is named after logician Haskell Curry.[1]
History
Following the release of Miranda by Research Software Ltd. in 1985, interest in lazy functional languages grew. By 1987, more than a dozen non-strict, purely functional programming languages existed. Miranda was the most widely used, but it was proprietary software. At the conference on Functional Programming Languages and Computer Architecture (FPCA '87) in Portland, Oregon, there was a strong consensus that a committee be formed to define an open standard for such languages. The committee's purpose was to consolidate existing functional languages into a common one to serve as a basis for future research in functional-language design.[38]
Haskell 1.0 to 1.4
Type classes originated in Haskell
The first version of Haskell ("Haskell 1.0") was defined in 1990.[1] The committee's efforts resulted in a series of language definitions (1.0, 1.1, 1.2, 1.3, 1.4).
Haskell 98
In late 1997, the series culminated in Haskell 98, intended to specify a stable, minimal, portable version of the language and an accompanying standard library for teaching, and as a base for future extensions. The committee expressly welcomed the creation of extensions and variants of Haskell 98 that add and incorporate experimental features.[38]
In February 1999, the Haskell 98 language standard was originally published as The Haskell 98 Report.[38] In January 2003, a revised version was published as Haskell 98 Language and Libraries: The Revised Report.[29] The language continues to evolve rapidly, with the Glasgow Haskell Compiler (GHC) implementation representing the current de facto standard.[39]
Haskell 2010
Haskell supports both sum types and product types
In early 2006, the process of defining a successor to the Haskell 98 standard, informally named Haskell Prime, began.[40] This was intended to be an ongoing incremental process to revise the language definition, producing a new revision up to once per year. The first revision, named Haskell 2010, was announced in November 2009[2] and published in July 2010.
Haskell 2010 is an incremental update to the language, mostly incorporating several well-used and uncontroversial features previously enabled via compiler-specific flags.
Hierarchical module names. Module names are allowed to consist of dot-separated sequences of capitalised identifiers, rather than only one such identifier. This lets modules be named in a hierarchical manner (e.g., Data.List instead of List), although technically modules are still in a single monolithic namespace. This extension was specified in an addendum to Haskell 98 and was in practice universally used.
The foreign function interface (FFI) allows bindings to other programming languages. Only bindings to C are specified in the Report, but the design allows for other language bindings. To support this, data type declarations were permitted to contain no constructors, enabling robust nonce types for foreign data that could not be constructed in Haskell. This extension was also previously specified in an Addendum to the Haskell 98 Report and widely used.
So-called n+k patterns (definitions of the form fact (n+1) = (n+1) * fact n) were no longer allowed. This syntactic sugar had misleading semantics, in which the code looked like it used the (+) operator, but in fact desugared to code using (-) and (>=).
The rules of type inference were relaxed to allow more programs to type check.
Some syntax issues (changes in the formal grammar) were fixed: pattern guards were added, allowing pattern matching within guards; resolution of operator fixity was specified in a simpler way that reflected actual practice; an edge case in the interaction of the language's lexical syntax of operators and comments was addressed; and the interaction of do-notation and if-then-else was tweaked to eliminate unexpected syntax errors.
The LANGUAGE pragma was specified. By 2010 dozens of extensions to the language were in wide use, and GHC (among other compilers) provided the LANGUAGE pragma to specify individual extensions with a list of identifiers. Haskell 2010 compilers are required to support the Haskell2010 extension, and are encouraged to support several others which correspond to extensions added in Haskell 2010.
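Several of the changes listed above can be seen together in one small module. The following is a sketch only: the names c_sin, OpaqueHandle, sumBoth and fact are illustrative, and the binding to C's sin function assumes a C math library is available at link time, as it normally is with GHC.

```haskell
{-# LANGUAGE Haskell2010 #-}
-- The LANGUAGE pragma selects the Haskell 2010 language level.
module Main where

import Data.Char (toUpper)          -- hierarchical module name
import Foreign.C.Types (CDouble)    -- C-compatible numeric type for the FFI

-- Foreign function interface: bind the C function sin from math.h.
-- (c_sin is an illustrative name chosen for this sketch.)
foreign import ccall unsafe "math.h sin"
  c_sin :: CDouble -> CDouble

-- A data type with no constructors: values of this type cannot be built
-- in Haskell, which suits opaque foreign data. (Hypothetical name.)
data OpaqueHandle

-- Pattern guards: pattern matching inside guards.
sumBoth :: [(String, Int)] -> String -> String -> Maybe Int
sumBoth env a b
  | Just x <- lookup a env
  , Just y <- lookup b env = Just (x + y)
  | otherwise              = Nothing

-- With n+k patterns gone, the factorial is written with explicit
-- subtraction and comparison instead of fact (n+1) = (n+1) * fact n.
fact :: Integer -> Integer
fact 0 = 1
fact n
  | n >= 1 = n * fact (n - 1)
fact _ = error "fact: negative argument"

main :: IO ()
main = do
  putStrLn (map toUpper "haskell 2010")
  print (c_sin 0)
  print (sumBoth [("a", 1), ("b", 2)] "a" "b")
  print (fact 5)
```

The explicit guard n >= 1 makes visible the comparison that the old n+k syntax performed implicitly.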
Features
Haskell features lazy evaluation of expressions, that is, 'call by need'
Haskell features lazy evaluation, lambda expressions, pattern matching, list comprehension, type classes and type polymorphism. It is a purely functional language, which means that functions generally have no side effects. A distinct construct exists to represent side effects, orthogonal to the type of functions. A pure function can return a side effect that is subsequently executed, modeling the impure functions of other languages.
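The features named above can be illustrated with a short sketch. All names in it (oddSquares, describe, Shape, Circle, Rect, applyTwice) are invented for this example and do not come from any library.

```haskell
module Main where

-- Lazy evaluation and list comprehension: an infinite list is fine
-- as long as only a finite part of it is ever demanded.
oddSquares :: [Integer]
oddSquares = [x * x | x <- [1 ..], odd x]

-- Pattern matching on the shape of a list.
describe :: [a] -> String
describe []  = "empty"
describe [_] = "singleton"
describe _   = "longer"

-- A type class with two instances (type-safe overloading of area).
class Shape a where
  area :: a -> Double

data Circle = Circle Double
data Rect   = Rect Double Double

instance Shape Circle where
  area (Circle r) = pi * r * r

instance Shape Rect where
  area (Rect w h) = w * h

-- Type polymorphism: works for any type a.
applyTwice :: (a -> a) -> a -> a
applyTwice f = f . f

main :: IO ()
main = do
  print (take 5 oddSquares)             -- [1,9,25,49,81]
  putStrLn (describe "x")               -- singleton
  print (area (Rect 2 3))               -- 6.0
  print (applyTwice (\n -> n * 2) 5)    -- lambda expression; prints 20
```

Because oddSquares is evaluated lazily, take 5 forces only the first five elements of the infinite list.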
The construct that represents side effects is an example of a monad. Monads are a general framework that can model different kinds of computation, including error handling, nondeterminism, parsing and software transactional memory. Monads are defined as ordinary datatypes, but Haskell provides some syntactic sugar for their use.
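As a minimal sketch of these ideas, assuming nothing beyond the standard base library, the example below uses the Maybe monad for error handling and treats I/O actions as ordinary values. The function names safeDiv, parseAndDivide and greet are illustrative.

```haskell
module Main where

import Text.Read (readMaybe)

-- Error handling with the Maybe monad: division that can fail.
safeDiv :: Int -> Int -> Maybe Int
safeDiv _ 0 = Nothing
safeDiv x y = Just (x `div` y)

-- do-notation is the syntactic sugar for monads; any failing step
-- makes the whole computation Nothing.
parseAndDivide :: String -> String -> Maybe Int
parseAndDivide a b = do
  x <- readMaybe a
  y <- readMaybe b
  safeDiv x y

-- A pure function that returns an I/O action. Calling greet prints
-- nothing; the action runs only when main executes it.
greet :: String -> IO ()
greet name = putStrLn ("Hello, " ++ name)

main :: IO ()
main = do
  let actions = map greet ["Ada", "Haskell"]  -- a list of actions, not yet run
  sequence_ actions                           -- run them in order
  print (parseAndDivide "10" "2")             -- Just 5
  print (parseAndDivide "10" "0")             -- Nothing
  print (parseAndDivide "ten" "2")            -- Nothing
```

Building the list of actions has no visible effect; output appears only when sequence_ runs them as part of main.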
Haskell has an open, published specification,[29] and multiple implementations exist. Its main implementation, the Glasgow Haskell Compiler (GHC), is both an interpreter and native-code compiler that runs on most platforms. GHC is noted for its rich type system incorporating recent innovations such as generalized algebraic data types and type families. The Computer Language Benchmarks Game also highlights its high-performance implementation of concurrency and parallelism.[43]
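To give a flavour of generalized algebraic data types, here is a small sketch of a typed expression evaluator. It relies on GHC's GADTs language extension, so it is not standard Haskell 2010, and the Expr type and its constructors are invented for illustration.

```haskell
{-# LANGUAGE GADTs #-}
module Main where

-- Each constructor fixes the type index of the expression it builds,
-- so ill-typed expressions such as Add (BoolE True) (IntE 1) are
-- rejected by the type checker.
data Expr a where
  IntE   :: Int  -> Expr Int
  BoolE  :: Bool -> Expr Bool
  Add    :: Expr Int -> Expr Int -> Expr Int
  IsZero :: Expr Int -> Expr Bool
  If     :: Expr Bool -> Expr a -> Expr a -> Expr a

-- The evaluator needs no runtime type checks or tagging: matching on
-- a constructor tells the compiler what the result type a is.
eval :: Expr a -> a
eval (IntE n)   = n
eval (BoolE b)  = b
eval (Add x y)  = eval x + eval y
eval (IsZero e) = eval e == 0
eval (If c t e) = if eval c then eval t else eval e

main :: IO ()
main =
  print (eval (If (IsZero (Add (IntE 1) (IntE (-1)))) (IntE 10) (IntE 20)))  -- 10
```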
An active, growing community exists around the language, and more than 5,400 third-party open-source libraries and tools are available in the online package repository Hackage.[44]
Code examples
A "Hello, World!" program in Haskell (only the last line is strictly necessary):
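A minimal version of such a program, assuming the conventional module name Main, might look like this:

```haskell
module Main where

main :: IO ()
main = putStrLn "Hello, World!"
```

The module header and the type signature can be omitted, since the compiler assumes the module name and infers the type.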
The factorial function in Haskell, defined in a few different ways:
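Three common formulations are sketched here. The names factorialRec and factorialFold are illustrative labels for the variants, and all three are intended to agree on non-negative arguments.

```haskell
module Main where

-- Using the standard product function over a range.
factorial :: Integer -> Integer
factorial n = product [1 .. n]

-- Using explicit recursion with guards.
factorialRec :: Integer -> Integer
factorialRec n
  | n < 2     = 1
  | otherwise = n * factorialRec (n - 1)

-- Using a right fold over the range.
factorialFold :: Integer -> Integer
factorialFold n = foldr (*) 1 [1 .. n]

main :: IO ()
main = do
  print (factorial 20)       -- 2432902008176640000
  print (factorialRec 20)
  print (factorialFold 20)
```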
As the Integer type has arbitrary precision, this code will compute values such as factorial 100000 (a 456,574-digit number), with no loss of precision.
An implementation of an algorithm similar to quick sort over lists, where the first element is taken as the pivot:
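One way to write such a function is sketched below; the name quickSort and the local names lesser and greater are chosen for this example. Unlike the classic in-place algorithm, this version builds new lists rather than swapping elements in an array.

```haskell
module Main where

-- Sort a list of any ordered type, using the head as the pivot.
quickSort :: Ord a => [a] -> [a]
quickSort []       = []
quickSort (p : xs) = quickSort lesser ++ [p] ++ quickSort greater
  where
    lesser  = filter (< p) xs    -- elements strictly smaller than the pivot
    greater = filter (>= p) xs   -- elements greater than or equal to the pivot

main :: IO ()
main = print (quickSort [3, 1, 4, 1, 5, 9, 2, 6 :: Int])  -- [1,1,2,3,4,5,6,9]
```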
The Haskell logo at the top of this page was generated by Haskell code.[45]
Implementations
All listed implementations are distributed under open source licenses.[39]
Implementations that fully or nearly comply with the Haskell 98 standard include:
The Glasgow Haskell Compiler (GHC) compiles to native code on many different processor architectures, and to ANSI C, via one of two intermediate languages: C--, or in more recent versions, LLVM (formerly Low Level Virtual Machine) bitcode.[47][48] GHC has become the de facto standard Haskell dialect.[49] There are libraries (e.g., bindings to OpenGL) that work only with GHC. GHC is also distributed with the Haskell Platform.
The Utrecht Haskell Compiler (UHC) is a Haskell implementation from Utrecht University.[50] It supports almost all Haskell 98 features plus many experimental extensions. It is implemented using attribute grammars and is currently used mostly for research on generated type systems and language extensions.
Jhc, a Haskell compiler written by John Meacham, emphasizes speed and efficiency of generated programs and exploring new program transformations. Ajhc is a fork of Jhc.
LHC is a whole-program optimizing backend for GHC, based on Urban Boquist’s compiler intermediate language, GRIN.[51] Older versions of LHC were based on Jhc rather than GHC.
Implementations no longer actively maintained include:
The Haskell User's Gofer System (Hugs) is a bytecode interpreter. It was once one of the implementations used most widely, alongside the GHC compiler,[52] but has now been mostly replaced by GHCi. It also comes with a graphics library.
nhc98 is a bytecode compiler focusing on minimizing memory use. The York Haskell Compiler (Yhc) was a fork of nhc98, with the goals of being simpler, more portable and efficient, and integrating support for Hat, the Haskell tracer. It also had a JavaScript backend, allowing users to run Haskell programs in web browsers.
HBC is an early implementation supporting Haskell 1.4. It was implemented by Lennart Augustsson in, and based on, Lazy ML. It has not been actively developed for some time.
Implementations not fully Haskell 98 compliant, and using a variant Haskell language, include:
Gofer was an educational dialect of Haskell, with a feature called constructor classes, developed by Mark Jones. It was supplanted by Hugs (see Haskell User's Gofer System, above).
Helium is a newer dialect of Haskell. The focus is on making learning easier via clearer error messages. It currently lacks full support for type classes, rendering it incompatible with many Haskell programs.
Eta and Frege are dialects of Haskell targeting the Java Virtual Machine.
Applications
Darcs is a revision control system written in Haskell, with several innovative features, such as more precise control of patches to apply.
Cabal is a tool for building and packaging Haskell libraries and programs.[53]
Linspire GNU/Linux chose Haskell for system tools development.[54]
Xmonad is a window manager for the X Window System, written fully in Haskell.[55]
Git-annex is a tool to manage (big) data files under Git version control. It also provides a distributed file synchronization system (git-annex assistant).
GHC is also often a testbed for advanced functional programming features and optimizations in other programming languages.
Pandoc is a tool to convert one markup format into another.
Shake is a build system that aims to be reliable, robust, and fast.[56]
ShellCheck is a static analysis tool for shell scripts.[57]
Industry
Facebook implements its anti-spam programs[58] in Haskell, as open-source software.[59]
Swift Navigation, a high precision GPS manufacturer, implements significant portions of its product in Haskell, providing some open source software.[60]
Bluespec SystemVerilog (BSV) is a language for semiconductor design that is an extension of Haskell. Also, Bluespec, Inc.'s tools are implemented in Haskell.
GitHub implemented Semantic, an open-source library for analysis, diffing, and interpretation of untrusted source code, in Haskell.[61]
Cryptol, a language and toolchain for developing and verifying cryptography algorithms, is implemented in Haskell.
Ampersand,[62] a language[63] and toolchain[64] for building information systems with reactive behaviour, is being used by Ordina and TNO (both in the Netherlands) for purposes such as conceptual analysis, analysis of ArchiMate models, legal analysis and prototyping large information systems. The compiler[65] is written in Haskell. Web applications realised with Ampersand include the Semantic Treehouse[66] and RAP3.[67]
seL4, the first formally verified microkernel,[68] used Haskell as a prototyping language for the OS developer.[68] At the same time, the Haskell code defined an executable specification with which to reason, for automatic translation by the theorem-proving tool.[68] The Haskell code thus served as an intermediate prototype before final C refinement.[68]
Criticism
Jan-Willem Maessen, in 2002, and Simon Peyton Jones, in 2003, discussed problems associated with lazy evaluation while also acknowledging the theoretical motives for it.[71][72] Beyond purely practical considerations such as improved performance,[73] they note that lazy evaluation, besides adding some performance overhead, makes it more difficult for programmers to reason about the performance of their code (particularly its space use).
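A commonly cited illustration of the space-use concern is summing a long list with a lazy left fold. The sketch below uses illustrative names (lazySum, strictSum); whether the lazy version actually consumes more memory depends on the compiler and its optimisation settings, since a strictness analysis may remove the difference.

```haskell
module Main where

import Data.List (foldl')

-- foldl is lazy in its accumulator: without optimisation it can build
-- a long chain of unevaluated additions before computing anything.
lazySum :: [Integer] -> Integer
lazySum = foldl (+) 0

-- foldl' forces the accumulator at every step, so the running total
-- is kept as an evaluated number.
strictSum :: [Integer] -> Integer
strictSum = foldl' (+) 0

main :: IO ()
main = do
  print (lazySum [1 .. 1000000])    -- 500000500000
  print (strictSum [1 .. 1000000])  -- 500000500000
```

Both functions return the same result; they differ only in when the intermediate additions are evaluated, which is why the difference is hard to see from the source text alone.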
Bastiaan Heeren, Daan Leijen, and Arjan van IJzendoorn in 2003 also observed some stumbling blocks for Haskell learners: "The subtle syntax and sophisticated type system of Haskell are a double edged sword – highly appreciated by experienced programmers but also a source of frustration among beginners, since the generality of Haskell often leads to cryptic error messages."[74] To address these, researchers from Utrecht University developed an advanced interpreter called Helium, which improved the user-friendliness of error messages by limiting the generality of some Haskell features, and in particular removing support for type classes.
Ben Lippmeier designed Disciple[75] as a strict-by-default (lazy by explicit annotation) dialect of Haskell with a type-and-effect system, to address Haskell's difficulties in reasoning about lazy evaluation and in using traditional data structures such as mutable arrays.[76] He argues (p. 20) that "destructive update furnishes the programmer with two important and powerful tools... a set of efficient array-like data structures for managing collections of objects, and... the ability to broadcast a new value to all parts of a program with minimal burden on the programmer."
Robert Harper, one of the authors of Standard ML, has given his reasons for not using Haskell to teach introductory programming. Among these are the difficulty of reasoning about resource use with non-strict evaluation, that lazy evaluation complicates the definition of data types and inductive reasoning,[77] and the "inferiority" of Haskell's (old) class system compared to ML's module system.[78]
Haskell's build tool, Cabal, has historically been criticised for poorly handling multiple versions of the same library, a problem known as "Cabal hell". The Stackage server and Stack build tool were made in response to these criticisms.[79]
Related languages
Clean is a close, slightly older relative of Haskell. Its biggest deviation from Haskell is in the use of uniqueness types instead of monads for I/O and side-effects.
A series of languages inspired by Haskell, but with different type systems, have been developed, including:
Agda, a functional language with dependent types.
Idris, a general purpose functional language with dependent types, developed at the University of St Andrews.
Epigram, a functional language with dependent types suitable for proving properties of programs.
Cayenne, with dependent types.
Ωmega, strict and more.
Elm, a functional language for creating web front-end apps, without support for higher-kinded types.
Java virtual machine (JVM) based:
Other related languages include:
Curry, a functional/logic programming language based on Haskell.
Haskell has served as a testbed for many new ideas in language design.
There have been many Haskell variants produced, exploring new language ideas, including:
Parallel Haskell: a version from Glasgow University supports clusters of machines or single multiprocessors;[84][85] Haskell itself also supports symmetric multiprocessor parallelism.[86] Another version comes from MIT.[87]
Distributed Haskell (formerly Goffin) and Eden.
Eager Haskell, based on speculative evaluation.
Several object-oriented versions: Haskell++, and Mondrian.
Generic Haskell, a version of Haskell with type system support for generic programming.
O'Haskell, an extension of Haskell adding object-orientation and concurrent programming support that "has... been superseded by Timber."[88]
Disciple, a strict-by-default (laziness available by annotation) dialect of Haskell that supports destructive update, computational effects, type directed field projections and allied functional aspects.
Hume, a strict functional language for embedded systems. It is based on processes modelled as stateless automata over tuples of single-element mailbox channels, where state is kept by feeding outputs back into the mailboxes, and on a box-wiring description that maps outputs to channels. Its expression language and syntax are Haskell-like.
Conferences and workshops
The Haskell community meets regularly for research and development activities.
The main events are:
International Conference on Functional Programming (ICFP)
Haskell Symposium (formerly the Haskell Workshop)
Haskell Implementors Workshop
Commercial Users of Functional Programming (CUFP)
Since 2006, a series of organized hackathons has occurred, the Hac series, aimed at improving the programming language tools and libraries.[89]