1
0
mirror of https://github.com/alecthomas/chroma.git synced 2025-11-23 22:24:39 +02:00
Go to file
renovate[bot] 0e031c7fbf chore(deps): update all non-major dependencies (#1108)
This PR contains the following updates:

| Package | Type | Update | Change | Age | Confidence |
|---|---|---|---|---|---|
| [esbuild](https://redirect.github.com/evanw/esbuild) | | patch |
`0.25.6` -> `0.25.8` |
[![age](https://developer.mend.io/api/mc/badges/age/hermit/esbuild/0.25.8?slim=true)](https://docs.renovatebot.com/merge-confidence/)
|
[![confidence](https://developer.mend.io/api/mc/badges/confidence/hermit/esbuild/0.25.6/0.25.8?slim=true)](https://docs.renovatebot.com/merge-confidence/)
|
|
[github.com/alecthomas/kong](https://redirect.github.com/alecthomas/kong)
| require | patch | `v1.12.0` -> `v1.12.1` |
[![age](https://developer.mend.io/api/mc/badges/age/go/github.com%2falecthomas%2fkong/v1.12.1?slim=true)](https://docs.renovatebot.com/merge-confidence/)
|
[![confidence](https://developer.mend.io/api/mc/badges/confidence/go/github.com%2falecthomas%2fkong/v1.12.0/v1.12.1?slim=true)](https://docs.renovatebot.com/merge-confidence/)
|

---

### Release Notes

<details>
<summary>evanw/esbuild (esbuild)</summary>

###
[`v0.25.8`](https://redirect.github.com/evanw/esbuild/blob/HEAD/CHANGELOG.md#0258)

[Compare
Source](https://redirect.github.com/evanw/esbuild/compare/v0.25.7...v0.25.8)

- Fix another TypeScript parsing edge case
([#&#8203;4248](https://redirect.github.com/evanw/esbuild/issues/4248))

This fixes a regression with a change in the previous release that tries
to more accurately parse TypeScript arrow functions inside the `?:`
operator. The regression specifically involves parsing an arrow function
containing a `#private` identifier inside the middle of a `?:` ternary
operator inside a class body. This was fixed by propagating private
identifier state into the parser clone used to speculatively parse the
arrow function body. Here is an example of some affected code:

  ```ts
  class CachedDict {
    #has = (a: string) => dict.has(a);
    has = window
      ? (word: string): boolean => this.#has(word)
      : this.#has;
  }
  ```

- Fix a regression with the parsing of source phase imports

The change in the previous release to parse [source phase
imports](https://redirect.github.com/tc39/proposal-source-phase-imports)
failed to properly handle the following cases:

  ```ts
  import source from 'bar'
  import source from from 'bar'
  import source type foo from 'bar'
  ```

Parsing for these cases should now be fixed. The first case was
incorrectly treated as a syntax error because esbuild was expecting the
second case. And the last case was previously allowed but is now
forbidden. TypeScript hasn't added this feature yet so it remains to be
seen whether the last case will be allowed, but it's safer to disallow
it for now. At least Babel doesn't allow the last case when parsing
TypeScript, and Babel was involved with the source phase import
specification.

###
[`v0.25.7`](https://redirect.github.com/evanw/esbuild/blob/HEAD/CHANGELOG.md#0257)

[Compare
Source](https://redirect.github.com/evanw/esbuild/compare/v0.25.6...v0.25.7)

- Parse and print JavaScript imports with an explicit phase
([#&#8203;4238](https://redirect.github.com/evanw/esbuild/issues/4238))

This release adds basic syntax support for the `defer` and `source`
import phases in JavaScript:

  - `defer`

This is a [stage 3
proposal](https://redirect.github.com/tc39/proposal-defer-import-eval)
for an upcoming JavaScript feature that will provide one way to eagerly
load but lazily initialize imported modules. The imported module is
automatically initialized on first use. Support for this syntax will
also be part of the upcoming release of [TypeScript
5.9](https://devblogs.microsoft.com/typescript/announcing-typescript-5-9-beta/#support-for-import-defer).
The syntax looks like this:

    ```js
    import defer * as foo from "<specifier>";
    const bar = await import.defer("<specifier>");
    ```

Note that this feature deliberately cannot be used with the syntax
`import defer foo from "<specifier>"` or `import defer { foo } from
"<specifier>"`.

  - `source`

This is a [stage 3
proposal](https://redirect.github.com/tc39/proposal-source-phase-imports)
for an upcoming JavaScript feature that will provide another way to
eagerly load but lazily initialize imported modules. The imported module
is returned in an uninitialized state. Support for this syntax may or
may not be a part of TypeScript 5.9 (see [this
issue](https://redirect.github.com/microsoft/TypeScript/issues/61216)
for details). The syntax looks like this:

    ```js
    import source foo from "<specifier>";
    const bar = await import.source("<specifier>");
    ```

Note that this feature deliberately cannot be used with the syntax
`import defer * as foo from "<specifier>"` or `import defer { foo } from
"<specifier>"`.

This change only adds support for this syntax. These imports cannot
currently be bundled by esbuild. To use these new features with
esbuild's bundler, the imported paths must be external to the bundle and
the output format must be set to `esm`.

- Support optionally emitting absolute paths instead of relative paths
([#&#8203;338](https://redirect.github.com/evanw/esbuild/issues/338),
[#&#8203;2082](https://redirect.github.com/evanw/esbuild/issues/2082),
[#&#8203;3023](https://redirect.github.com/evanw/esbuild/issues/3023))

This release introduces the `--abs-paths=` feature which takes a
comma-separated list of situations where esbuild should use absolute
paths instead of relative paths. There are currently three supported
situations: `code` (comments and string literals), `log` (log message
text and location info), and `metafile` (the JSON build metadata).

Using absolute paths instead of relative paths is not the default
behavior because it means that the build results are no longer
machine-independent (which means builds are no longer reproducible).
Absolute paths can be useful when used with certain terminal emulators
that allow you to click on absolute paths in the terminal text and/or
when esbuild is being automatically invoked from several different
directories within the same script.

- Fix a TypeScript parsing edge case
([#&#8203;4241](https://redirect.github.com/evanw/esbuild/issues/4241))

This release fixes an edge case with parsing an arrow function in
TypeScript with a return type that's in the middle of a `?:` ternary
operator. For example:

  ```ts
  x = a ? (b) : c => d;
  y = a ? (b) : c => d : e;
  ```

The `:` token in the value assigned to `x` pairs with the `?` token, so
it's not the start of a return type annotation. However, the first `:`
token in the value assigned to `y` is the start of a return type
annotation because after parsing the arrow function body, it turns out
there's another `:` token that can be used to pair with the `?` token.
This case is notable as it's the first TypeScript edge case that esbuild
has needed a backtracking parser to parse. It has been addressed by a
quick hack (cloning the whole parser) as it's a rare edge case and
esbuild doesn't otherwise need a backtracking parser. Hopefully this is
sufficient and doesn't cause any issues.

- Inline small constant strings when minifying

Previously esbuild's minifier didn't inline string constants because
strings can be arbitrarily long, and this isn't necessarily a size win
if the string is used more than once. Starting with this release,
esbuild will now inline string constants when the length of the string
is three code units or less. For example:

  ```js
  // Original code
  const foo = 'foo'
  console.log({ [foo]: true })

  // Old output (with --minify --bundle --format=esm)
  var o="foo";console.log({[o]:!0});

  // New output (with --minify --bundle --format=esm)
  console.log({foo:!0});
  ```

Note that esbuild's constant inlining only happens in very restrictive
scenarios to avoid issues with TDZ handling. This change doesn't change
when esbuild's constant inlining happens. It only expands the scope of
it to include certain string literals in addition to numeric and boolean
literals.

</details>

<details>
<summary>alecthomas/kong (github.com/alecthomas/kong)</summary>

###
[`v1.12.1`](https://redirect.github.com/alecthomas/kong/compare/v1.12.0...v1.12.1)

[Compare
Source](https://redirect.github.com/alecthomas/kong/compare/v1.12.0...v1.12.1)

</details>

---

### Configuration

📅 **Schedule**: Branch creation - Between 12:00 AM and 03:59 AM, only on
Monday ( * 0-3 * * 1 ) (UTC), Automerge - At any time (no schedule
defined).

🚦 **Automerge**: Disabled by config. Please merge this manually once you
are satisfied.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

👻 **Immortal**: This PR will be recreated if closed unmerged. Get
[config
help](https://redirect.github.com/renovatebot/renovate/discussions) if
that's undesired.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR was generated by [Mend Renovate](https://mend.io/renovate/).
View the [repository job
log](https://developer.mend.io/github/alecthomas/chroma).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiI0MS4yMy4yIiwidXBkYXRlZEluVmVyIjoiNDEuMjMuMiIsInRhcmdldEJyYW5jaCI6Im1hc3RlciIsImxhYmVscyI6W119-->

Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
2025-07-22 08:00:34 +10:00
2025-07-01 09:59:45 +10:00
2025-07-16 07:46:57 +10:00
2025-07-01 09:59:45 +10:00
2025-07-01 10:18:33 +10:00
2025-07-01 10:18:33 +10:00
2025-07-01 09:59:45 +10:00

Chroma

A general purpose syntax highlighter in pure Go

Go Reference CI Slack chat

Chroma takes source code and other structured text and converts it into syntax highlighted HTML, ANSI-coloured text, etc.

Chroma is based heavily on Pygments, and includes translators for Pygments lexers and styles.

Table of Contents

  1. Supported languages
  2. Try it
  3. Using the library
    1. Quick start
    2. Identifying the language
    3. Formatting the output
    4. The HTML formatter
  4. More detail
    1. Lexers
    2. Formatters
    3. Styles
  5. Command-line interface
  6. Testing lexers
  7. What's missing compared to Pygments?

Supported languages

Prefix Language
A ABAP, ABNF, ActionScript, ActionScript 3, Ada, Agda, AL, Alloy, Angular2, ANTLR, ApacheConf, APL, AppleScript, ArangoDB AQL, Arduino, ArmAsm, AutoHotkey, AutoIt, Awk
B Ballerina, Bash, Bash Session, Batchfile, BibTeX, Bicep, BlitzBasic, BNF, BQN, Brainfuck
C C, C#, C++, Caddyfile, Caddyfile Directives, Cap'n Proto, Cassandra CQL, Ceylon, CFEngine3, cfstatement, ChaiScript, Chapel, Cheetah, Clojure, CMake, COBOL, CoffeeScript, Common Lisp, Coq, Crystal, CSS, Cython
D D, Dart, Dax, Desktop Entry, Diff, Django/Jinja, dns, Docker, DTD, Dylan
E EBNF, Elixir, Elm, EmacsLisp, Erlang
F Factor, Fennel, Fish, Forth, Fortran, FortranFixed, FSharp
G GAS, GDScript, Genshi, Genshi HTML, Genshi Text, Gherkin, Gleam, GLSL, Gnuplot, Go, Go HTML Template, Go Text Template, GraphQL, Groff, Groovy
H Handlebars, Hare, Haskell, Haxe, HCL, Hexdump, HLB, HLSL, HolyC, HTML, HTTP, Hy
I Idris, Igor, INI, Io, ISCdhcpd
J J, Java, JavaScript, JSON, Jsonnet, Julia, Jungle
K Kotlin
L Lean, Lighttpd configuration file, LLVM, Lua
M Makefile, Mako, markdown, Mason, Materialize SQL dialect, Mathematica, Matlab, MCFunction, Meson, Metal, MiniZinc, MLIR, Modula-2, Mojo, MonkeyC, MorrowindScript, Myghty, MySQL
N NASM, Natural, Newspeak, Nginx configuration file, Nim, Nix, NSIS
O Objective-C, OCaml, Octave, Odin, OnesEnterprise, OpenEdge ABL, OpenSCAD, Org Mode
P PacmanConf, Perl, PHP, PHTML, Pig, PkgConfig, PL/pgSQL, plaintext, Plutus Core, Pony, PostgreSQL SQL dialect, PostScript, POVRay, PowerQuery, PowerShell, Prolog, PromQL, Promela, properties, Protocol Buffer, PRQL, PSL, Puppet, Python, Python 2
Q QBasic, QML
R R, Racket, Ragel, Raku, react, ReasonML, reg, Rego, reStructuredText, Rexx, RPMSpec, Ruby, Rust
S SAS, Sass, Scala, Scheme, Scilab, SCSS, Sed, Sieve, Smali, Smalltalk, Smarty, SNBT, Snobol, Solidity, SourcePawn, SPARQL, SQL, SquidConf, Standard ML, stas, Stylus, Svelte, Swift, SYSTEMD, systemverilog
T TableGen, Tal, TASM, Tcl, Tcsh, Termcap, Terminfo, Terraform, TeX, Thrift, TOML, TradingView, Transact-SQL, Turing, Turtle, Twig, TypeScript, TypoScript, TypoScriptCssData, TypoScriptHtmlData, Typst
V V, V shell, Vala, VB.net, verilog, VHDL, VHS, VimL, vue
W WDTE, WebGPU Shading Language, Whiley
X XML, Xorg
Y YAML, YANG
Z Z80 Assembly, Zed, Zig

I will attempt to keep this section up to date, but an authoritative list can be displayed with chroma --list.

Try it

Try out various languages and styles on the Chroma Playground.

Using the library

This is version 2 of Chroma, use the import path:

import "github.com/alecthomas/chroma/v2"

Chroma, like Pygments, has the concepts of lexers, formatters and styles.

Lexers convert source text into a stream of tokens, styles specify how token types are mapped to colours, and formatters convert tokens and styles into formatted output.

A package exists for each of these, containing a global Registry variable with all of the registered implementations. There are also helper functions for using the registry in each package, such as looking up lexers by name or matching filenames, etc.

In all cases, if a lexer, formatter or style can not be determined, nil will be returned. In this situation you may want to default to the Fallback value in each respective package, which provides sane defaults.

Quick start

A convenience function exists that can be used to simply format some source text, without any effort:

err := quick.Highlight(os.Stdout, someSourceCode, "go", "html", "monokai")

Identifying the language

To highlight code, you'll first have to identify what language the code is written in. There are three primary ways to do that:

  1. Detect the language from its filename.

    lexer := lexers.Match("foo.go")
    
  2. Explicitly specify the language by its Chroma syntax ID (a full list is available from lexers.Names()).

    lexer := lexers.Get("go")
    
  3. Detect the language from its content.

    lexer := lexers.Analyse("package main\n\nfunc main()\n{\n}\n")
    

In all cases, nil will be returned if the language can not be identified.

if lexer == nil {
  lexer = lexers.Fallback
}

At this point, it should be noted that some lexers can be extremely chatty. To mitigate this, you can use the coalescing lexer to coalesce runs of identical token types into a single token:

lexer = chroma.Coalesce(lexer)

Formatting the output

Once a language is identified you will need to pick a formatter and a style (theme).

style := styles.Get("swapoff")
if style == nil {
  style = styles.Fallback
}
formatter := formatters.Get("html")
if formatter == nil {
  formatter = formatters.Fallback
}

Then obtain an iterator over the tokens:

contents, err := ioutil.ReadAll(r)
iterator, err := lexer.Tokenise(nil, string(contents))

And finally, format the tokens from the iterator:

err := formatter.Format(w, style, iterator)

The HTML formatter

By default the html registered formatter generates standalone HTML with embedded CSS. More flexibility is available through the formatters/html package.

Firstly, the output generated by the formatter can be customised with the following constructor options:

  • Standalone() - generate standalone HTML with embedded CSS.
  • WithClasses() - use classes rather than inlined style attributes.
  • ClassPrefix(prefix) - prefix each generated CSS class.
  • TabWidth(width) - Set the rendered tab width, in characters.
  • WithLineNumbers() - Render line numbers (style with LineNumbers).
  • WithLinkableLineNumbers() - Make the line numbers linkable and be a link to themselves.
  • HighlightLines(ranges) - Highlight lines in these ranges (style with LineHighlight).
  • LineNumbersInTable() - Use a table for formatting line numbers and code, rather than spans.

If WithClasses() is used, the corresponding CSS can be obtained from the formatter with:

formatter := html.New(html.WithClasses(true))
err := formatter.WriteCSS(w, style)

More detail

Lexers

See the Pygments documentation for details on implementing lexers. Most concepts apply directly to Chroma, but see existing lexer implementations for real examples.

In many cases lexers can be automatically converted directly from Pygments by using the included Python 3 script pygments2chroma_xml.py. I use something like the following:

python3 _tools/pygments2chroma_xml.py \
  pygments.lexers.jvm.KotlinLexer \
  > lexers/embedded/kotlin.xml

A list of all lexers available in Pygments can be found in pygments-lexers.txt.

Formatters

Chroma supports HTML output, as well as terminal output in 8 colour, 256 colour, and true-colour.

A noop formatter is included that outputs the token text only, and a tokens formatter outputs raw tokens. The latter is useful for debugging lexers.

Styles

Chroma styles are defined in XML. The style entries use the same syntax as Pygments.

All Pygments styles have been converted to Chroma using the _tools/style.py script.

When you work with one of Chroma's styles, know that the Background token type provides the default style for tokens. It does so by defining a foreground color and background color.

For example, this gives each token name not defined in the style a default color of #f8f8f8 and uses #000000 for the highlighted code block's background:

<entry type="Background" style="#f8f8f2 bg:#000000"/>

Also, token types in a style file are hierarchical. For instance, when CommentSpecial is not defined, Chroma uses the token style from Comment. So when several comment tokens use the same color, you'll only need to define Comment and override the one that has a different color.

For a quick overview of the available styles and how they look, check out the Chroma Style Gallery.

Command-line interface

A command-line interface to Chroma is included.

Binaries are available to install from the releases page.

The CLI can be used as a preprocessor to colorise output of less(1), see documentation for the LESSOPEN environment variable.

The --fail flag can be used to suppress output and return with exit status 1 to facilitate falling back to some other preprocessor in case chroma does not resolve a specific lexer to use for the given file. For example:

export LESSOPEN='| p() { chroma --fail "$1" || cat "$1"; }; p "%s"'

Replace cat with your favourite fallback preprocessor.

When invoked as .lessfilter, the --fail flag is automatically turned on under the hood for easy integration with lesspipe shipping with Debian and derivatives; for that setup the chroma executable can be just symlinked to ~/.lessfilter.

Projects using Chroma

Testing lexers

If you edit some lexers and want to try it, open a shell in cmd/chromad and run:

go run . --csrf-key=securekey

A Link will be printed. Open it in your Browser. Now you can test on the Playground with your local changes.

If you want to run the tests and the lexers, open a shell in the root directory and run:

go test ./lexers

When updating or adding a lexer, please add tests. See lexers/README.md for more.

What's missing compared to Pygments?

  • Quite a few lexers, for various reasons (pull-requests welcome):
    • Pygments lexers for complex languages often include custom code to handle certain aspects, such as Raku's ability to nest code inside regular expressions. These require time and effort to convert.
    • I mostly only converted languages I had heard of, to reduce the porting cost.
  • Some more esoteric features of Pygments are omitted for simplicity.
  • Though the Chroma API supports content detection, very few languages support them. I have plans to implement a statistical analyser at some point, but not enough time.
Description
A general purpose syntax highlighter in pure Go
Readme 20 MiB
Languages
Go 95.1%
JavaScript 2.9%
Python 1.4%
Dockerfile 0.2%
Shell 0.2%
Other 0.2%