1
0
mirror of https://github.com/jlevy/the-art-of-command-line.git synced 2024-12-04 10:24:46 +02:00
the-art-of-command-line/README.md

506 lines
28 KiB
Markdown
Raw Normal View History

[ Languages: [English](README.md), [Español](README-es.md), [한국어](README-ko.md), [Português](README-pt.md), [Русский](README-ru.md), [Slovenščina](README-sl.md), [中文](README-zh.md) ]
2015-06-27 15:15:13 +02:00
2015-05-20 19:39:58 +02:00
# The Art of Command Line
2015-05-20 18:02:38 +02:00
2015-06-28 20:04:25 +02:00
[![Join the chat at https://gitter.im/jlevy/the-art-of-command-line](https://badges.gitter.im/Join%20Chat.svg)](https://gitter.im/jlevy/the-art-of-command-line?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge)
2015-06-18 05:34:09 +02:00
- [Meta](#meta)
- [Basics](#basics)
2015-05-22 05:57:55 +02:00
- [Everyday use](#everyday-use)
2015-05-31 23:46:32 +02:00
- [Processing files and data](#processing-files-and-data)
2015-05-22 05:57:55 +02:00
- [System debugging](#system-debugging)
- [One-liners](#one-liners)
- [Obscure but useful](#obscure-but-useful)
2015-07-15 01:24:26 +02:00
- [MacOS X only](#macos-x-only)
2015-05-22 23:03:11 +02:00
- [More resources](#more-resources)
2015-05-22 05:57:55 +02:00
- [Disclaimer](#disclaimer)
2015-05-20 19:20:08 +02:00
2015-06-02 20:18:42 +02:00
![curl -s 'https://raw.githubusercontent.com/jlevy/the-art-of-command-line/master/README.md' | egrep -o '`\w+`' | tr -d '`' | cowsay -W50](cowsay.png)
Fluency on the command line is a skill often neglected or considered arcane, but it improves your flexibility and productivity as an engineer in both obvious and subtle ways. This is a selection of notes and tips on using the command-line that I've found useful when working on Linux. Some tips are elementary, and some are fairly specific, sophisticated, or obscure. This page is not long, but if you can use and recall all the items here, you know a lot.
2015-05-20 18:02:38 +02:00
2015-05-22 05:57:55 +02:00
Much of this
[originally](http://www.quora.com/What-are-some-lesser-known-but-useful-Unix-commands)
[appeared](http://www.quora.com/What-are-the-most-useful-Swiss-army-knife-one-liners-on-Unix)
2015-05-22 07:12:28 +02:00
on [Quora](http://www.quora.com/What-are-some-time-saving-tips-that-every-Linux-user-should-know),
but given the interest there, it seems it's worth using Github, where people more talented than I can readily suggest improvements. If you see an error or something that could be better, please submit an issue or PR! (Of course please review the meta section and existing PRs/issues first.)
2015-05-20 18:02:38 +02:00
2015-06-18 05:34:09 +02:00
## Meta
2015-05-22 05:57:55 +02:00
Scope:
- This guide is both for beginners and the experienced. The goals are *breadth* (everything important), *specificity* (give concrete examples of the most common case), and *brevity* (avoid things that aren't essential or digressions you can easily look up elsewhere). Every tip is essential in some situation or significantly saves time over alternatives.
2015-07-15 01:24:26 +02:00
- This is written for Linux, with the exception of the "[MacOS X only](#macos-x-only)" section. Many of the other items apply or can be installed on other Unices or MacOS (or even Cygwin).
2015-05-22 05:57:55 +02:00
- The focus is on interactive Bash, though many tips apply to other shells and to general Bash scripting.
- It includes both "standard" Unix commands as well as ones that require special package installs -- so long as they are important enough to merit inclusion.
2015-06-18 05:34:09 +02:00
Notes:
- To keep this to one page, content is implicitly included by reference. You're smart enough to look up more detail elsewhere once you know the idea or command to Google. Use `apt-get`/`yum`/`dnf`/`pacman`/`pip`/`brew` (as appropriate) to install new programs.
2015-06-18 06:16:23 +02:00
- Use [Explainshell](http://explainshell.com/) to get a helpful breakdown of what commands, options, pipes etc. do.
2015-05-22 05:57:55 +02:00
## Basics
2015-05-20 18:02:38 +02:00
2015-06-16 09:09:48 +02:00
- Learn basic Bash. Actually, type `man bash` and at least skim the whole thing; it's pretty easy to follow and not that long. Alternate shells can be nice, but Bash is powerful and always available (learning *only* zsh, fish, etc., while tempting on your own laptop, restricts you in many situations, such as using existing servers).
- Learn at least one text-based editor well. Ideally Vim (`vi`), as there's really no competition for random editing in a terminal (even if you use Emacs, a big IDE, or a modern hipster editor most of the time).
2015-06-20 09:19:56 +02:00
- Know how to read documentation with `man` (for the inquisitive, `man man` lists the section numbers, e.g. 1 is "regular" commands, 5 is files/conventions, and 8 are for administration). Find man pages with `apropos`. Know that some commands are not executables, but Bash builtins, and that you can get help on them with `help` and `help -d`.
2015-06-20 08:48:44 +02:00
2015-07-03 09:54:30 +02:00
- Learn about redirection of output and input using `>` and `<` and pipes using `|`. Know `>` overwrites the output file and `>>` appends. Learn about stdout and stderr.
2015-08-11 13:54:12 +02:00
- Learn about file glob expansion with `*` (and perhaps `?` and `[`...`]`) and quoting and the difference between double `"` and single `'` quotes. (See more on variable expansion below.)
2015-06-16 09:09:48 +02:00
- Be familiar with Bash job management: `&`, **ctrl-z**, **ctrl-c**, `jobs`, `fg`, `bg`, `kill`, etc.
- Know `ssh`, and the basics of passwordless authentication, via `ssh-agent`, `ssh-add`, etc.
- Basic file management: `ls` and `ls -l` (in particular, learn what every column in `ls -l` means), `less`, `head`, `tail` and `tail -f` (or even better, `less +F`), `ln` and `ln -s` (learn the differences and advantages of hard versus soft links), `chown`, `chmod`, `du` (for a quick summary of disk usage: `du -hs *`). For filesystem management, `df`, `mount`, `fdisk`, `mkfs`, `lsblk`.
2015-05-20 19:39:58 +02:00
- Basic network management: `ip` or `ifconfig`, `dig`.
2015-07-20 21:37:30 +02:00
- Know regular expressions well, and the various flags to `grep`/`egrep`. The `-i`, `-o`, `-v`, `-A`, `-B`, and `-C` options are worth knowing.
- Learn to use `apt-get`, `yum`, `dnf` or `pacman` (depending on distro) to find and install packages. And make sure you have `pip` to install Python-based command-line tools (a few below are easiest to install via `pip`).
2015-05-20 18:02:38 +02:00
2015-05-20 19:20:08 +02:00
2015-05-20 18:02:38 +02:00
## Everyday use
- In Bash, use **Tab** to complete arguments or list all available commands and **ctrl-r** to search through command history (after pressing, type to search, press **ctrl-r** repeatedly to cycle through more matches, press **Enter** to execute the found command, or hit the right arrow to put the result in the current line to allow editing).
- In Bash, use **ctrl-w** to delete the last word, and **ctrl-u** to delete all the way back to the start of the line. Use **alt-b** and **alt-f** to move by word, **ctrl-a** to move cursor to beginning of line, **ctrl-e** to move cursor to end of line, **ctrl-k** to kill to the end of the line, **ctrl-l** to clear the screen. See `man readline` for all the default keybindings in Bash. There are a lot. For example **alt-.** cycles through previous arguments, and **alt-*** expands a glob.
- Alternatively, if you love vi-style key-bindings, use `set -o vi` (and `set -o emacs` to put it back).
- For editing long commands, after setting your editor (for example `export EDITOR=vim`), **ctrl-x** **ctrl-e** will open the current command in an editor for multi-line editing. Or in vi style, **escape-v**.
2015-06-21 17:48:46 +02:00
- To see recent commands, `history`. There are also many abbreviations such as `!$` (last argument) and `!!` last command, though these are often easily replaced with **ctrl-r** and **alt-.**.
- To go back to the previous working directory: `cd -`
- If you are halfway through typing a command but change your mind, hit **alt-#** to add a `#` at the beginning and enter it as a comment (or use **ctrl-a**, **#**, **enter**). You can then return to it later via command history.
- Use `xargs` (or `parallel`). It's very powerful. Note you can control how many items execute per line (`-L`) as well as parallelism (`-P`). If you're not sure if it'll do the right thing, use `xargs echo` first. Also, `-I{}` is handy. Examples:
2015-06-16 17:05:32 +02:00
```bash
find . -name '*.py' | xargs grep some_function
2015-05-20 18:02:38 +02:00
cat hosts | xargs -I{} ssh root@{} hostname
2015-05-20 19:20:08 +02:00
```
2015-05-20 18:02:38 +02:00
2015-05-20 19:39:58 +02:00
- `pstree -p` is a helpful display of the process tree.
2015-05-20 19:39:58 +02:00
- Use `pgrep` and `pkill` to find or signal processes by name (`-f` is helpful).
2015-05-20 19:57:40 +02:00
- Know the various signals you can send processes. For example, to suspend a process, use `kill -STOP [pid]`. For the full list, see `man 7 signal`
2015-05-20 19:57:40 +02:00
- Use `nohup` or `disown` if you want a background process to keep running forever.
2015-06-21 09:31:43 +02:00
- Check what processes are listening via `netstat -lntp` or `ss -plat` (for TCP; add `-u` for UDP).
2015-05-20 19:57:40 +02:00
- See also `lsof` for open sockets and files.
2015-05-20 19:39:58 +02:00
2015-07-24 01:45:37 +02:00
- See `uptime` or `w` to know the how long the system has been running.
- Use `alias` to create shortcuts for commonly used commands. For example, `alias ll='ls -latr'` creates a new alias `ll`.
- In Bash scripts, use `set -x` (or the variant `set -v`, which logs raw input, including unexpanded variables and comments) for debugging output. Use strict modes whenever possible: Use `set -e` to abort on errors and `set -o pipefail` as to abort within pipes, too (though this topic is a bit subtle). Use `set -u` to detect unset variable usages. For more involved scripts, also use `trap` on EXIT or ERR. A useful habit is to start a script like so, which will make it detect and abort on common errors and print a message:
```bash
set -euo pipefail
trap "echo 'error: Script failed: see last command above'" ERR
```
2015-06-16 09:09:48 +02:00
- In Bash scripts, subshells (written with parentheses) are convenient ways to group commands. A common example is to temporarily move to a different working directory, e.g.
2015-06-16 17:05:32 +02:00
```bash
2015-05-20 18:02:38 +02:00
# do something in current dir
2015-06-18 00:50:38 +02:00
(cd /some/other/dir && other-command)
2015-05-20 18:02:38 +02:00
# continue in original dir
2015-05-20 19:20:08 +02:00
```
2015-05-20 18:02:38 +02:00
2015-06-16 09:09:48 +02:00
- In Bash, note there are lots of kinds of variable expansion. Checking a variable exists: `${name:?error message}`. For example, if a Bash script requires a single argument, just write `input_file=${1:?usage: $0 input_file}`. Arithmetic expansion: `i=$(( (i + 1) % 5 ))`. Sequences: `{1..10}`. Trimming of strings: `${var%suffix}` and `${var#prefix}`. For example if `var=foo.pdf`, then `echo ${var%.pdf}.txt` prints `foo.txt`.
- Brace expansion using `{`...`}` can reduce having to re-type similar text and automate combinations of items. This is helpful in examples like `mv foo.{txt,pdf} some-dir` (which moves both files), `cp somefile{,.bak}` (which expands to
`cp somefile somefile.bak`) or `mkdir -p test-{a,b,c}/subtest-{1,2,3}` (which expands all possible combinations and creates a directory tree).
2015-06-24 08:00:16 +02:00
2015-05-20 19:57:40 +02:00
- The output of a command can be treated like a file via `<(some command)`. For example, compare local `/etc/hosts` with a remote one:
2015-06-16 17:05:32 +02:00
```sh
2015-05-20 19:39:58 +02:00
diff /etc/hosts <(ssh somehost cat /etc/hosts)
2015-05-20 19:20:08 +02:00
```
2015-06-16 09:09:48 +02:00
- Know about "here documents" in Bash, as in `cat <<EOF ...`.
- In Bash, redirect both standard output and standard error via: `some-command >logfile 2>&1` or `some-command &>logfile`. Often, to ensure a command does not leave an open file handle to standard input, tying it to the terminal you are in, it is also good practice to add `</dev/null`.
2015-06-17 01:33:37 +02:00
- Use `man ascii` for a good ASCII table, with hex and decimal values. For general encoding info, `man unicode`, `man utf-8`, and `man latin1` are helpful.
- Use `screen` or [`tmux`](https://tmux.github.io/) to multiplex the screen, especially useful on remote ssh sessions and to detach and re-attach to a session. A more minimal alternative for session persistence only is `dtach`.
2015-05-20 19:57:40 +02:00
- In ssh, knowing how to port tunnel with `-L` or `-D` (and occasionally `-R`) is useful, e.g. to access web sites from a remote server.
2015-07-12 11:58:27 +02:00
- It can be useful to make a few optimizations to your ssh configuration; for example, this `~/.ssh/config` contains settings to avoid dropped connections in certain network environments, uses compression (which is helpful with scp over low-bandwidth connections), and multiplex channels to the same server with a local control file:
2015-05-20 19:20:08 +02:00
```
2015-05-20 18:02:38 +02:00
TCPKeepAlive=yes
ServerAliveInterval=15
ServerAliveCountMax=6
Compression=yes
ControlMaster auto
ControlPath /tmp/%r@%h:%p
ControlPersist yes
2015-05-20 19:20:08 +02:00
```
2015-05-20 18:02:38 +02:00
- A few other options relevant to ssh are security sensitive and should be enabled with care, e.g. per subnet or host or in trusted networks: `StrictHostKeyChecking=no`, `ForwardAgent=yes`
2015-05-20 19:39:58 +02:00
- To get the permissions on a file in octal form, which is useful for system configuration but not available in `ls` and easy to bungle, use something like
2015-06-16 17:05:32 +02:00
```sh
2015-05-20 19:20:08 +02:00
stat -c '%A %a %n' /etc/timezone
```
2015-05-20 18:02:38 +02:00
2015-07-03 10:19:26 +02:00
- For interactive selection of values from the output of another command, use [`percol`](https://github.com/mooz/percol) or [`fzf`](https://github.com/junegunn/fzf).
2015-06-02 18:24:06 +02:00
- For interaction with files based on the output of another command (like `git`), use `fpp` ([PathPicker](https://github.com/facebook/PathPicker)).
2015-06-16 11:09:45 +02:00
- For a simple web server for all files in the current directory (and subdirs), available to anyone on your network, use:
`python -m SimpleHTTPServer 7777` (for port 7777 and Python 2) and `python -m http.server 7777` (for port 7777 and Python 3).
2015-07-03 09:59:32 +02:00
- For running a command with privileges, use `sudo` (for root) or `sudo -u` (for another user). Use `su` or `sudo bash` to actually run a shell as that user. Use `su -` to simulate a fresh login as root or another user.
2015-05-20 19:52:07 +02:00
2015-05-31 23:46:32 +02:00
## Processing files and data
2015-06-16 13:50:26 +02:00
- To locate a file by name in the current directory, `find . -iname '*something*'` (or similar). To find a file anywhere by name, use `locate something` (but bear in mind `updatedb` may not have indexed recently created files).
2015-05-31 23:46:32 +02:00
- For general searching through source or data files (more advanced than `grep -r`), use [`ag`](https://github.com/ggreer/the_silver_searcher).
2015-05-20 18:02:38 +02:00
2015-05-20 19:39:58 +02:00
- To convert HTML to text: `lynx -dump -stdin`
2015-06-02 20:30:05 +02:00
- For Markdown, HTML, and all kinds of document conversion, try [`pandoc`](http://pandoc.org/).
2015-05-20 19:39:58 +02:00
- If you must handle XML, `xmlstarlet` is old but good.
2015-07-12 11:58:27 +02:00
- For JSON, use [`jq`](http://stedolan.github.io/jq/).
2015-06-16 09:23:16 +02:00
- For Excel or CSV files, [csvkit](https://github.com/onyxfish/csvkit) provides `in2csv`, `csvcut`, `csvjoin`, `csvgrep`, etc.
2015-06-01 04:42:13 +02:00
- For Amazon S3, [`s3cmd`](https://github.com/s3tools/s3cmd) is convenient and [`s4cmd`](https://github.com/bloomreach/s4cmd) is faster. Amazon's [`aws`](https://github.com/aws/aws-cli) is essential for other AWS-related tasks.
2015-05-20 18:02:38 +02:00
- Know about `sort` and `uniq`, including uniq's `-u` and `-d` options -- see one-liners below. See also `comm`.
2015-05-20 19:39:58 +02:00
- Know about `cut`, `paste`, and `join` to manipulate text files. Many people use `cut` but forget about `join`.
2015-05-20 18:02:38 +02:00
- Know about `wc` to count newlines (`-l`), characters (`-m`), words (`-w`) and bytes (`-c`).
- Know about `tee` to copy from stdin to a file and also to stdout, as in `ls -al | tee file.txt`.
- Know that locale affects a lot of command line tools in subtle ways, including sorting order (collation) and performance. Most Linux installations will set `LANG` or other locale variables to a local setting like US English. But be aware sorting will change if you change locale. And know i18n routines can make sort or other commands run *many times* slower. In some situations (such as the set operations or uniqueness operations below) you can safely ignore slow i18n routines entirely and use traditional byte-based sort order, using `export LC_ALL=C`.
2015-05-20 19:39:58 +02:00
- Know basic `awk` and `sed` for simple data munging. For example, summing all numbers in the third column of a text file: `awk '{ x += $3 } END { print x }'`. This is probably 3X faster and 3X shorter than equivalent Python.
2015-05-20 18:02:38 +02:00
- To replace all occurrences of a string in place, in one or more files:
2015-06-16 17:05:32 +02:00
```sh
perl -pi.bak -e 's/old-string/new-string/g' my-files-*.txt
2015-05-20 19:20:08 +02:00
```
2015-05-22 19:46:06 +02:00
- To rename many files at once according to a pattern, use `rename`. For complex renames, [`repren`](https://github.com/jlevy/repren) may help.
2015-06-16 17:05:32 +02:00
```sh
2015-06-08 18:39:47 +02:00
# Recover backup files foo.bak -> foo:
rename 's/\.bak$//' *.bak
# Full rename of filenames, directories, and contents foo -> bar:
repren --full --preserve-case --from foo --to bar .
2015-05-20 19:20:08 +02:00
```
2015-05-20 18:02:38 +02:00
2015-05-20 19:39:58 +02:00
- Use `shuf` to shuffle or select random lines from a file.
2015-07-02 06:58:16 +02:00
- Know `sort`'s options. For numbers, use `-n`, or `-h` for handling human-readable numbers (e.g. from `du -h`). Know how keys work (`-t` and `-k`). In particular, watch out that you need to write `-k1,1` to sort by only the first field; `-k1` means sort according to the whole line. Stable sort (`sort -s`) can be useful. For example, to sort first by field 2, then secondarily by field 1, you can use `sort -k1,1 | sort -s -k2,2`.
2015-06-16 09:09:48 +02:00
- If you ever need to write a tab literal in a command line in Bash (e.g. for the -t argument to sort), press **ctrl-v** **[Tab]** or write `$'\t'` (the latter is better as you can copy/paste it).
2015-08-11 19:17:40 +02:00
- The standard tools for patching source code are `diff` and `patch`. See also `diffstat` for summary statistics of a diff and `sdiff` for a side-by-side diff. Note `diff -r` works for entire directories. Use `diff -r tree1 tree2 | diffstat` for a summary of changes. Use `vimdiff` to compare and edit files.
2015-05-20 19:39:58 +02:00
- For binary files, use `hd` for simple hex dumps and `bvi` for binary editing.
2015-05-20 19:39:58 +02:00
- Also for binary files, `strings` (plus `grep`, etc.) lets you find bits of text.
- For binary diffs (delta compression), use `xdelta3`.
2015-05-20 19:39:58 +02:00
- To convert text encodings, try `iconv`. Or `uconv` for more advanced use; it supports some advanced Unicode things. For example, this command lowercases and removes all accents (by expanding and dropping them):
2015-06-16 17:05:32 +02:00
```sh
2015-05-20 18:02:38 +02:00
uconv -f utf-8 -t utf-8 -x '::Any-Lower; ::Any-NFD; [:Nonspacing Mark:] >; ::Any-NFC; ' < input.txt > output.txt
2015-05-20 19:20:08 +02:00
```
2015-05-20 18:02:38 +02:00
2015-05-20 19:39:58 +02:00
- To split files into pieces, see `split` (to split by size) and `csplit` (to split by a pattern).
2015-05-20 18:02:38 +02:00
- To manipulate date and time expressions, use `dateadd`, `datediff`, `strptime` etc. from [`dateutils`](http://www.fresse.org/dateutils).
- Use `zless`, `zmore`, `zcat`, and `zgrep` to operate on compressed files.
2015-05-22 05:57:55 +02:00
2015-05-20 18:02:38 +02:00
## System debugging
2015-06-01 04:42:13 +02:00
- For web debugging, `curl` and `curl -I` are handy, or their `wget` equivalents, or the more modern [`httpie`](https://github.com/jakubroztocil/httpie).
- To know current cpu/disk status, the classic tools are `top` (or the better `htop`), `iostat`, and `iotop`. Use `iostat -mxz 15` for basic CPU and detailed per-partition disk stats and performance insight.
- For network connection details, use `netstat` and `ss`.
- For a quick overview of what's happening on a system, `dstat` is especially useful. For broadest overview with details, use [`glances`](https://github.com/nicolargo/glances).
2015-06-16 04:04:54 +02:00
2015-05-20 19:39:58 +02:00
- To know memory status, run and understand the output of `free` and `vmstat`. In particular, be aware the "cached" value is memory held by the Linux kernel as file cache, so effectively counts toward the "free" value.
- Java system debugging is a different kettle of fish, but a simple trick on Oracle's and some other JVMs is that you can run `kill -3 <pid>` and a full stack trace and heap summary (including generational garbage collection details, which can be highly informative) will be dumped to stderr/logs. The JDK's `jps`, `jstat`, `jstack`, `jmap` are useful. [SJK tools](https://github.com/aragozin/jvm-tools) are more advanced.
2015-05-20 19:39:58 +02:00
- Use `mtr` as a better traceroute, to identify network issues.
- For looking at why a disk is full, `ncdu` saves time over the usual commands like `du -sh *`.
2015-05-20 19:39:58 +02:00
- To find which socket or process is using bandwidth, try `iftop` or `nethogs`.
2015-05-20 19:39:58 +02:00
- The `ab` tool (comes with Apache) is helpful for quick-and-dirty checking of web server performance. For more complex load testing, try `siege`.
2015-06-16 18:36:01 +02:00
- For more serious network debugging, `wireshark`, `tshark`, or `ngrep`.
2015-05-20 19:39:58 +02:00
- Know about `strace` and `ltrace`. These can be helpful if a program is failing, hanging, or crashing, and you don't know why, or if you want to get a general idea of performance. Note the profiling option (`-c`), and the ability to attach to a running process (`-p`).
2015-05-20 19:39:58 +02:00
- Know about `ldd` to check shared libraries etc.
2015-05-20 19:39:58 +02:00
- Know how to connect to a running process with `gdb` and get its stack traces.
2015-07-24 01:45:37 +02:00
- Use `/proc`. It's amazingly helpful sometimes when debugging live problems. Examples: `/proc/cpuinfo`, `/proc/meminfo`, `/proc/cmdline`, `/proc/xxx/cwd`, `/proc/xxx/exe`, `/proc/xxx/fd/`, `/proc/xxx/smaps` (where `xxx` is the process id or pid).
2015-05-20 19:39:58 +02:00
- When debugging why something went wrong in the past, `sar` can be very helpful. It shows historic statistics on CPU, memory, network, etc.
2015-05-22 19:46:06 +02:00
- For deeper systems and performance analyses, look at `stap` ([SystemTap](https://sourceware.org/systemtap/wiki)), [`perf`](http://en.wikipedia.org/wiki/Perf_(Linux)), and [`sysdig`](https://github.com/draios/sysdig).
2015-07-03 09:22:55 +02:00
- Check what OS you're on with `uname` or `uname -a` (general Unix/kernel info) or `lsb_release -a` (Linux distro info).
2015-05-20 19:39:58 +02:00
- Use `dmesg` whenever something's acting really funny (it could be hardware or driver issues).
2015-05-20 18:02:38 +02:00
## One-liners
2015-05-22 05:57:55 +02:00
A few examples of piecing together commands:
- It is remarkably helpful sometimes that you can do set intersection, union, and difference of text files via `sort`/`uniq`. Suppose `a` and `b` are text files that are already uniqued. This is fast, and works on files of arbitrary size, up to many gigabytes. (Sort is not limited by memory, though you may need to use the `-T` option if `/tmp` is on a small root partition.) See also the note about `LC_ALL` above and `sort`'s `-u` option (left out for clarity below).
2015-06-16 17:05:32 +02:00
```sh
cat a b | sort | uniq > c # c is a union b
cat a b | sort | uniq -d > c # c is a intersect b
cat a b b | sort | uniq -u > c # c is set difference a - b
2015-05-20 19:20:08 +02:00
```
2015-06-18 00:30:51 +02:00
- Use `grep . *` to visually examine all contents of all files in a directory, e.g. for directories filled with config settings, like `/sys`, `/proc`, `/etc`.
- Summing all numbers in the third column of a text file (this is probably 3X faster and 3X less code than equivalent Python):
2015-06-16 17:05:32 +02:00
```sh
awk '{ x += $3 } END { print x }' myfile
2015-05-20 19:20:08 +02:00
```
2015-05-20 19:39:58 +02:00
- If want to see sizes/dates on a tree of files, this is like a recursive `ls -l` but is easier to read than `ls -lR`:
2015-06-16 17:05:32 +02:00
```sh
2015-05-20 19:20:08 +02:00
find . -type f -ls
```
2015-05-20 19:39:58 +02:00
- Say you have a text file, like a web server log, and a certain value that appears on some lines, such as an `acct_id` parameter that is present in the URL. If you want a tally of how many requests for each `acct_id`:
2015-06-16 17:05:32 +02:00
```sh
cat access.log | egrep -o 'acct_id=[0-9]+' | cut -d= -f2 | sort | uniq -c | sort -rn
2015-05-20 19:20:08 +02:00
```
- To continuously monitor changes, use `watch`, e.g. check changes to files in a directory with `watch -d -n 2 'ls -rtlh | tail'` or to network settings while troubleshooting your wifi settings with `watch -d -n 2 ifconfig`.
2015-06-08 18:39:47 +02:00
- Run this function to get a random tip from this document (parses Markdown and extracts an item):
2015-06-16 17:05:32 +02:00
```sh
2015-06-08 18:39:47 +02:00
function taocl() {
curl -s https://raw.githubusercontent.com/jlevy/the-art-of-command-line/master/README.md |
2015-06-14 22:37:32 +02:00
pandoc -f markdown -t html |
2015-06-08 18:39:47 +02:00
xmlstarlet fo --html --dropdtd |
xmlstarlet sel -t -v "(html/body/ul/li[count(p)>0])[$RANDOM mod last()+1]" |
xmlstarlet unesc | fmt -80
2015-06-08 18:39:47 +02:00
}
2015-06-02 20:18:42 +02:00
```
2015-05-20 19:39:58 +02:00
## Obscure but useful
2015-05-20 19:52:07 +02:00
- `expr`: perform arithmetic or boolean operations or evaluate regular expressions
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `m4`: simple macro processor
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `yes`: print a string a lot
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `cal`: nice calendar
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `env`: run a command (useful in scripts)
2015-06-02 20:18:42 +02:00
2015-06-24 01:05:53 +02:00
- `printenv`: print out environment variables (useful in debugging and scripts)
2015-05-20 19:52:07 +02:00
- `look`: find English words (or lines in a file) beginning with a string
2015-06-02 20:18:42 +02:00
2015-07-12 11:58:27 +02:00
- `cut`, `paste` and `join`: data manipulation
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `fmt`: format text paragraphs
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `pr`: format text into pages/columns
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `fold`: wrap lines of text
2015-06-02 20:18:42 +02:00
- `column`: format text fields into aligned, fixed-width columns or tables
2015-06-02 20:18:42 +02:00
- `expand` and `unexpand`: convert between tabs and spaces
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `nl`: add line numbers
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `seq`: print numbers
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `bc`: calculator
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `factor`: factor integers
2015-06-02 20:18:42 +02:00
2015-07-12 15:55:21 +02:00
- [`gpg`](https://gnupg.org/): encrypt and sign files
2015-06-16 09:09:48 +02:00
- `toe`: table of terminfo entries
2015-05-20 19:52:07 +02:00
- `nc`: network debugging and data transfer
2015-06-02 20:18:42 +02:00
- `socat`: socket relay and tcp port forwarder (similar to `netcat`)
2015-07-12 15:52:41 +02:00
- [`slurm`](https://github.com/mattthias/slurm): network trafic visualization
2015-05-20 19:52:07 +02:00
- `dd`: moving data between files or devices
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `file`: identify type of a file
2015-06-02 20:18:42 +02:00
2015-06-20 08:52:51 +02:00
- `tree`: display directories and subdirectories as a nesting tree; like `ls` but recursive
2015-05-20 19:52:07 +02:00
- `stat`: file info
2015-06-02 20:18:42 +02:00
2015-07-19 08:04:01 +02:00
- `time`: execute and time a command
- `watch`: run a command repeatedly, showing results and/or highlighting changes
2015-05-20 19:52:07 +02:00
- `tac`: print files in reverse
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `shuf`: random selection of lines from a file
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `comm`: compare sorted files line by line
2015-06-02 20:18:42 +02:00
2015-06-22 08:57:51 +02:00
- `pv`: monitor the progress of data through a pipe
2015-05-20 19:52:07 +02:00
- `hd` and `bvi`: dump or edit binary files
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `strings`: extract text from binary files
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `tr`: character translation or manipulation
2015-06-02 20:18:42 +02:00
2015-06-20 08:52:51 +02:00
- `iconv` or `uconv`: conversion for text encodings
2015-06-02 20:18:42 +02:00
- `split` and `csplit`: splitting files
2015-06-02 20:18:42 +02:00
2015-07-03 13:44:31 +02:00
- `sponge`: read all input before writing it, useful for reading from then writing to the same file, e.g., `grep -v something some-file | sponge some-file`
2015-07-03 10:13:06 +02:00
- `units`: unit conversions and calculations; converts furlongs per fortnight to twips per blink (see also `/usr/share/units/definitions.units`)
2015-08-12 08:46:17 +02:00
- `apg`: generates random passwords
2015-05-20 19:52:07 +02:00
- `7z`: high-ratio file compression
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `ldd`: dynamic library info
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `nm`: symbols from object files
2015-06-02 20:18:42 +02:00
2015-06-16 00:20:25 +02:00
- `ab`: benchmarking web servers
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `strace`: system call debugging
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `mtr`: better traceroute for network debugging
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `cssh`: visual concurrent shell
2015-06-02 20:18:42 +02:00
- `rsync`: sync files and folders over SSH or in local file system
- `wireshark` and `tshark`: packet capture and network debugging
2015-06-02 20:18:42 +02:00
2015-06-20 08:52:51 +02:00
- `ngrep`: grep for the network layer
- `host` and `dig`: DNS lookups
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `lsof`: process file descriptor and socket info
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `dstat`: useful system stats
2015-06-02 20:18:42 +02:00
2015-06-16 09:09:48 +02:00
- [`glances`](https://github.com/nicolargo/glances): high level, multi-subsystem overview
2015-06-16 04:04:54 +02:00
- `iostat`: Disk usage stats
- `mpstat`: CPU usage stats
- `vmstat`: Memory usage stats
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `htop`: improved version of top
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `last`: login history
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `w`: who's logged on
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `id`: user/group identity info
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `sar`: historic system stats
2015-06-02 20:18:42 +02:00
- `iftop` or `nethogs`: network utilization by socket or process
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `ss`: socket statistics
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `dmesg`: boot and system error messages
2015-06-02 20:18:42 +02:00
2015-07-24 22:06:08 +02:00
- `sysctl`: view and configure Linux kernel parameters at run time
2015-05-20 19:52:07 +02:00
- `hdparm`: SATA/ATA disk manipulation/performance
2015-06-02 20:18:42 +02:00
2015-05-20 19:52:07 +02:00
- `lsb_release`: Linux distribution info
2015-06-02 20:18:42 +02:00
- `lsblk`: list block devices: a tree view of your disks and disk paritions
2015-06-23 03:42:05 +02:00
- `lshw`, `lscpu`, `lspci`, `lsusb`, `dmidecode`: hardware information, including CPU, BIOS, RAID, graphics, devices, etc.
2015-06-02 20:18:42 +02:00
2015-08-04 04:22:04 +02:00
- `lsmod` and `modinfo`: List and show details of kernel modules.
2015-06-16 07:36:06 +02:00
- `fortune`, `ddate`, and `sl`: um, well, it depends on whether you consider steam locomotives and Zippy quotations "useful"
2015-07-09 19:06:56 +02:00
## MacOS X only
These are items relevant *only* on MacOS.
- Package management with `brew` (Homebrew) and/or `port` (MacPorts). These can be used to install on MacOS many of the above commands.
2015-07-03 09:52:01 +02:00
- Copy output of any command to a desktop app with `pbcopy` and paste input from one with `pbpaste`.
- To enable the Option key in Mac OS Terminal as an alt key (such as used in the commands above like **alt-b**, **alt-f**, etc.), open Preferences -> Profiles -> Keyboard and select "Use Option as Meta key".
- To open a file with a desktop app, use `open` or `open -a /Applications/Whatever.app`.
- Spotlight: Search files with `mdfind` and list metadata (such as photo EXIF info) with `mdls`.
2015-07-03 09:52:01 +02:00
- Be aware MacOS is based on BSD Unix, and many commands (for example `ps`, `ls`, `tail`, `awk`, `sed`) have many subtle variations from Linux, which is largely influenced by System V-style Unix and GNU tools. You can often tell the difference by noting a man page has the heading "BSD General Commands Manual." In some cases GNU versions can be installed, too (such as `gawk` and `gsed` for GNU awk and sed). If writing cross-platform Bash scripts, avoid such commands (for example, consider Python or `perl`) or test carefully.
2015-05-22 23:03:11 +02:00
## More resources
2015-05-22 23:05:15 +02:00
- [awesome-shell](https://github.com/alebcay/awesome-shell): A curated list of shell tools and resources.
- [Strict mode](http://redsymbol.net/articles/unofficial-bash-strict-mode/) for writing better shell scripts.
- [shellcheck](https://github.com/koalaman/shellcheck): A shell script static analysis tool. Essentially, lint for bash/sh/zsh.
- [Filenames and Pathnames in Shell](http://www.dwheeler.com/essays/filenames-in-shell.html): The sadly complex minutiae on how to handle filenames correctly in shell scripts.
2015-05-22 23:03:11 +02:00
2015-05-20 19:20:08 +02:00
## Disclaimer
2015-06-16 09:09:48 +02:00
With the exception of very small tasks, code is written so others can read it. With power comes responsibility. The fact you *can* do something in Bash doesn't necessarily mean you should! ;)
2015-06-20 18:53:07 +02:00
## License
[![Creative Commons License](https://i.creativecommons.org/l/by-sa/4.0/88x31.png)](http://creativecommons.org/licenses/by-sa/4.0/)
2015-06-20 18:53:07 +02:00
This work is licensed under a [Creative Commons Attribution-ShareAlike 4.0 International License](http://creativecommons.org/licenses/by-sa/4.0/).