Mercurial File Formats

This page is a starting point for understanding Mercurial file internals. Text files are marked as "local" if they are in the local charset, otherwise "utf-8" or "ascii". Files intended to be user-editable are marked "editable". All files are optional unless otherwise noted.

1. Bundle files

Bundle files use the WireProtocol format, with a header indicating compression type:

2. Files in the working directory

2.1. .hgignore (local editable)

This lists ignore patterns. May or may not be managed. See hgignore manpage.

Source: mercurial/ignore.py

2.2. .hgtags (utf-8 editable)

This is a managed file containing tags. Effective tags are determined by combining information from this file on all project heads (not from the working directory).

One line per tag in the format '<full hex changeset id> <tag>'.

Source: mercurial/tags.py

2.3. .hgsigs (ascii) [gpg]

This file contains changeset signatures from the gpg extension, one per line. Format:

<full hex changeset id> <signature version> <base64-encoded signature>

Source: hgext/gpg.py

2.4. .hg/

This directory in the repository root contains all the revision history and non-versioned configuration data.

3. Files in the repository directory (.hg)

3.1. 00changelog.i

In repositories with the 'store' requirement, this is a placeholder to warn older Mercurial about version mismatch.

3.2. requires (ascii)

If this file exists, it contains one line per feature that a local Mercurial must support to read the repository. See RequiresFile.

3.3. bookmarks

This file stores Bookmarks. Same format as .hgtags: one bookmark per line, in the format:

<full hex changeset id> <bookmark>

3.4. bookmarks.current

This file contains a single line with the active bookmark name. If there is no active bookmark, this file won't exist.

3.5. branch (utf-8)

This file contains a single line with the branch name for the branch in the working directory. If it doesn't exist, the branch is \'\' (aka 'default').

Source: mercurial/dirstate.py:_branch

3.6. branch.cache (utf-8)

First line is:

<full hex id of tip> <revision number of tip>

This line is used to determine if the cache is current.

Remaining lines are:

<full hex id of branch tip> <branch name>

3.7. dirstate

This file contains information on the current state of the working directory in a binary format. It begins with two 20-byte hashes, for first and second parent, followed by an entry for each file. Each file entry is of the following form:

<1-byte state><4-byte mode><4-byte size><4-byte mtime><4-byte name length><n-byte name>

If the name contains a null character, it is split into two strings, with the second being the copy source for move and copy operations.

If the dirstate file is not present, parents are assumed to be (null, null) with no files tracked.

Source: mercurial/parsers.py:parse_dirstate()

3.8. hgrc (local editable)

This is the repository-local configuration file. See .hgrc for more info.

3.9. inotify.sock [inotify]

This is a socket created by the inotify extension to communicate with its daemon.

3.10. localtags (local editable)

File containing local tags. Same format as .hgtags.

3.11. patches/ [mq]

This directory contains mq's patch management data

3.12. wlock

The lock file for the working directory state

Source: mercurial/lock.py, mercurial/localrepo.py:wlock()

3.13. journal.*

These files are backups of files before the beginning of a transaction used to restore earlier state on failure:

3.14. undo.*

Files from last transaction to allow rollback

4. Files in the repository store (.hg or .hg/store)

The following files are stored under .hg/store in repos with the store requirement, otherwise in .hg

4.1. lock

The lock file for the repository store

Source: mercurial/lock.py, mercurial/localrepo.py:lock()

4.2. journal

The journal file is a text file containing one entry per line of the form:

<filename> <pre-modified length>

This file allows mercurial to undo changes to revlogs. If this file exists, a transaction is in progress or has been interrupted.

Source: mercurial/transaction.py

4.3. undo

Renamed journal to allow rollback after transaction is complete.

Source: mercurial/localrepo.py:rollback()

4.4. 00changelog.[id]

The project changelog, stored in Revlog format.

Source: mercurial/changelog.py

4.5. 00manifest.[id]

The project manifest, stored in Revlog format. Each manifest revision contains a list of the file revisions in each changeset, in the form:

<filename>\0<hex file revision id>[<flags>]\n

Source: mercurial/parsers.c:parse_manifest()

4.6. fncache

For the fncache repository format Mercurial maintains a new file 'fncache' (thus the name of the format) inside '.hg/store'. The fncache file contains the paths of all filelog files in the store as encoded by mercurial.filelog.encodedir. The paths are separated by '\n' (LF).

4.7. data/

Revlogs for each file in the project history. Names are escaped in various increasingly-complex ways:

Metadata may be stored at the start of each revision in a file revlog. If any metadata is there, the file contents will start with '\1\n', after which an arbitrary list of metadata pairs will follow, in '%k: %v\n' format. After that, another '\1\n' sequence follows to denote the start of the content.

5. See also


CategoryInternals

FileFormats (last edited 2013-04-26 16:17:21 by KevinBullock)