Note:

This page is primarily intended for developers of Mercurial.

Note:

This page appears to contain material that is no longer relevant. Please help improve this page by updating its content.

Subrepo Work

Proposed improvements and to-do list for subrepos.

1. Internals

The .hgsub format uses the hgrc config format. It reserves a source prefix of [ for future expansion (see below). Future expansion may also used named sections in this file.

The .hgsubstate format is similar to the tags format, in the form <revision><space><path>. This file is not intended to be hand-edited, but will accept any identifier format that Mercurial accepts. It is also automatically merged when necessary. It is separated from .hgsub to keep automatic updates from muddling that file and to keep .hgsub's history tidy. The combined state can be viewed with hg debugsub.

Internally, subrepo states are represented as a hash of path to (source, revision) pairs that combine the elements of the above two files. There is also a new subrepo object type that exposes a limited set of operations on a subrepo. Subrepos can be traversed like this:

# check whether subrepos are dirty
c = repo['tip']
for s in c.substate:
    subrepo = c.sub[s]
    print s, subrepo.dirty()

1.1. Wanted Feature per Command

The following repo structure will be used to illustrate the expected behavior of the commands:

A
  B
  C
    D

So A has two subrepos (B and C), and C has D as a subrepo.

1.1.1. add

Add have an argument -S which makes it behave transparently with respect to subrepos. For example, supposte the user is in repo A and executes this command:

$ hg add -S

The effect is that files are added in repos A, B, C and D. Suppose the user executes this command when his current directory is A:

$ hg add -S C/D/foo.c

The effect is that foo.c is added in repo D.

1.1.2. archive

Archive should archive a revision in the current repo and recurse through subrepos, archiving an appropriate revision in each.

1.1.3. branch

Current behaviour: A new branch is created in repo A

$ hg branch newBranch

A change is made and committed in repo A. The changeset has been added to newBranch. A change is made and committed in repo B, C or D. The changeset has been added to default branch in the subrepo.

Should the creation of branches in subrepos be a concious decision?

$ hg branch -r newBranch

or should the branch command recusively create branches by default?

Desired behaviour: A new branch is created in repo A

$ hg branch newBranch

A change is made and committed in repo A. The changeset has been added to newBranch. A change is made and committed in repo B, C or D. The changeset has been added to newBranch in the subrepo.

1.1.4. bundle/unbundle

Bundle should bundle changesets in the current repo and recurse through subrepos, bundling appropriate changesets in each. It is unclear if these bundles should be wrapped up into a super-bundle (a zip- or tar-file?)

Unbundle should handle whatever bundle produces.

1.1.5. commit

Commit currently recurses by default. Its default behavior should be consistent with the behavior of other commands regarding subrepo handling.

2.0 changed this such that commit now defaults to aborting if subrepos are dirty. This allows it to be more consistent with other commands regarding subrepo handling.

1.1.6. diff

Diff should recurse through subrepos. It is important that the correct revisions are looked up in each subrepo. For example, one should be able to execute hg diff -r 3 -r 10 in repo A and, according to the .hgsubstates in A and C, appropriate diffs will be done in each of the subrepos.

1.1.7. export/import

Export should export changesets in the current repo and recurse through subrepos, exporting appropriate changesets from each. Import should handle whatever export produces.

1.1.8. incoming/outgoing

Incoming and outgoing can recourse into sub repositories by using the -S/--subrepo argument. The changesets are shown for each repository.

Currently, outgoing's behavior is inconsistent with push: the latter will push changes that outgoing does not show.

1.1.9. log

The log for subrepos should be interleaved with the log for the parent, according to the .hgsubstate. Options such as "limit" are applied per repo.

1.1.10. pull

Pull should recurse into subrepos. That is, it should simply do a pull in each subrepo recursively to bring them up to date. This is not to say that they should be updated at the same time. The current mechanism to do that, by updating the parent if fine when one want to update to new revisions of the subrepos based on the state of the parent's .hgsubstate

1.1.11. push

Push already recurses by default. The behavior should be consistent with other commands with respect to subrepo handling.

1.1.12. rollback

Rollback should recurse into subrepos. Do the .hgsubstate file need a revert as well since it is modified by commit?

1.1.13. root

Should not recurse, but can be extended to report the top-level repository, not just the root of the current repositroy. With this extension, aliases for other commands could be easily defined (following the UNIX approach of simplicity):

[alias]
commit-all = !hg -R $(hg root --parent) commit --sub
status-all = !hg -R $(hg root --parent) status --sub
diff-all = !hg -R $(hg root --parent) diff --sub
in-all = !hg -R $(hg root --parent) incoming --sub
out-all = ...

Probably it won't be much more effort to allow the --parent option on the listed commands directly - avoiding the alias.

1.1.14. status

Status should recurse into subrepos. Subrepos containing uncommitted changes should be listed, and a subrepo that does not have uncommitted changes but is not at the revision specified in its parent repo should also be listed.

For example, suppose the user wants to see the status starting in A. Repo B contains no uncommitted changes but is not at the revision specified in A's .hgsubstate. Repos C and D contain uncommitted changes. The output could look like this

hg status
M B/
M C/
M C/file1.txt
M C/D/
A C/D/file2.txt

1.1.15. tip

Not yet reviewed: http://mercurial.markmail.org/thread/lcbbb7xkm6qjnwhq

When the -S/--subrepos argument is used for the tip command. It recurses through the sub repositories and shows the tip of eachs. The templates support {subrepo} to show the path of the subrepo, relative to the main repo.

hg tip -S

1.1.16. update

Update currently recurses by default. The default behavior should be consistent with other commands with respect to subrepo handling.

1.2. To Do


CategoryDeveloper CategoryNewFeatures

SubrepoWork (last edited 2012-11-06 23:42:03 by abuehl)