3630 – hg convert crashes on largefiles

Bug 3630 - hg convert crashes on largefiles

Summary: hg convert crashes on largefiles

Status:	RESOLVED FIXED

Alias:	None

Product:	Mercurial
Classification:	Unclassified
Component:	convert (show other bugs)
Version:	unspecified
Hardware:	All All

Importance:	normal bug
Assignee:	Bugzilla

URL:
Keywords:

Depends on:
Blocks:

Reported:	2012-09-18 09:32 UTC by Christian Tismer
Modified:	2012-12-22 18:53 UTC (History)
CC List:	6 users (show)

See Also:
Python Version:	---

Attachments
Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description Christian Tismer 2012-09-18 09:32 UTC

When converting a hg repos that uses the largefiles extension,
The following happens:

- the conversion starts and runs for a while

- it crashes because of wrong content of the (dummy) 00changelog.i

Then a partially repos is left in some intermediate state.

Resolution (hard):
- write largefiles support

Resolution (easy):
- check if largefiles is active and bail out with an error message.

cheers - Chris

Comment 1 kiilerix 2012-09-18 15:19 UTC

AFAICS it fails decently ... but with room for improvement:

--- a/tests/test-lfconvert.t
+++ b/tests/test-lfconvert.t
@@ -4,6 +4,7 @@
   > share =
   > graphlog =
   > mq =
+  > convert=
   > [largefiles]
   > minsize = 0.5
   > patterns = **.other
@@ -270,3 +271,16 @@
   pass
 
   $ cd ..
+
+  $ hg convert largefiles-repo
+  assuming destination largefiles-repo-hg
+  initializing destination largefiles-repo-hg repository
+  scanning source...
+  sorting...
+  converting...
+  6 add large, normal1
+  abort: No such file or directory: $TESTTMP/largefiles-repo-hg/.hglf/large
+  [255]
+  $ hg -R largefiles-repo-hg glog --template "{rev}:{node|short}  {desc|firstline}\n"
+  o  0:d4892ec57ce2  add large, normal1


If you see a real crash then please show the actual output ... and preferably also a full test case.

Comment 2 Christian Tismer 2012-09-30 18:15 UTC

I will try to find a traceback.

My error messy was really a complaint about 00changelog.i
which exists in the place where normal repos expect it, but with a syntax changed
for largefiles (just a "wrong" message note), while the true, new
00changelog.i is in a different location.

The dummy file shows

minimax:didoca tismer$ hexdump -C .hg/00changelog.i 
00000000  00 00 00 02 20 64 75 6d  6d 79 20 63 68 61 6e 67  |.... dummy chang|
00000010  65 6c 6f 67 20 74 6f 20  70 72 65 76 65 6e 74 20  |elog to prevent |
00000020  75 73 69 6e 67 20 74 68  65 20 6f 6c 64 20 72 65  |using the old re|
00000030  70 6f 20 6c 61 79 6f 75  74                       |po layout|
00000039

This is a syntax that hg convert does not expect.
And it is not aware of this possibility.

The real one is in

.hg/store/00changelog.i

I think, hg convert should at least probe this and bail out with an error message
that it cannot handle this.

Comment 3 Matt Mackall 2012-09-30 18:29 UTC

If Mercurial ever tries to read that file, you've either

a) got an ancient version of Mercurial
b) have a missing or damaged 'requires' file

Comment 4 Christian Tismer 2012-09-30 19:08 UTC

Not accepted:

- Mercurial was the latest official version at the time of reporting

- I was never asked by anything to have a 'requires' file

and in any case, this argumentation makes little sense.
Even if I made errors (which I seem didn't), an application
should never crash with a random error message, but check
if it will be able to run
instead of running a bit, then crashing and leaving the new repos in
some unusable state.
In any case, to recover gracefully is a must (talking as a python core developer).

I don't see the point in your argument.
I was hit by this bug, and two colleagues on a different platform as well.
What are you trying to defend, instead of being happy that somebody cares?

Your arguments "this is either or" seems to be incorrect.

No offense, just a reaction to absolute statements.

cheers - Chris

Comment 5 Matt Mackall 2012-09-30 21:41 UTC

We need more information.

Do you have a .hg/requires file or not?

Attach a traceback.

Comment 6 Christian Tismer 2012-10-01 16:05 UTC

I still have the unconverted repos on my machine.

Yes, there is a requires file with this content:

largefiles
revlogv1
fncache
store
dotencode


There is no traceback, just an abort message.
First and last couple of lines of the log:

macgyver:didoca.off tismer$ hg convert --filemap private/private.filemap.txt . ../tmp.private
initializing destination ../tmp.private repository
scanning source...
sorting...
converting...
350 blah
349 initial checkin for reading IBF- and CDB-/-RDB-files, some refactoring needed
348 Veit's Skizze
347 merge
 (... lines skipped)
17 changed language of the documentation to german
16 converted file from latin1 to utf-8
15 Switched template editor to configuration
14 Added links
13 added description of files which has to be moved to the private repository
12 moved files to private directory
transaction abort!
rollback completed
abort: integrity check failed on 00changelog.i:67!
macgyver:didoca.off tismer$ 


Unfortunately I cannot send a copy of the repos, because it contains
private data (actually the reason why we splitted).

cheers - chris

Comment 7 Matt Mackall 2012-10-01 19:57 UTC

Ok, then .hg/00changelog.i is completely unrelated... and is not being read.
It's a placeholder to cause old versions of Mercurial to not attempt to read repos.

The interesting file is probably .hg/store/00changelog.i ... but it's probably empty again by the time the rollback is complete.

Comment 8 andcycle-bz.selenic.com 2012-10-02 06:31 UTC

I just need to rescue my corrupted repository then I encounter this this problem,
actually it's not crash,
here is my example to reproduce this issue


51G demo # mkdir orig; pushd orig; hg init; truncate -s 1m a; hg add a; hg commit -m 'orig'; popd;
/tmp/demo/orig /tmp/demo
/tmp/demo
51G demo # hg lfconvert -s 1 orig lfrepo;
initializing destination lfrepo
51G demo # hg convert lfrepo normrepo;
initializing destination normrepo repository
scanning source...
sorting...
converting...
0 orig
abort: No such file or directory: /tmp/demo/normrepo/.hglf/a
51G demo #

Comment 9 Matt Mackall 2012-10-19 14:34 UTC

Closing, gone cold. Last message unrelated to this bug.

Comment 10 Matt Harbison 2012-10-23 00:26 UTC

Reopening.  The last comment may be about 3519, which is fixed, but this is definitely a problem.  The changelog.i file seems irrelevant.  Patches submitted here:

http://www.selenic.com/pipermail/mercurial-devel/2012-October/045620.html

Comment 11 HG Bot 2012-10-27 18:14 UTC

Fixed by http://selenic.com/repo/hg/rev/92bbb21d4b13
Matt Harbison <matt_harbison@yahoo.com>
largefiles: respect the rev when reading standins in copytostore() (issue3630)

When the rev isn't specified, the standin for the working copy gets read.  But
convert doesn't update the working copy for each cset it processes, so there is
no standin and the 'hg convert' would abort complaining about the standin being
missing.

Note that if the largefile is not in the user cache, 'hg convert' complains
about the largefile itself missing from the destination repo.

(please test the fix)