go-git - A highly extensible Git implementation in pure Go.

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	packfile: improve Index memory representation to be more compact	David Symonds	2018-05-30	2	-23/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of using a map for offset indexing, use a sorted slice. Binary searching is fast, and a slice is much more compact. This has a negligible hit on speed, but has a significant impact on memory usage, especially for larger repos. benchmark old ns/op new ns/op delta BenchmarkIndexConstruction-12 15506506 14056098 -9.35% benchmark old allocs new allocs delta BenchmarkIndexConstruction-12 60764 60385 -0.62% benchmark old bytes new bytes delta BenchmarkIndexConstruction-12 4318145 3913169 -9.38% Signed-off-by: David Symonds <dsymonds@golang.org>
*	*: Use CheckClose with named returns	Javi Fontan	2018-03-27	2	-4/+4
\| \| \| \| \| \| \| \|	Previously some close errors were losts. This is specially problematic in go-git as lots of work is done here like generating indexes and moving packfiles. Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	*: skip time consuming tests	Máximo Cuadros	2018-03-21	1	-0/+9
\| \| \| \|	Signed-off-by: Máximo Cuadros <mcuadros@gmail.com>
*	plumbing: format/packfile, add SaveOriginalMetadata function	Javi Fontan	2018-02-09	2	-5/+9
\| \| \| \|	Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	plumbing: format/packfile, fix panic retrieving object hash.	Javi Fontan	2018-02-09	3	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In some cases the original data is not saved before it is cleaned and forces a panic when it's needed. The change adds ObjectToPack.CleanOriginal to be used to clean original object instead of: object.Original = nil Now when the Original data is freed because it's no longer in the pack window a SetOriginal call is done to make sure that Size, Hash and Size data is not lost. Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	plumbing: format/packfile, check nil objects in ObjectToPack	Javi Fontan	2018-01-25	2	-8/+12
\| \| \| \| \| \| \|	SetOriginal now skips setting resolved values if the provided object is nil. BackToOriginal also skips nil Original objects. Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	plumbing: format/packfile, fix crash with cycle deltas	Javi Fontan	2018-01-24	4	-1/+51
\| \| \| \| \| \| \| \| \| \| \| \|	Resolving cycles relied on ObjectToPack objects having Original. This is no longer true with the changes from #720. This commit changes: * Save original type, hash and size in ObjectToPack * Use SetObject to set both Original and resolved type, hash and size * Restore original object before using BackToOriginal (cycle resolution) * Update encoder test to check this case Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	plumbing: packfile, Add crc check to scanner test.	Javi Fontan	2018-01-21	1	-4/+75
\| \| \| \|	Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	plumbing: packfile, Add a buffer to crc writer.	Javi Fontan	2018-01-21	1	-9/+31
\| \| \| \| \| \| \| \| \| \| \|	crc update with block smaller than 16 bytes uses a slower version of the function. ReadByte is heavily used by zlib inflate so most of the time crc is update byte by byte. A new Flush method is added to the scanner to flush this crc writer cache. It is only called when the Scanner reader is a teeReader. Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	Modify cache to delete more than one item to free space	Javi Fontan	2018-01-16	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	The previous version could only delete the oldest used object. If the object to cache was bigger than the space freed it could not be added. Also the decoder adds bases to the cache when they are needed. This change increases the speed creating indexes 2x. Signed-off-by: Javi Fontan <jfontan@gmail.com>
*	Clean reconstructed objects outside pack window	Javi Fontan	2018-01-11	1	-13/+19
\| \| \| \| \| \| \| \| \| \| \|	Object walk reconstructs delta objects but these are not cleaned up after they got out the pack window. Without this change all reconstructed objects reside in memory. restoreOriginal call is moved before calling Size(). Now we can not guarantee that the object is already undeltified. Signed-off-by: Javi Fontan <javier@sourced.tech>
*	Merge pull request #698 from jfontan/improvement/use-decoder-cache	Máximo Cuadros	2017-12-20	2	-17/+39
\|\ \| \| \| \|	plumbing: cache, enforce the use of cache in packfile decoder
\| *	Make DeltaBaseCache private	Javi Fontan	2017-12-20	1	-6/+13
\| \| \| \| \| \| \| \|	Signed-off-by: Javi Fontan <jfontan@gmail.com>
\| *	Fix typo and documentation of NewDecoderForType	Javi Fontan	2017-12-20	1	-3/+3
\| \| \| \| \| \| \| \|	Signed-off-by: Javi Fontan <jfontan@gmail.com>
\| *	Enforce the use of cache in packfile decoder	Javi Fontan	2017-12-20	2	-12/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Decoder object can make use of an object cache to speed up processing. Previously the only way to specify it was changing manually the struct generated by NewDecodeForFile. This lead to some instances to be created without it and penalized performance. Now the cache should be explicitly passed to the constructor function. NewDecoder now creates objects with a cache using the default size. A new helper function was added to create cache objects with the default size as this becomes a common task now: cache.NewObjectLRUDefault() Signed-off-by: Javi Fontan <jfontan@gmail.com>
* \|	Improve delta reutilization	Antonio Jesus Navarro Perez	2017-12-20	5	-29/+152
\|/ \| \| \| \| \| \| \| \| \| \|	- Remove wrong 'if' on delta selector that causes poor delta reutilizations - packfile.Encoder now can write deltas and objects in a non specific order - ObjectToPack now saves the Offset on the packfile to be able to obtain base offset in a recursive manner and write them before the delta itself - Added encoder test to check cyclic delta chains - Check the output packfile hash in all encoder tests Signed-off-by: Antonio Jesus Navarro Perez <antnavper@gmail.com>
*	all: gofmt -s	ferhat elmas	2017-11-30	2	-2/+2
\|
*	all: simplification	ferhat elmas	2017-11-29	4	-14/+5
\| \| \| \| \| \| \| \| \| \|	- no length for map initialization - don't check for boolean/error return - don't format string - use string method of bytes buffer instead of converting bytes to string - use `strings.Contains` instead of `strings.Index` - use `bytes.Equal` instead of `bytes.Compare`
*	update to go-billy.v4 and go-git-fixtures.v3	Máximo Cuadros	2017-11-23	4	-6/+5
\| \| \| \|	Signed-off-by: Máximo Cuadros <mcuadros@gmail.com>
*	format: packfile fix DecodeObjectAt when Decoder has type	Máximo Cuadros	2017-11-19	2	-4/+29
\| \| \| \|	Signed-off-by: Máximo Cuadros <mcuadros@gmail.com>
*	Merge pull request #631 from keybase/strib/use-bytes-pool-for-diffs	Máximo Cuadros	2017-10-31	1	-4/+13
\|\ \| \| \| \|	packfile: use buffer pool for diffs
\| *	packfile: use buffer pool for diffs	Jeremy Stribling	2017-10-30	1	-4/+13
\| \|
* \|	packfile: delete index maps from memory when no longer needed	Jeremy Stribling	2017-10-30	1	-0/+6
\|/ \| \| \|	This helps keep memory usage stable while calculating deltas.
*	config: support a configurable, and turn-off-able, pack.window	Jeremy Stribling	2017-09-11	5	-37/+89
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	One use of go-git is to transfer git data from a non-standard git repo (not stored in a file system, for example) to a "remote" backed by a standard, local .git repo. In this scenario, delta compression is not needed to reduce transfer time over the "network", because there is no network. The underlying storage layer has already taken care of the data tranfer, and sending the objects to local .git storage doesn't require compression. So this PR gives the user the option to turn off compression when it isn't needed. Of course, this results in a larger, uncompressed local .git repo, but the user can then run git gc or git repack on that repo if they care about the storage costs. Turning the pack window to 0 on reduces total push time of a 36K repo by 50 seconds (out of a pre-PR total of 3m26s).
*	packfile: small optimizations for findMatch and matchLength	Miguel Molina	2017-09-07	2	-16/+38
\| \| \| \|	Signed-off-by: Miguel Molina <miguel@erizocosmi.co>
*	packfile: parallelize deltification of objects in groups	Miguel Molina	2017-09-07	2	-21/+31
\| \| \| \|	Signed-off-by: Miguel Molina <miguel@erizocosmi.co>
*	packfile: use a modified version of JGit DeltaIndex and DeltaIndexScanner	Miguel Molina	2017-09-06	3	-40/+286
\| \| \| \|	Signed-off-by: Miguel Molina <miguel@erizocosmi.co>
*	packfile: slightly haster hash function for chunk-offset index key	Miguel Molina	2017-09-06	2	-11/+32
\| \| \| \|	Signed-off-by: Miguel Molina <miguel@erizocosmi.co>
*	packfile: reuse delta indexes when possible	Miguel Molina	2017-09-06	3	-10/+22
\| \| \| \|	Signed-off-by: Miguel Molina <miguel@erizocosmi.co>
*	packfile: improve performance a little by reducing gc pressure	Miguel Molina	2017-09-04	4	-6/+33
\| \| \| \|	Signed-off-by: Miguel Molina <miguel@erizocosmi.co>
*	plumbing: use sliding window in delta calculations, like git CL	Jeremy Stribling	2017-08-28	2	-1/+23
\| \| \| \| \| \| \| \| \| \|	This sets a default sliding window of 10 for the delta calculation, just like git CL: https://git-scm.com/docs/git-pack-objects#git-pack-objects---windowltngt For a big-ish repo with 35K objects (17K commits), this reduced the time for calling `deltaSelection.walk` during a push from more than 14 minutes to about a minute.
*	plumbing: add IsDelta method to ObjectType	Santiago M. Mola	2017-07-27	1	-6/+2
\| \| \| \| \|	ObjectType.IsDelta is a convenience function to match both offset and reference delta types.
*	storage: reuse deltas from packfiles	Santiago M. Mola	2017-07-27	5	-98/+293
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* plumbing: add DeltaObject interface for EncodedObjects that are deltas and hold additional information about them, such as the hash of the base object. * plumbing/storer: add DeltaObjectStorer interface for object storers that can return DeltaObject. Note that calls to EncodedObject will never return instances of DeltaObject. That requires explicit calls to DeltaObject. * storage/filesystem: implement DeltaObjectStorer interface. * plumbing/packfile: packfile encoder now supports reusing deltas that are already computed (e.g. from an existing packfile) if the storage implements DeltaObjectStorer. Reusing deltas boosts performance of packfile generation (e.g. on push).
*	test: improve packfile.Encoder tests	Santiago M. Mola	2017-07-27	1	-54/+68
\| \| \| \| \| \| \|	* Improve checks for encode/decode. * Make it easier to extend this test with more storage backends.
*	storage/filesystem: reuse delta cache	Santiago M. Mola	2017-07-27	1	-10/+22
\| \| \| \| \|	Reuse delta base object cache for packfile decoders across multiple instances.
*	plumbing/cache: use more explicit interface	Santiago M. Mola	2017-07-27	1	-7/+6
\| \| \| \| \| \|	* renamed Add to Put * Get returns a second bool value to indicate if there was hit or miss.
*	packfile: create packfile.Index and reuse it	Santiago M. Mola	2017-07-26	4	-82/+279
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There was an internal type (i.e. storage/filesystem.idx) to use as in-memory index for packfiles. This was not convenient to reuse in the packfile. This commit creates a new representation (format/packfile.Index) that can be converted to and from idxfile.Idxfile. A packfile.Index now contains the functionality that was scattered on storage/filesystem.idx and packfile.Decoder's internals. storage/filesystem now reuses packfile.Index instances and this also results in higher cache hit ratios when resolving deltas.
*	Merge pull request #491 from smola/error-checks	Máximo Cuadros	2017-07-19	2	-21/+22
\|\ \| \| \| \|	*: add more IO error checks
\| *	packfile: check close error in UpdateObjectStorage	Santiago M. Mola	2017-07-19	1	-8/+14
\| \|
\| *	packfile: check zlib reader closing error	Santiago M. Mola	2017-07-19	1	-13/+8
\| \|
* \|	packfile: Avoid panics patching corrupted deltas	Antonio Jesus Navarro Perez	2017-07-19	2	-16/+85
\|/
*	*: upgrade to go-billy.v3, merge	Máximo Cuadros	2017-06-18	1	-1/+1
\|
*	packfile: A copy operation cannot be bigger than 64kb	Antonio Jesus Navarro Perez	2017-06-01	2	-1/+35
\| \| \| \|	More info here: https://github.com/git/git/blob/f7466e94375b3be27f229c78873f0acf8301c0a5/diff-delta.c#L428
*	format/packfile: improve binary delta algorithm	Antonio Jesus Navarro Perez	2017-05-24	2	-441/+94
\| \| \| \|	Implemented algorithm described in "File System Support for Delta Compression" paper, from "Joshua P. MacDonald".
*	format/packfile: fix bug when the delta depth is equals to 50	Antonio Jesus Navarro Perez	2017-05-08	2	-0/+10
\|
*	format/packfile: rename Seek to SeekFromStart	Santiago M. Mola	2017-04-26	3	-6/+8
\| \| \| \| \| \|	This has signature and behavior distinct from io.Seeker, go vet complains about this, so we change it to a different name to avoid confusion.
*	use go-billy.v2 version	Antonio Jesus Navarro Perez	2017-03-10	1	-1/+1
\|
*	project: move imports from srcd.works to gopkg.in	Antonio Jesus Navarro Perez	2017-03-07	14	-28/+28
\| \| \| \|	To be able to fix #261 we will move again to gopkg.in before v4 stable release.
*	plumbing: improve documentation (Fix #242)	Antonio Jesus Navarro Perez	2017-02-23	3	-5/+11
\|
*	cache: move package to plumbing	Antonio Jesus Navarro Perez	2017-02-21	1	-1/+1
\| \| \| \|	Because cache package is only intended to be used at internal level, we move it to the plumbing package.