Managing large files such as audio, video and graphics files has always been one of the shortcomings of Git. The general recommendation is to not have Git repositories larger than 1GB to preserve performance.
GitSwarm already supports managing large files with git annex (EE only). However, in certain environments it is not always convenient to use different commands for large files and regular ones.
Git LFS makes this simpler for the end user by removing the requirement to learn new commands.
The Git LFS client communicates with the GitSwarm server over HTTPS. It uses HTTP Basic Authentication to authorize client requests. Once a request is authorized, the Git LFS client receives instructions on where to fetch the large file from or where to push it to.
Documentation for GitSwarm instance administrators is in the LFS administration documentation.
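As a rough illustration of what HTTP Basic Authentication puts on the wire, the sketch below builds the header by hand. The user name and password are hypothetical placeholders; the Git LFS client does this for you.

```shell
# Hypothetical credentials, for illustration only.
user="user"
password="pass"

# HTTP Basic Authentication sends base64("user:password") in a request header.
token=$(printf '%s' "${user}:${password}" | base64)
auth_header="Authorization: Basic ${token}"

echo "${auth_header}"
# prints: Authorization: Basic dXNlcjpwYXNz
```

Base64 is an encoding, not encryption, which is why the credentials should only travel over an HTTPS connection.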
Important: Git LFS is not compatible with Helix mirroring, and is currently not supported.
Let's take a look at the workflow for checking large files into your Git repository with Git LFS. For example, if you want to upload a very large file and check it into your Git repository:
git clone git@gitswarm.example.com:group/project.git
git lfs install # initialize Git LFS
git lfs track "*.iso" # select the file extensions that you want to treat as large files
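Tracking a pattern is recorded in the repository's .gitattributes file. The sketch below writes the line that current Git LFS releases are assumed to add for "*.iso" and asks Git which filter applies to a path. It uses a throwaway repository and plain Git only, so Git LFS itself does not need to be installed to try it.

```shell
# Work in a throwaway repository so nothing real is touched.
repo=$(mktemp -d)
cd "${repo}"
git init -q

# Assumption: this is the line that `git lfs track "*.iso"` records.
echo '*.iso filter=lfs diff=lfs merge=lfs -text' > .gitattributes

# Ask Git which filter applies to a matching and a non-matching path.
iso_filter=$(git check-attr filter -- debian.iso)
txt_filter=$(git check-attr filter -- README.txt)
echo "${iso_filter}"
echo "${txt_filter}"
```

Because the pattern lives in .gitattributes, commit that file along with the large files so that other clones track the same extensions.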
Once a file extension is marked for tracking as an LFS object, you can use Git as usual without repeating the track command for other files with the same extension:
cp ~/tmp/debian.iso ./ # copy a large file into the current directory
git add . # add the large file to the project
git commit -am "Added Debian iso" # commit the file metadata
git push origin master # push the git repo and large file to the GitSwarm server
Cloning the repository works the same as before. Git automatically detects the LFS-tracked files and clones them via HTTP. If you ran the git clone command with an SSH URL, you have to enter your GitSwarm credentials for HTTP authentication.
git clone git@gitswarm.example.com:group/project.git
If you have already cloned the repository and want to get the latest LFS objects from the remote repository, e.g. from branch master:
git lfs fetch origin master
There are a couple of reasons why an error can occur when you push or fetch LFS objects:
You do not have permission to push to the project or fetch from the project. Check your permissions.
The LFS object you are trying to push or fetch is no longer available to the project, probably because it was removed from the server.
Git LFS records failures in a log file. To view the most recent log, run the following from the project directory:
git lfs logs last
If the status error 501 is shown, it is for one of the following reasons:
Git LFS support is not enabled on the GitSwarm server. Ask your GitSwarm administrator why Git LFS is not enabled. See the LFS administration documentation for instructions on how to enable LFS support.
The Git LFS client version is not supported by the GitSwarm server. Check your Git LFS version with git lfs version. Check the Git config of the project for traces of the deprecated API with git config -l. If batch = false is set in the config, remove the line and try updating your Git LFS client. Only version 1.0.1 and newer are supported.
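The version check can be scripted. The sketch below is one possible approach, not an official tool: version_at_least is a hypothetical helper, it assumes a sort that supports -V (as in GNU coreutils), and it assumes that git lfs version prints a string starting with "git-lfs/" followed by the version number.

```shell
# Minimum Git LFS client version named in the text above.
min_version="1.0.1"

# Succeeds when dotted version $1 is at least $2.
# Assumption: `sort -V` (version sort) is available.
version_at_least() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Assumption: `git lfs version` prints something like "git-lfs/1.0.1 (...)".
if command -v git-lfs >/dev/null 2>&1; then
    installed=$(git lfs version | sed -n 's#^git-lfs/\([0-9][0-9.]*\).*#\1#p')
    if version_at_least "${installed}" "${min_version}"; then
        echo "Git LFS ${installed} is new enough"
    else
        echo "Git LFS ${installed} is older than ${min_version}; please update"
    fi
else
    echo "git-lfs is not installed"
fi
```

A version sort is used instead of a plain string comparison so that, for example, 1.10.0 is correctly treated as newer than 1.9.0.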
If you push an LFS object to a project and receive an error similar to: Post <URL>/info/lfs/objects/batch: dial tcp IP: getsockopt: connection refused, the LFS client is trying to reach GitSwarm through HTTPS. However, your GitSwarm instance is being served over HTTP.
This behaviour is caused by Git LFS using HTTPS connections by default when lfs.url is not set in the Git config.
To prevent this from happening, set the LFS URL in the project's Git config:
git config --add lfs.url "http://gitswarm.example.com/group/project.git/info/lfs"
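To confirm the setting took effect, you can read the value back. The sketch below runs in a throwaway repository and reuses the example host and project path from the command above; substitute your own instance and project.

```shell
# Work in a throwaway repository so nothing real is touched.
repo=$(mktemp -d)
cd "${repo}"
git init -q

# Example project URL from the text; replace with your own.
project_url="http://gitswarm.example.com/group/project.git"

# Following the example above, the LFS endpoint is the project URL plus /info/lfs.
git config lfs.url "${project_url}/info/lfs"

# Read the value back to confirm what the LFS client will use.
lfs_url=$(git config --get lfs.url)
echo "${lfs_url}"

case "${lfs_url}" in
    http://*)  echo "LFS requests use plain HTTP" ;;
    https://*) echo "LFS requests use HTTPS" ;;
esac
```

This sketch omits --add so that running it twice replaces the value. With --add, each run appends another lfs.url line to the config.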
Because Git LFS uses HTTP Basic Authentication to authenticate the user on every push, for every object, the user's HTTPS credentials are required.
By default, Git supports remembering the credentials for each repository you use. This is described in the Git credentials man pages.
For example, you can tell Git to remember the password for a period of time in which you expect to push the objects:
git config --global credential.helper 'cache --timeout=3600'
This remembers the credentials for an hour, after which Git operations require re-authentication.
If you are using OS X, you can use osxkeychain to store and encrypt your credentials. For Windows, you can use wincred or Microsoft's Git Credential Manager for Windows.
More details about the various methods of storing user credentials can be found in the Git Credential Storage documentation.
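If you set up several machines, the choice of helper can be scripted. The sketch below only prints a suggestion and does not change any configuration. pick_credential_helper is a hypothetical helper name, and the assumption is that Windows shells such as Git Bash report a uname -s value starting with MINGW, MSYS or CYGWIN.

```shell
# Map an OS name (as printed by `uname -s`) to a credential helper.
# Assumption: Windows shells report MINGW*, MSYS* or CYGWIN*.
pick_credential_helper() {
    case "$1" in
        Darwin)               echo "osxkeychain" ;;
        MINGW*|MSYS*|CYGWIN*) echo "wincred" ;;
        *)                    echo "cache --timeout=3600" ;;
    esac
}

helper=$(pick_credential_helper "$(uname -s)")
echo "Suggested: git config --global credential.helper '${helper}'"
```

On other systems the sketch falls back to the one-hour cache helper shown earlier.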