A sidecar app which clones a git repo and keeps it in sync with the upstream.
Go to file
Tim Hockin a2c685b4f8 update RELEASING 2022-03-17 12:05:06 -07:00
_test_tools Make sure all code files have headers 2021-05-27 08:45:36 -07:00
build Use vendor and build automatic build cache 2019-03-25 08:54:44 -07:00
cmd/git-sync Merge pull request #499 from sed-i/feature/support_repo_change 2022-03-10 17:08:08 -08:00
demo Update deployment to use new version which is multi-arch 2020-01-07 22:21:00 +00:00
docs Add GIT_SYNC_ROOT change for non-root user in docs/ssh (#354) 2021-03-24 21:55:29 -07:00
pkg Set $GITSYNC_HASH in exechook 2022-02-06 16:26:04 -08:00
vendor Bump go.mod to 1.17 2022-01-28 20:10:20 -08:00
.gitallowed Move sync-hook to after symlink flip 2021-04-05 17:04:44 -07:00
.gitignore Copy all licenses into the container image 2021-03-29 16:41:57 -07:00
CONTRIBUTING.md Change CONTRIBUTING.md to CNCF CLA 2020-12-23 10:57:36 -08:00
Dockerfile.in Change libcurl workaround to be less precise 2021-05-29 09:17:16 -07:00
LICENSE Initial commit 2016-08-20 14:09:57 -07:00
Makefile update the base image to fix vulnerabilities 2022-03-10 16:18:56 -08:00
OWNERS Add Janet and Nan as approvers 2021-06-23 09:57:02 -07:00
README.md README: Break flags into sections 2022-03-08 09:06:57 -08:00
RELEASING.md update RELEASING 2022-03-17 12:05:06 -07:00
SECURITY.md Add SECURITY.md 2020-07-27 16:00:54 -06:00
SECURITY_CONTACTS Update embargo doc link in SECURITY_OWNERS and changes PST to PSC 2019-03-08 10:23:48 -07:00
askpass_git.sh Make sure all code files have headers 2021-05-27 08:45:36 -07:00
code-of-conduct.md Update code-of-conduct.md 2017-12-20 13:32:20 -05:00
go.mod Bump go.mod to 1.17 2022-01-28 20:10:20 -08:00
go.sum bump to logr v1.0.0-rc1 2021-06-13 10:57:37 -07:00
slow_git_clone.sh Check for rev to be present after a fetch 2021-06-23 14:56:39 -04:00
slow_git_fetch.sh Check for rev to be present after a fetch 2021-06-23 14:56:39 -04:00
test_e2e.sh reorder test cases 2022-03-11 08:41:01 -08:00
test_exechook_command.sh Set $GITSYNC_HASH in exechook 2022-02-06 16:26:04 -08:00
test_exechook_command_fail.sh cmd,hook and logging package 2021-08-16 12:40:27 +03:00
test_exechook_command_fail_with_sleep.sh Ensure web/exec hooks complete in --one-time 2022-01-19 12:01:15 -08:00
test_exechook_command_with_sleep.sh Ensure web/exec hooks complete in --one-time 2022-01-19 12:01:15 -08:00
tools.go Add infrastructure for build tools 2021-03-29 16:35:13 -07:00

README.md

git-sync

git-sync is a simple command that pulls a git repository into a local directory. It is a perfect "sidecar" container in Kubernetes - it can periodically pull files down from a repository so that an application can consume them.

git-sync can pull one time, or on a regular interval. It can pull from the HEAD of a branch, from a git tag, or from a specific git hash. It will only re-pull if the target of the run has changed in the upstream repository. When it re-pulls, it updates the destination directory atomically. In order to do this, it uses a git worktree in a subdirectory of the --root and flips a symlink.

git-sync can pull over HTTP(S) (with authentication or not) or SSH.

git-sync can also be configured to make a webhook call upon successful git repo synchronization. The call is made after the symlink is updated.

Building it

We use docker buildx to build images.

# build the container
make container REGISTRY=registry VERSION=tag
# build the container behind a proxy
make container REGISTRY=registry VERSION=tag \
    HTTP_PROXY=http://<proxy_address>:<proxy_port> \
    HTTPS_PROXY=https://<proxy_address>:<proxy_port>
# build the container for an OS/arch other than the current (e.g. you are on
# MacOS and want to run on Linux)
make container REGISTRY=registry VERSION=tag \
    GOOS=linux GOARCH=amd64

Usage

# make a directory (owned by you) for the volume
export DIR="/tmp/git-data"
mkdir -p $DIR

# run the container (as your own UID)
docker run -d \
    -v $DIR:/tmp/git \
    -u$(id -u):$(id -g) \
    registry/git-sync:tag \
        --repo=https://github.com/kubernetes/git-sync \
        --branch=master \
        --wait=30

# run an nginx container to serve the content
docker run -d \
    -p 8080:80 \
    -v $DIR:/usr/share/nginx/html \
    nginx

Webhooks

Webhooks are executed asynchronously from the main git-sync process. If a webhook-url is configured, when a change occurs to the local git checkout a call is sent using the method defined in webhook-method (default to POST). git-sync will continually attempt this webhook call until it succeeds (based on webhook-success-status). If unsuccessful, git-sync will wait webhook-backoff (default 3s) before re-attempting the webhook call.

Usage

A webhook is configured using a set of CLI flags. At its most basic only webhook-url needs to be set.

docker run -d \
    -v $DIR:/tmp/git \
    registry/git-sync:tag \
        --repo=https://github.com/kubernetes/git-sync \
        --branch=master \
        --wait=30 \
        --webhook-url="http://localhost:9090/-/reload"

Primary flags

Flag Environment Variable Default Description
--repo GIT_SYNC_REPO (required) the git repository to clone
--branch GIT_SYNC_BRANCH "master" the git branch to check out
--rev GIT_SYNC_REV "HEAD" the git revision (tag or hash) to check out
--root GIT_SYNC_ROOT "$HOME/git" the root directory for git-sync operations, under which --dest will be created
--dest GIT_SYNC_DEST "" the name of (a symlink to) a directory in which to check-out files under --root (defaults to the leaf dir of --repo)
--wait GIT_SYNC_WAIT 1 (second) the number of seconds between syncs
--one-time GIT_SYNC_ONE_TIME false exit after the first sync
--max-sync-failures GIT_SYNC_MAX_SYNC_FAILURES 0 the number of consecutive failures allowed before aborting (the first sync must succeed, -1 will retry forever after the initial sync)
-v (none) "" log level for V logs

Flags which control how git runs

Flag Environment Variable Default Description
--depth GIT_SYNC_DEPTH 0 use a shallow clone with a history truncated to the specified number of commits
--submodules GIT_SYNC_SUBMODULES recursive git submodule behavior: one of 'recursive', 'shallow', or 'off'
--timeout GIT_SYNC_TIMEOUT 120 the max number of seconds allowed for a complete sync
--sparse-checkout-file GIT_SYNC_SPARSE_CHECKOUT_FILE "" the location of an optional sparse-checkout file, same syntax as a .gitignore file.
--git-config GIT_SYNC_GIT_CONFIG "" additional git config options in 'key1:val1,key2:val2' format
--git-gc GIT_SYNC_GIT_GC "auto" git garbage collection behavior: one of 'auto', 'always', 'aggressive', or 'off'
--git GIT_SYNC_GIT "git" the git command to run (subject to PATH search, mostly for testing

Flags which configure authentication

Flag Environment Variable Default Description
--username GIT_SYNC_USERNAME "" the username to use for git auth
--password GIT_SYNC_PASSWORD "" the password or personal access token to use for git auth. (users should prefer --password-file or env vars for passwords)
--password-file GIT_SYNC_PASSWORD_FILE "" the path to password file which contains password or personal access token (see --password)
--ssh GIT_SYNC_SSH false use SSH for git operations
--ssh-key-file GIT_SSH_KEY_FILE "/etc/git-secret/ssh" the SSH key to use
--ssh-known-hosts GIT_KNOWN_HOSTS true enable SSH known_hosts verification
--ssh-known-hosts-file GIT_SSH_KNOWN_HOSTS_FILE "/etc/git-secret/known_hosts" the known_hosts file to use
--add-user GIT_SYNC_ADD_USER false add a record to /etc/passwd for the current UID/GID (needed to use SSH with a different UID)
--cookie-file GIT_COOKIE_FILE false use git cookiefile
--askpass-url GIT_ASKPASS_URL "" the URL for GIT_ASKPASS callback

Flags which configure hooks

Flag Environment Variable Default Description
--exechook-command GIT_SYNC_EXECHOOK_COMMAND "" the command executed with the syncing repository as its working directory after syncing a new hash of the remote repository. it is subject to the sync time out and will extend period between syncs. (doesn't support the command arguments)
--exechook-timeout GIT_SYNC_EXECHOOK_TIMEOUT 30 (seconds) the timeout for the sync hook command
--exechook-backoff GIT_SYNC_EXECHOOK_BACKOFF 3 (seconds) the time to wait before retrying a failed sync hook command
--webhook-url GIT_SYNC_WEBHOOK_URL "" the URL for a webhook notification when syncs complete
--webhook-method GIT_SYNC_WEBHOOK_METHOD "POST" the HTTP method for the webhook
--webhook-success-status GIT_SYNC_WEBHOOK_SUCCESS_STATUS 200 the HTTP status code indicating a successful webhook (-1 disables success checks to make webhooks fire-and-forget)
--webhook-timeout GIT_SYNC_WEBHOOK_TIMEOUT 1 (second) the timeout for the webhook
--webhook-backoff GIT_SYNC_WEBHOOK_BACKOFF 3 (seconds) the time to wait before retrying a failed webhook

Flags which configure observability

Flag Environment Variable Default Description
--http-bind GIT_SYNC_HTTP_BIND "" the bind address (including port) for git-sync's HTTP endpoint
--http-metrics GIT_SYNC_HTTP_METRICS true enable metrics on git-sync's HTTP endpoint
--http-pprof GIT_SYNC_HTTP_PPROF false enable the pprof debug endpoints on git-sync's HTTP endpoint

Other flags

Flag Environment Variable Default Description
--change-permissions GIT_SYNC_PERMISSIONS 0 the file permissions to apply to the checked-out files (0 will not change permissions at all)
--error-file GIT_SYNC_ERROR_FILE "" the name of a file into which errors will be written under --root