Git is a powerful and indispensable tool for software developers, DevOps engineers, and even those working in other fields that involve managing and collaborating on projects with evolving content. As a distributed version control system, Git allows individuals and teams to track changes to their work efficiently and effectively.
While Git is the most popular and widely used version control system today, it’s important to note that it’s not the only one available. Other popular options include:
- Subversion (SVN): A centralized version control system that was widely used before the rise of Git.
- Mercurial: Another distributed version control system known for its speed and simplicity.
- Perforce: A commercial version control system often used in large-scale software development projects.
However, Git has emerged as the dominant force in version control due to several key advantages:
- Distributed Nature: Git allows developers to have a complete copy of the repository on their local machines, enabling offline work and faster operations.
- Branching and Merging: Git excels at branching and merging, making it easy to experiment with new features and integrate changes from different developers.
- Large and Active Community: A vast and active community provides extensive documentation, support, and a wealth of third-party tools.
- Flexibility and Versatility: Git can be used for a wide range of projects, from small personal projects to large-scale enterprise applications.
This guide will provide you with a solid foundation in Git, covering essential commands, best practices, and common troubleshooting tips. Whether you’re a seasoned developer or just starting your coding journey, Git is a crucial skill to acquire.
I hope this revised introduction provides a more comprehensive overview of Git and its significance in the context of other version control systems.
Getting Started with Git
Before you can start using Git, you’ll need to install it on your system. The installation is quite simple and many OS provide it installed by default but in case you don’t have it already, just follow the quick steps below.
- Installation:
- On macOS: Git is often pre-installed. You can verify the installation by opening your terminal and typing
git --version
. If Git is not installed, you can easily install it using Homebrew:brew install git
. - On Linux: Git is typically available in your distribution’s package manager. For example, on Debian/Ubuntu, you can install it using
sudo apt-get install git
. - On Windows: Download and install the official Git for Windows installer from the Git website (https://git-scm.com/).
- On macOS: Git is often pre-installed. You can verify the installation by opening your terminal and typing
Initializing a Git Repository (git init
or git clone
)
You can start using Git with a project in two ways:
- Creating a new repository: If you’re starting a project from scratch, you can create a new repository from any folder on your computer. This is done via
git init
command. This creates a hidden.git
directory in your project, which contains all the necessary files for Git to track changes. - Clone an existing repository: When you are working on an existing project, it should be hosted on a remote server (e.g., GitHub, GitLab, Bitbucket) and you can clone it to your local computer to work. This is done using
git clone
that will be cover in more detail in the “Working with Remote Repositories” section.
What is a Repository?
A Git repository is essentially a collection of files and their histories. It acts as a central hub for all the changes made to a project’s code. Each repository contains a complete record of every modification, making it easy to track the evolution of your project and revert to previous versions if needed.
Checking Git Status (git status
)
The git status
command provides you with an overview of the current state of your repository. It tells you:
- Which files have been modified: Git tracks any changes you make to files within the repository.
- Which files are staged for the next commit: Files that have been added to the staging area using
git add
are marked as staged. - The current branch: Git uses branches to manage different lines of development.
git status
will indicate the currently active branch.
By checking the status of your repository, you understand what’s the current state of it what files were changed or if you are in a different branch than expected. We will see it in more details in the sections below.
The Git Workflow
Making changes in Git involves 3 steps.
Making Changes
That’s what you would do even without a versioning system. You can modify your files as usual and git will care care of detecting the changes you’ve made in the next steps. This could involve editing existing code, adding new files, or deleting files. For Git, there is no comamnd to move or rename files, it will automatically detect it by the changes you made. At this point, you running git status
will show you what files were changed and the type of change on them.
At this point you can also use git diff
command to see the differences in your repository or git diff <filename>
to see specific files or folders. It will bring you all the differences that you have made that are not commited or stagged.
Staging Changes (git add
)
Staging is like preparing a selection of changes for the next “snapshot” or commit in Git terms. Imagine it like choosing which items to pack for a trip. You don’t pack everything at once; you select the necessary items and put them in your suitcase. Similarly, with Git, you select the specific changes you want to include in the next commit using the git add
command. Suppose that you have multiple changes and you want to release only some of them. That’s how you do it in Git.
You can use git add <filename>
to add specific files or folders (it will include all changes in the folder) or git add .
in the root of the repository to add all changes at once.
You can also see the staged changes by using git diff --staged
command. It will bring all changes staged and ignore changes not yet staged. It is usefull if you want to check what will be committed on the next step.
Committing Changes (git commit
)
A commit is a snapshot of your project at a specific point in time. It records all the staged changes along with a descriptive message that explains what you’ve modified. That’s how you can track all changes made and who made them.
To create a commit, use the following command:
git commit -m "Your Commit Message"
- The
-m
flag allows you to include your commit message directly in the command. - Write clear and concise commit messages: They should briefly describe the changes made (e.g., “Fixed bug in login function,” “Added new feature: user profiles”).
Another option is to do both stagging and commiting in a single command by including -a
to it.
git commit -am "Your Commit Message"
This will stagged all changed files and include all your commit. It is an interesting shortcut but you should still understand the stagging step since IDEs will normally work with them.
Viewing Commit History (git log
)
Now that you’ve commited some changes to the repository, it is important to see what changes were made and who and when they were done. The git log
command allows you to view the history of commits made to your repository. This is incredibly useful for:
- Understanding how your project has evolved over time.
- Finding specific changes or identifying the source of a particular bug.
- Inspecting the details of each commit (author, date, message).
By default, git log
displays a chronological list of commits, with the most recent commit at the top. You can customize the output of git log
with various options, such as:
--oneline
: Displays a concise summary of each commit on a single line.--graph
: Visualizes the commit history with a graph. Specially useful if you work with branches as described below.--author=<author_name>
: Filters the log to show commits made by a specific author.
Branching and Merging
Git’s branching system is one of its most powerful features. Branches allow developers to work on new features, experiment with different ideas, or fix bugs independently without affecting the main line of development. Just imagine that many developers are working in the same project and some are working on new features while others are fixing bugs. Using branches is how you can guarantee that only changes that are completely done (including testing and all required validations) are part of your main line of development.
Creating a New Branch (git branch
)
By default, git will create a default branch for the repository, normally called master
but can have any name. That’s were your main line of development will be. To start working with a branch, you need to create it.
To create a new branch, use the command:
git branch <branch_name>
Note that this command will create a new branch but will not switch to it. In other words, it will remain in your current branch (e.g. master). We will talk about how to switch between branches in the next section. If you want to create a branch and immediately switch to it, use this command instead:
git checkout -b <branch_name>
You can also list all branches you have locally by using the following command
git branch
This will give you a full list of branches in your local repository.
Switching Between Branches (git checkout
)
In many situations you might need to switch between branches in your repository to do your changes. For example, imagine that you are working on a new feature and someone asks you to do a bugfix. In this case you will need to switch from your branch to the master branch, create a new branch for the bugfix and switch to it. This will give you a clean workspace to work on the bugfix, without affecting your work done on the new feature.
To switch to a different branch, use the git checkout
<branch_name>
command. For example, to switch to the “feature/new-feature” branch: git checkout feature/new-feature
and to switch back to main: git checkout main
Switching between branches requires that your workspace is clean without changes, stagged or non-stagged you should either commit or revert your changes (see also stashing below).
Merging Branches (git merge
)
Once you’ve finished working on your branch and are ready to incorporate it into the main line of development, you can merge it using the git merge
command.
For example, to merge the “feature/new-feature” branch into the “main” branch you should follow these steps
- Have all changes commited on feature/new-feature branch
- Switch to the main branch:
git checkout main
- Merge the changes from feature/new-feature into main branch:
git merge feature/new-feature
This can also be done on the opposite direction. Let’s imagine that some other changes were incorporated into the main branch after you started your branch and now you want to add them to your branch. You can run git merge main
to pull all missing changes to your branch and git will take care of it.
Merge Conflicts
Git is very good at merging these cases of parallel changes but not perfect. Sometimes, conflicts can occur during merging. This happens when both branches have made changes to the same lines of code. Git will indicate the conflicts, and you’ll need to manually resolve them before completing the merge. Resolving merge conflicts is a bit more complex topic and I will create a separate post to explain how to do it.
Working with Remote Repositories
One of the most important parts of working with git is the ability to sync with remote repositories. So far, we have being working only with our local repository but when you have a team of engineers working on the same project, you need a centralized place to store all your code and changes. That is were platforms like GitHub, GitLab, and BitBucket just to cite some, come into play. They are repositories of Git repositories in which a team can work with. Among other things these are some of the advantages of using them:
- Collaborate with other developers by sharing your code and merging their contributions.
- Back up your code securely online.
- Allow to enforce rules for merging changes (e.g. at least one reviewer, tests should pass)
But if you can’t commit directly to them (ignoring their local editors that are not an option for complex changes), how can you use them? By syncing your changes in your local repository with them.
Cloning a Repository (git clone
)
We briefly talked about it in “Initializing a Git Repository” section but now we can talk about it in more detail. Cloning a repository does exactly what it says: Creates a clone from a remote repository in your local computer. It also keeps the remote repository url for future synchronization. It means that all files and changes to that repository will be downloaded to your local fold and you will be able to see and change it locally. Here is one example of the command:
git clone https://github.com/username/project-name.git
You will find the URL to your repository on the platform and you might need some authentication to execute it. That depends on the platform you are using but you can easily find instructions on their website.
Pushing Changes to the Remote Repository (git push
)
Once you’ve made changes to your local repository, you can push them to the remote repository using the command:
git push
By default, git push
will push your current branch to the remote repository that defaults to “origin” (the name given to the remote repository when you clone it).
Important Note: If you created your branch locally and it does not yet exist on the remote repository, you need to specify the destination branch:
git push origin <branch_name>
This will create a new branch with the same name on the remote repository and push your local changes to it.
Setting the Default Destination
You can set the default destination for a branch using the following command:
git push --set-upstream origin <branch_name>
After setting the upstream branch, you can simply use git push
to push changes to the remote repository without having to specify the branch name each time.
Pulling Changes from the Remote Repository (git pull
)
If other developers have made changes to the remote repository, you can update your local copy using the git pull
command
git pull <remote_name> <branch_name>
This command fetches the latest changes from the remote repository and merges them into your current branch.
This command, as happens with merge command, can also cause some conflicts if some change in the remote repository changed the same lines as you did.
Essential Git Commands
This section provides a concise overview of some essential Git commands that you’ll frequently encounter:
git status
: Displays the current state of your working directory, including which files have been modified, staged, and untracked.git add
: Stages changes to be included in the next commit.git add <filename>
: Stages changes to a specific file.git add .
: Stages changes to all files in the current directory.
git commit -m "Your Message"
: Creates a new commit with the specified message.git log
: Displays the history of commits in your repository.git log --oneline
: Displays a concise summary of each commit on a single line.git log --graph
: Visualizes the commit history with a graph.
git branch
:git branch <branch_name>
: Creates a new branch.git branch
: Lists all existing branches.
git checkout <branch_name>
: Switches to the specified branch.git merge <branch_name>
: Merges the specified branch into the current branch.git clone <remote_repository_url>
: Clones an existing repository from a remote server.git push
: Pushes local commits to the remote repository.git pull
: Fetches and merges changes from the remote repository into the local branch.
Beyond the Basics:
git diff
: Shows the differences between files.git reset
: Undoes changes (use with caution!).git revert
: Reverses a specific commit.git stash
: Temporarily saves uncommitted changes.
This list covers some of the most frequently used Git commands. As you gain more experience, you’ll discover and utilize other helpful commands to streamline your workflow.
Best Practices
Following these best practices will help you work effectively with Git and maintain a clean and well-organized project history:
- Write Clear and Concise Commit Messages:
- Use the present tense (e.g., “Add user authentication feature”).
- Be specific and descriptive.
- Avoid vague messages like “Fix bugs” or “Update code”.
- Keep your messages short and to the point.
- Keep Branches Organized:
- Use descriptive branch names that clearly indicate the purpose of the branch (e.g., “feature/new-feature”, “bugfix/issue-123”).
- Avoid creating excessively long or complex branch names.
- Delete branches that are no longer needed to keep your repository clean.
- Rebase Frequently (with Caution):
- Rebasing can help to create a cleaner and more linear commit history.
- However, use rebase with caution, as it can alter the history of your branch and may cause conflicts if others have already pulled your changes.
- Use Git Stash to Temporarily Store Changes:
- If you need to switch branches but have uncommitted changes, use
git stash
to temporarily save them. You can later reapply the stashed changes usinggit stash pop
.
- If you need to switch branches but have uncommitted changes, use
- Review and Merge Pull Requests Carefully:
- When working in a team, carefully review and test pull requests before merging them into the main branch.
- Provide constructive feedback to the author of the pull request.
- Learn to Use Git Log Effectively:
- Utilize the various options of
git log
(e.g.,--oneline
,--graph
) to effectively navigate and understand your project’s history.
- Utilize the various options of
By following these best practices, you can improve your Git workflow, enhance collaboration within your team, and maintain a healthy and well-organized project history.
Troubleshooting
Git can sometimes throw unexpected errors or present challenges. Here are some common issues and how to address them:
- Merge Conflicts:
- When merging branches, conflicts can occur if both branches have made changes to the same lines of code.
- Git will indicate the conflicting sections in the affected files.
- You’ll need to manually resolve these conflicts by editing the files to incorporate the desired changes from both branches.
- Once resolved, stage the changes using
git add
and then commit the merge.
- Unstaged Changes:
- If you’re trying to commit but get an error about unstaged changes, use
git add
to stage the files you want to include in the commit.
- If you’re trying to commit but get an error about unstaged changes, use
- Incorrect Commit Messages:
- If you accidentally commit with a misleading or incomplete message, you can amend the commit using
git commit --amend
.
- If you accidentally commit with a misleading or incomplete message, you can amend the commit using
- Accidental Commits:
- If you’ve accidentally committed changes you didn’t intend to, you can use
git revert <commit_hash>
to create a new commit that reverts the changes introduced by the previous commit.
- If you’ve accidentally committed changes you didn’t intend to, you can use
- Remote Connection Issues:
- If you encounter problems pushing or pulling from a remote repository, check your internet connection, ensure that the remote repository is accessible, and verify your credentials.
- Git Configuration Issues:
- If you’re encountering unexpected behavior, check your Git configuration settings using
git config --list
. You can also usegit config
to customize your Git experience.
- If you’re encountering unexpected behavior, check your Git configuration settings using
Remember that Git has extensive documentation available online. If you encounter an error message, try searching for it online, as there’s a good chance someone else has encountered and resolved the same issue.
Conclusion
Git is an essential tool for any developer, regardless of their experience level. By mastering its core concepts and commands, you can significantly improve your workflow, collaborate more effectively with others, and build high-quality software.
This guide has provided you with a solid foundation in Git, covering fundamental concepts such as branching, merging, and working with remote repositories. Remember that Git is a powerful and versatile tool, and there’s always more to learn.
I encourage you to continue exploring Git’s features, experiment with different workflows, and leverage its capabilities to enhance your development process. Happy coding!
FAQs
- What is the difference between Git and GitHub?
- Git is a version control system, while GitHub is a popular platform that provides hosting for Git repositories. GitHub offers features like issue tracking, project management, and code review tools.
- How do I resolve merge conflicts?
- When merge conflicts occur, Git will indicate the conflicting sections in your files. You’ll need to manually edit the files to resolve the conflicts, choosing which changes to keep from each branch.
- What is a fork?
- A fork is a copy of a repository. It allows you to create your own version of a project and make changes without affecting the original repository. When you create a fork on GitHub for example, you are cloning a repository to your own account.
- How do I undo a commit?
- You can use
git revert <commit_hash>
to create a new commit that reverses the changes introduced by the specified commit.
- You can use
- How do I delete a branch?
- Use
git branch -d <branch_name>
to delete a local branch. If you’ve already pushed the branch to a remote repository, you can delete it remotely usinggit push origin --delete <branch_name>
.
- Use