Skip to content

Gem that provides the ability to determine the evolution of code within a git repository

License

Notifications You must be signed in to change notification settings

kevinjalbert/git_evolution

Folders and files

NameName
Last commit message
Last commit date
Oct 25, 2021
Feb 23, 2015
Oct 24, 2021
Oct 25, 2021
Feb 23, 2015
Oct 25, 2021
Feb 20, 2015
Oct 26, 2021
Feb 23, 2015
Oct 24, 2021

Repository files navigation

GitEvolution

CI Coverage Status Code Climate

Have you ever curious how a specific line or group of lines got to that point? If those lines are captured in a git repository then the history is there. Being able to identify the evolution of said lines to their current state can be tricky and cumbersome. GitEvolution aims to solve this problem and provide a quick, and informative approach to understanding the evolution of changes in a git repository.

This is incredibly useful when dealing with source lines in a software project as the commit messages often contain nuggets of information on decisions for change.

Installation

gem install git_evolution

Usage

Command-line interface --help

$ git_evolution --help

Usage: git_evolution [options] <file>
    -r, --range N:N                  The specified range of lines to consider within the file (optional)
    -s, --since STRING               Consider the commits which are more recent than the specified time (optional)

Example Scenario

If we were interesting in the source code evolution that lead to rails's Array#forty_two:

$ git_evolution --range 70:75 ./rails/activesupport/lib/active_support/core_ext/array/access.rb

Commits:
utenmiki <utenmiki@gmail.com> (Thu Oct 31 23:20:15 2013 +0900) - 3f79d8423078f0671c8aa505ae199608d451663d
Add Rdoc document for Array#forty_two

Jeremy Kemper <jeremy@bitsweat.net> (Sat Mar 21 03:26:09 2009 -0700) - 83fd1ae122cf1ee4ea2c52e0bd963462163516ca
Convert array extension modules to class reopens

David Heinemeier Hansson <david@loudthinking.com> (Fri Nov 21 09:06:46 2008 +0100) - e50530ca3ab5db53ebc74314c54b62b91b932389
Reduced the number of literal aliases to the range that has actually seen personal use. With the massive savings in overhead, I was able to fit Array#forty_two

Pratik Naik <pratiknaik@gmail.com> (Sun Oct 5 22:16:26 2008 +0100) - a2932784bb71e72a78c32819ebd7ed2bed551e3e
Merge docrails

Pratik Naik <pratiknaik@gmail.com> (Mon Jul 28 12:26:59 2008 +0100) - 6e754551254a8cc64e034163f5d0dc155b450388
Merge docrails changes

David Heinemeier Hansson <david@loudthinking.com> (Tue Jun 17 13:37:57 2008 -0500) - 22af62cf486721ee2e45bb720c42ac2f4121faf4
Added Array#second through Array#tenth as aliases for Array#[1] through Array#[9] [DHH]

David Heinemeier Hansson <david@loudthinking.com> (Tue Nov 27 19:42:30 2007 +0000) - 4d177ae0d6d9f60c4000f45fb6f6df27317afbff
Added Array#from and Array#to that behaves just from String#from and String#to [DHH]

--------------------------------------------------------------------------------

Ownership (Commits):
David Heinemeier Hansson <david@loudthinking.com> - 3/7 (42.86%)
Pratik Naik <pratiknaik@gmail.com> - 2/7 (28.57%)
Jeremy Kemper <jeremy@bitsweat.net> - 1/7 (14.29%)
utenmiki <utenmiki@gmail.com> - 1/7 (14.29%)

Ownership (Changes):
David Heinemeier Hansson <david@loudthinking.com> - 53/84 (63.1%)
Pratik Naik <pratiknaik@gmail.com> - 20/84 (23.81%)
Jeremy Kemper <jeremy@bitsweat.net> - 9/84 (10.71%)
utenmiki <utenmiki@gmail.com> - 2/84 (2.38%)

GitEvolution provides a succinct output of the commits which contains any changes which lead to the current state. Its quick to see that e50530ca3ab5db53ebc74314c54b62b91b932389 was the introduction point for Array#forty_two.

In addition we have some ownership information with respect to commits and changes. The ownership information can be used to identify who to follow up with for additional context. In time new metrics and analysis could be attached to the output (i.e., factoring in time for ownership, types of changes, identifying file modifications such as movement or renames, etc...)

The Other (Hard) Way

If we were to identify the commit which introduced Array#forty_two we have two main options git blame and git log:

Using git blame

I highly advise against this approach as it involves a lot of manual work. You essentially use git blame to identify the previous commit which effects a line of concern within the area you are looking in.

$ git blame --follow ./activesupport/lib/active_support/core_ext/array/access.rb

...
83fd1ae1 (Jeremy Kemper            2009-03-21 03:26:09 -0700 70)   # Equal to <tt>self[41]</tt>. Also known as accessing "the reddit".
3f79d842 (utenmiki                 2013-10-31 23:20:15 +0900 71)   #
3f79d842 (utenmiki                 2013-10-31 23:20:15 +0900 72)   #   (1..42).to_a.forty_two # => 42
83fd1ae1 (Jeremy Kemper            2009-03-21 03:26:09 -0700 73)   def forty_two
83fd1ae1 (Jeremy Kemper            2009-03-21 03:26:09 -0700 74)     self[41]
4d177ae0 (David Heinemeier Hansson 2007-11-27 19:42:30 +0000 75)   end
...

You can then use a similar command to blame the parent git revision at that point.

$ git blame --follow 83fd1ae1^ -- ./activesupport/lib/active_support/core_ext/array/access.rb

...
e50530ca (David Heinemeier Hansson 2008-11-21 09:06:46 +0100 46)         # Equal to <tt>self[41]</tt>. Also known as accessing "the reddit".
e50530ca (David Heinemeier Hansson 2008-11-21 09:06:46 +0100 47)         def forty_two
e50530ca (David Heinemeier Hansson 2008-11-21 09:06:46 +0100 48)           self[41]
22af62cf (David Heinemeier Hansson 2008-06-17 13:37:57 -0500 49)         end
...

All the while you occasionally want to inspect the commit in more detail using git show <commit-sha>. Eventually you will end up where you want to be.

$ git show e50530ca

commit e50530ca3ab5db53ebc74314c54b62b91b932389
Author: David Heinemeier Hansson <david@loudthinking.com>
Date:   Fri Nov 21 09:06:46 2008 +0100

    Reduced the number of literal aliases to the range that has actually seen personal use. With the massive savings in overhead, I was able to fit Array#forty_two

Using git log

The following git log command presents the entire file history (26 commits):

$ git log --follow ./activesupport/lib/active_support/core_ext/array/access.rb

This works, but it is a lot more information to sift through.

A better approach (which is what GitEvolution uses under the hood) presents the file history for only the concerned portion (7 commits):

$ git log -L70,75:./activesupport/lib/active_support/core_ext/array/access.rb

The output isn't succinct, nor does it have ownership information. The command is also more verbose.

Editor

I made a Vim function that allows me to visually select and call git_evolution using ge on the selected lines. This opens the output in a new buffer which I can then look through and yank commit SHAs if needed.

Contributing

  1. Fork it ( https://github.com/kevinjalbert/git_evolution/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request

About

Gem that provides the ability to determine the evolution of code within a git repository

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages