Skip to content

hristo-vrigazov/py-import-tree

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

py-import-tree

Analyzing the tree of imports of running Python code.

Uses a combination of AST (abstract syntax tree) and code execution (tracing), to give you useful insights into your project.

Example

Sample project

Create a new directory, and create two files:

  1. simple.py with contents:
from collections import defaultdict

def counts(arr):
    res = defaultdict(lambda: 0)
    for el in arr:
        res[el] += 1
    return res
  1. heavy.py with contents:
import torch

def torch_utils():
    print(torch.ones(10))

def something_simple():
    print('Boiler')

Now, let's analyze this project with py_import_tree!

py_import_tree usage

from py_import_tree.import_tracker import ImportTracker

# First time setup, this traverses imports found in code
# And executes them to find out which additional packages they bring in.
tracker = ImportTracker('py_import_tree_results')
tracker.dump_for_directory('.')

You should see output similar to:

[0/2]: Dumping heavy.py...
Collecting import torch "import torch"
Collecting after import torch "import torch"
Exiting import torch "import torch"
[1/2]: Dumping simple.py...
Collecting from collections import defaultdict "from collections import defaultdict"
Collecting after from collections import defaultdict "from collections import defaultdict"
Exiting from collections import defaultdict "from collections import defaultdict"

Next, we can load the results and inspect them (compute cohesion, etc.):

from py_import_tree.cohesion import ImportTree

tree = ImportTree.from_dump('py_import_tree_results')
cohesion = tree.cohesion()

Notice that if you want to import something_simple, you will need to import torch, despite the fact that torch is not used in the something_simple function. However, torch_utils and counts function lead to imports that are exactly what they use. So 2 out of the 3 function are with perfect cohesion, and 1 of them is with zero cohesion.

cohesion.score
0.6666666666666666

We can also check per definition results:

#dataframe with cohesion for every function and class.
cohesion.definitions
path definition import dependency dependency_weight definition_ideal_weight definition_actual_weight cohesion_score
0 heavy.py FunctionDef:torch_utils import torch nan 1452132786 1452497413 1452497413 1
4 heavy.py FunctionDef:torch_utils import torch nan 0 1452497413 1452497413 1
78 heavy.py FunctionDef:torch_utils import torch typing_extensions==3.7.4.3 83727 1452497413 1452497413 1
199 heavy.py FunctionDef:torch_utils import torch tqdm==4.59.0 280900 1452497413 1452497413 1
306 heavy.py FunctionDef:something_simple nan nan 0 0 1452497413 0
307 simple.py FunctionDef:counts from collections import defaultdict nan 0 0 0 1

You can also check how would the cohesion change if you move a function or a class to another file. For example, if we move the other simple function into the file that imports torch, this would make the cohesion even worse:

tree.what_if_function_moves('simple.py', 'counts', 'heavy.py').cohesion().score
0.3333333333333333

However, if we move the function that uses torch into a separate file, this would lead to perfect cohesion:

tree.what_if_function_moves('heavy.py', 'torch_utils', 'new.py').cohesion().score
1.0

What if we move the something_simple function into the simple.py file?

tree.what_if_function_moves('heavy.py', 'something_simple', 'simple.py').cohesion().score
1.0

You can also use the resulting dataframe to analyze exact, locked versions for each function/class in your project:

cohesion.definitions[['full_definition', 'dependency']].drop_duplicates()
full_definition dependency
438 heavy.py:FunctionDef:something_simple nan
0 heavy.py:FunctionDef:torch_utils torch==1.10.1-py3.8
6 heavy.py:FunctionDef:torch_utils nan
115 heavy.py:FunctionDef:torch_utils typing_extensions==3.7.4.3
274 heavy.py:FunctionDef:torch_utils tqdm==4.59.0
439 simple.py:FunctionDef:counts nan

Enjoy!

About

Analyzing the tree of imports of running Python code.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages