
sellisd/pycodeseq



A tool for extracting statistics from Python ASTs (Abstract Syntax Trees) and Jupyter notebooks

A command-line program that recursively searches a directory for Python scripts or Jupyter notebooks and extracts statistics from them. Python scripts and Jupyter notebooks are treated very differently.

For Jupyter notebooks, the sizes of code and markdown cells, measured in lines, are aggregated across all files. For Python scripts there are two options: aggregate the token rank-frequency distribution, or calculate the size (in lines of code) of every class and function in the scripts.

Installing / Getting started

python -m pip install git+https://github.com/sellisd/pycodeseq.git@main

To parse the ASTs of all Python files found under pyrepository, run:

pycodeseq --input_path pyrepository --output stats.tsv --method levels
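The README does not describe how class and function sizes are measured. As a rough illustration only, and not pycodeseq's actual implementation, the sketch below uses the standard library `ast` module to report the line span of every class and function in a piece of source code. The function name `definition_sizes` is hypothetical, and "size" is assumed here to mean the number of lines from the first to the last line of the definition (requires Python 3.8+ for `end_lineno`).

```python
import ast


def definition_sizes(source):
    """Return (kind, name, n_lines) for every class and function in source.

    Hypothetical helper: size is assumed to be the inclusive line span
    of the definition, which may differ from what pycodeseq reports.
    """
    tree = ast.parse(source)
    sizes = []
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            kind = "class" if isinstance(node, ast.ClassDef) else "function"
            n_lines = node.end_lineno - node.lineno + 1
            sizes.append((kind, node.name, n_lines))
    return sizes


example = '''
class Greeter:
    def hello(self):
        return "hi"

def add(a, b):
    return a + b
'''

for kind, name, n_lines in sorted(definition_sizes(example)):
    print(kind, name, n_lines)
```

Nested definitions (such as the method inside the class) are counted both on their own and as part of their enclosing definition.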

To calculate the token rank-frequency distribution of all Python files under pyrepository, run:

pycodeseq --input_path pyrepository --output stats.tsv --method tokens
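A rank-frequency distribution orders tokens from most to least frequent and pairs each rank with its count. The following is a minimal sketch of the idea using the standard library `tokenize` module. The function name `token_rank_frequency` and the choice of which token types to count (names, operators, numbers, strings) are assumptions made for illustration; pycodeseq may tokenize and filter differently.

```python
import io
import tokenize
from collections import Counter

# Assumption: only these token types are counted; layout tokens
# (NEWLINE, INDENT, DEDENT, ENDMARKER) and comments are ignored.
COUNTED_TYPES = (tokenize.NAME, tokenize.OP, tokenize.NUMBER, tokenize.STRING)


def token_rank_frequency(source):
    """Return a list of (rank, token, count), most frequent token first."""
    counts = Counter()
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type in COUNTED_TYPES:
            counts[tok.string] += 1
    return [
        (rank, token, count)
        for rank, (token, count) in enumerate(counts.most_common(), start=1)
    ]


for rank, token, count in token_rank_frequency("x = x + 1\ny = x\n"):
    print(rank, repr(token), count)
```

To aggregate across a repository, the per-file counters could be summed before ranking.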

To calculate the size of each type of cell in the Jupyter notebooks under pyrepository, run:

pycodeseq --input_path pyrepository --output stats.tsv --method cells
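A notebook file is JSON with a list of cells, each carrying a `cell_type` and a `source`. The sketch below shows one way the lines per cell type could be totalled for a single notebook; it is an illustration under that assumption about the file layout (nbformat version 4), not pycodeseq's own code, and the function name `cell_line_counts` is hypothetical.

```python
import json
from collections import Counter


def cell_line_counts(notebook_json):
    """Return a Counter mapping cell type to total number of source lines."""
    notebook = json.loads(notebook_json)
    totals = Counter()
    for cell in notebook.get("cells", []):
        source = cell.get("source", [])
        # The source may be stored as one string or as a list of strings.
        if isinstance(source, list):
            source = "".join(source)
        totals[cell.get("cell_type", "unknown")] += len(source.splitlines())
    return totals


demo = json.dumps({
    "cells": [
        {"cell_type": "code", "source": ["a = 1\n", "b = 2"]},
        {"cell_type": "markdown", "source": "# Title"},
    ]
})
print(cell_line_counts(demo))
```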

Developing

For development, it is easier to clone the repository and install it in editable mode:

git clone git@github.com:sellisd/pycodeseq.git
cd pycodeseq/
pip install -e .
