7.5. Logging and Debugging#

Collections of tools for logging and debugging Python code.

7.5.1. rich.inspect: Produce a Beautiful Report on any Python Object#

Hide code cell content
!pip install rich 
Requirement already satisfied: rich in /Users/khuyentran/book/venv/lib/python3.11/site-packages (13.7.1)
Requirement already satisfied: markdown-it-py>=2.2.0 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from rich) (2.2.0)
Requirement already satisfied: pygments<3.0.0,>=2.13.0 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from rich) (2.16.1)
Requirement already satisfied: mdurl~=0.1 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from markdown-it-py>=2.2.0->rich) (0.1.2)

If you want to quickly see which attributes and methods of a Python object are available, use richโ€™s inspect method.

richโ€™s inspect method allows you to create a beautiful report for any Python object, including a string.

from rich import inspect

print(inspect('hello', methods=True))
โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€ <class 'str'> โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
โ”‚ str(object='') -> str                                                                                           โ”‚
โ”‚ str(bytes_or_buffer[, encoding[, errors]]) -> str                                                               โ”‚
โ”‚                                                                                                                 โ”‚
โ”‚ โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ โ”‚
โ”‚ โ”‚ 'hello'                                                                                                     โ”‚ โ”‚
โ”‚ โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ โ”‚
โ”‚                                                                                                                 โ”‚
โ”‚   capitalize = def capitalize(): Return a capitalized version of the string.                                    โ”‚
โ”‚     casefold = def casefold(): Return a version of the string suitable for caseless comparisons.                โ”‚
โ”‚       center = def center(width, fillchar=' ', /): Return a centered string of length width.                    โ”‚
โ”‚        count = def count(...) S.count(sub[, start[, end]]) -> int                                               โ”‚
โ”‚       encode = def encode(encoding='utf-8', errors='strict'): Encode the string using the codec registered for  โ”‚
โ”‚                encoding.                                                                                        โ”‚
โ”‚     endswith = def endswith(...) S.endswith(suffix[, start[, end]]) -> bool                                     โ”‚
โ”‚   expandtabs = def expandtabs(tabsize=8): Return a copy where all tab characters are expanded using spaces.     โ”‚
โ”‚         find = def find(...) S.find(sub[, start[, end]]) -> int                                                 โ”‚
โ”‚       format = def format(...) S.format(*args, **kwargs) -> str                                                 โ”‚
โ”‚   format_map = def format_map(...) S.format_map(mapping) -> str                                                 โ”‚
โ”‚        index = def index(...) S.index(sub[, start[, end]]) -> int                                               โ”‚
โ”‚      isalnum = def isalnum(): Return True if the string is an alpha-numeric string, False otherwise.            โ”‚
โ”‚      isalpha = def isalpha(): Return True if the string is an alphabetic string, False otherwise.               โ”‚
โ”‚      isascii = def isascii(): Return True if all characters in the string are ASCII, False otherwise.           โ”‚
โ”‚    isdecimal = def isdecimal(): Return True if the string is a decimal string, False otherwise.                 โ”‚
โ”‚      isdigit = def isdigit(): Return True if the string is a digit string, False otherwise.                     โ”‚
โ”‚ isidentifier = def isidentifier(): Return True if the string is a valid Python identifier, False otherwise.     โ”‚
โ”‚      islower = def islower(): Return True if the string is a lowercase string, False otherwise.                 โ”‚
โ”‚    isnumeric = def isnumeric(): Return True if the string is a numeric string, False otherwise.                 โ”‚
โ”‚  isprintable = def isprintable(): Return True if the string is printable, False otherwise.                      โ”‚
โ”‚      isspace = def isspace(): Return True if the string is a whitespace string, False otherwise.                โ”‚
โ”‚      istitle = def istitle(): Return True if the string is a title-cased string, False otherwise.               โ”‚
โ”‚      isupper = def isupper(): Return True if the string is an uppercase string, False otherwise.                โ”‚
โ”‚         join = def join(iterable, /): Concatenate any number of strings.                                        โ”‚
โ”‚        ljust = def ljust(width, fillchar=' ', /): Return a left-justified string of length width.               โ”‚
โ”‚        lower = def lower(): Return a copy of the string converted to lowercase.                                 โ”‚
โ”‚       lstrip = def lstrip(chars=None, /): Return a copy of the string with leading whitespace removed.          โ”‚
โ”‚    maketrans = def maketrans(...) Return a translation table usable for str.translate().                        โ”‚
โ”‚    partition = def partition(sep, /): Partition the string into three parts using the given separator.          โ”‚
โ”‚ removeprefix = def removeprefix(prefix, /): Return a str with the given prefix string removed if present.       โ”‚
โ”‚ removesuffix = def removesuffix(suffix, /): Return a str with the given suffix string removed if present.       โ”‚
โ”‚      replace = def replace(old, new, count=-1, /): Return a copy with all occurrences of substring old replaced โ”‚
โ”‚                by new.                                                                                          โ”‚
โ”‚        rfind = def rfind(...) S.rfind(sub[, start[, end]]) -> int                                               โ”‚
โ”‚       rindex = def rindex(...) S.rindex(sub[, start[, end]]) -> int                                             โ”‚
โ”‚        rjust = def rjust(width, fillchar=' ', /): Return a right-justified string of length width.              โ”‚
โ”‚   rpartition = def rpartition(sep, /): Partition the string into three parts using the given separator.         โ”‚
โ”‚       rsplit = def rsplit(sep=None, maxsplit=-1): Return a list of the substrings in the string, using sep as   โ”‚
โ”‚                the separator string.                                                                            โ”‚
โ”‚       rstrip = def rstrip(chars=None, /): Return a copy of the string with trailing whitespace removed.         โ”‚
โ”‚        split = def split(sep=None, maxsplit=-1): Return a list of the substrings in the string, using sep as    โ”‚
โ”‚                the separator string.                                                                            โ”‚
โ”‚   splitlines = def splitlines(keepends=False): Return a list of the lines in the string, breaking at line       โ”‚
โ”‚                boundaries.                                                                                      โ”‚
โ”‚   startswith = def startswith(...) S.startswith(prefix[, start[, end]]) -> bool                                 โ”‚
โ”‚        strip = def strip(chars=None, /): Return a copy of the string with leading and trailing whitespace       โ”‚
โ”‚                removed.                                                                                         โ”‚
โ”‚     swapcase = def swapcase(): Convert uppercase characters to lowercase and lowercase characters to uppercase. โ”‚
โ”‚        title = def title(): Return a version of the string where each word is titlecased.                       โ”‚
โ”‚    translate = def translate(table, /): Replace each character in the string using the given translation table. โ”‚
โ”‚        upper = def upper(): Return a copy of the string converted to uppercase.                                 โ”‚
โ”‚        zfill = def zfill(width, /): Pad a numeric string with zeros on the left, to fill a field of the given   โ”‚
โ”‚                width.                                                                                           โ”‚
โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ
None

7.5.2. Richโ€™s Console: Debug your Python Function in One Line of Code#

Hide code cell content
!pip install rich 
Requirement already satisfied: rich in /Users/khuyentran/book/venv/lib/python3.11/site-packages (13.7.1)
Requirement already satisfied: markdown-it-py>=2.2.0 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from rich) (2.2.0)
Requirement already satisfied: pygments<3.0.0,>=2.13.0 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from rich) (2.16.1)
Requirement already satisfied: mdurl~=0.1 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from markdown-it-py>=2.2.0->rich) (0.1.2)

Sometimes, you might want to know which elements in the function created a certain output. Instead of printing every variable in the function, you can simply use Richโ€™s Console object to print both the output and all the variables in the function.

from rich import console
from rich.console import Console 
import pandas as pd 

console = Console()

data = pd.DataFrame({'a': [1, 2, 3], 'b': [4, 5, 6]})

def edit_data(data):
    var_1 = 45
    var_2 = 30
    var_3 = var_1 + var_2
    data['a'] = [var_1, var_2, var_3]
    console.log(data, log_locals=True)

edit_data(data)
[18:10:38]     a  b                                                                                1165738010.py:14
           0  45  4                                                                                                
           1  30  5                                                                                                
           2  75  6                                                                                                
           โ•ญโ”€โ”€โ”€โ”€โ”€ locals โ”€โ”€โ”€โ”€โ”€โ•ฎ                                                                                    
           โ”‚  data =     a  b โ”‚                                                                                    
           โ”‚         0  45  4 โ”‚                                                                                    
           โ”‚         1  30  5 โ”‚                                                                                    
           โ”‚         2  75  6 โ”‚                                                                                    
           โ”‚ var_1 = 45       โ”‚                                                                                    
           โ”‚ var_2 = 30       โ”‚                                                                                    
           โ”‚ var_3 = 75       โ”‚                                                                                    
           โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ                                                                                    

Link to my article about rich.

Link to rich.

7.5.3. Simplify Python Logging with Loguru#

Have you ever found yourself using print() instead of a proper logger due to the hassle of setup?

With Loguru, you can get started with logging right away. A single import is all you need to begin logging with pre-configured color and format settings.

Here is the comparison between the standard Python logging library and Loguru:

Standard Python logging library:

# loguru_vs_logging/logging_example.py

import logging

logging.basicConfig(format='%(asctime)s | %(levelname)s | %(module)s:%(funcName)s:%(lineno)d | %(message)s', level=logging.DEBUG)

def main():
    logging.debug('This is a debug message')
    logging.info('This is an info message')
    logging.warning('This is a warning message')
    logging.error('This is an error message')
    logging.critical('This is a critical message')

if __name__ == '__main__':
    main()
$ python loguru_vs_logging/logging_example.py

2023-03-13 08:46:30,802 | DEBUG | logging_example:main:6 | This is a debug message
2023-03-13 08:46:30,802 | INFO | logging_example:main:7 | This is an info message
2023-03-13 08:46:30,802 | WARNING | logging_example:main:8 | This is a warning message
2023-03-13 08:46:30,802 | ERROR | logging_example:main:9 | This is an error message
2023-03-13 08:46:30,802 | CRITICAL | logging_example:main:10 | This is a critical message

Loguru:

# loguru_vs_logging/loguru_example.py
from loguru import logger

def main():
    logger.debug("This is a debug message")
    logger.info("This is an info message")
    logger.warning("This is a warning message")
    logger.error("This is an error message")
    logger.critical("This is a critical message")


if __name__ == '__main__':
    main()
$ python loguru_vs_logging/loguru_example.py

Link to loguru.

7.5.4. loguru: Print Readable Traceback in Python#

Hide code cell content
!pip install loguru 
Collecting loguru
  Downloading loguru-0.7.2-py3-none-any.whl.metadata (23 kB)
Downloading loguru-0.7.2-py3-none-any.whl (62 kB)
Installing collected packages: loguru
Successfully installed loguru-0.7.2

Loguru also allows you to print a readable traceback by adding decorator logger.catch to a function.

from sklearn.metrics import mean_squared_error
import numpy as np
from loguru import logger

logger.add("file_{time}.log", format="{time} {level} {message}")

@logger.catch
def evaluate_result(y_true: np.array, y_pred: np.array):
    mean_square_err = mean_squared_error(y_true, y_pred)
    root_mean_square_err = mean_square_err ** 0.5

y_true = np.array([1, 2, 3])
y_pred = np.array([1.5, 2.2])
evaluate_result(y_true, y_pred)
> File "/tmp/ipykernel_174022/1865479429.py", line 14, in <module>
    evaluate_result(y_true, y_pred)
    โ”‚               โ”‚       โ”” array([1.5, 2.2])
    โ”‚               โ”” array([1, 2, 3])
    โ”” <function evaluate_result at 0x7f279588f430>

  File "/tmp/ipykernel_174022/1865479429.py", line 9, in evaluate_result
    mean_square_err = mean_squared_error(y_true, y_pred)
                      โ”‚                  โ”‚       โ”” array([1.5, 2.2])
                      โ”‚                  โ”” array([1, 2, 3])
                      โ”” <function mean_squared_error at 0x7f27958bfca0>

  File "/home/khuyen/book/venv/lib/python3.8/site-packages/sklearn/utils/validation.py", line 63, in inner_f
    return f(*args, **kwargs)
           โ”‚  โ”‚       โ”” {}
           โ”‚  โ”” (array([1, 2, 3]), array([1.5, 2.2]))
           โ”” <function mean_squared_error at 0x7f27958bfb80>
  File "/home/khuyen/book/venv/lib/python3.8/site-packages/sklearn/metrics/_regression.py", line 335, in mean_squared_error
    y_type, y_true, y_pred, multioutput = _check_reg_targets(
            โ”‚       โ”‚                     โ”” <function _check_reg_targets at 0x7f27958b7af0>
            โ”‚       โ”” array([1.5, 2.2])
            โ”” array([1, 2, 3])
  File "/home/khuyen/book/venv/lib/python3.8/site-packages/sklearn/metrics/_regression.py", line 88, in _check_reg_targets
    check_consistent_length(y_true, y_pred)
    โ”‚                       โ”‚       โ”” array([1.5, 2.2])
    โ”‚                       โ”” array([1, 2, 3])
    โ”” <function check_consistent_length at 0x7f279676e040>
  File "/home/khuyen/book/venv/lib/python3.8/site-packages/sklearn/utils/validation.py", line 319, in check_consistent_length
    raise ValueError("Found input variables with inconsistent numbers of"

ValueError: Found input variables with inconsistent numbers of samples: [3, 2]

Link to loguru.

7.5.5. Icecream: Never use print() to debug again#

Hide code cell content
!pip install icecream
Collecting icecream
  Downloading icecream-2.1.3-py2.py3-none-any.whl.metadata (1.4 kB)
Requirement already satisfied: colorama>=0.3.9 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from icecream) (0.4.6)
Requirement already satisfied: pygments>=2.2.0 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from icecream) (2.16.1)
Requirement already satisfied: executing>=0.3.1 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from icecream) (2.0.0)
Requirement already satisfied: asttokens>=2.0.1 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from icecream) (2.4.0)
Requirement already satisfied: six>=1.12.0 in /Users/khuyentran/book/venv/lib/python3.11/site-packages (from asttokens>=2.0.1->icecream) (1.16.0)
Downloading icecream-2.1.3-py2.py3-none-any.whl (8.4 kB)
Installing collected packages: icecream
Successfully installed icecream-2.1.3

If you use print or log to debug your code, you might be confused about which line of code creates the output, especially when there are many outputs.

You might insert text to make it less confusing, but it is time-consuming.

from icecream import ic

def plus_one(num):
    return num + 1

print('output of plus_on with num = 1:', plus_one(1))
print('output of plus_on with num = 2:', plus_one(2))
output of plus_on with num = 1: 2
output of plus_on with num = 2: 3

Try icecream instead. Icrecream inspects itself and prints both its own arguments and the values of those arguments like below.

ic(plus_one(1))
ic(plus_one(2))
Hide code cell output
ic| plus_one(1): 2
ic| plus_one(2): 3
3

Output:

ic| plus_one(1): 2
ic| plus_one(2): 3

Link to icecream

Link to my article about icecream

7.5.6. heartrate โ€” Visualize the Execution of a Python Program in Real-Time#

Hide code cell content
!pip install heartrate 

Understanding how your Python code executes in real-time and identifying performance bottlenecks or execution patterns results in a lot of manual debugging and print statements. This often leads to cluttered code and incomplete understanding of program flow.

def factorial(x, depth=0):
    print(f"Calculating factorial({x})")
    
    if x == 1:
        print(f"Base case: factorial(1) = 1")
        return 1
    else:
        result = x * factorial(x-1, depth + 1)
        print(f"factorial({x}) = {x} * factorial({x-1}) = {result}")
        return result


if __name__ == "__main__":
    num = 5
    result = factorial(num)
    print(f"The factorial of {num} is {factorial(num)}")
Calculating factorial(5)
Calculating factorial(4)
Calculating factorial(3)
Calculating factorial(2)
Calculating factorial(1)
Base case: factorial(1) = 1
factorial(2) = 2 * factorial(1) = 2
factorial(3) = 3 * factorial(2) = 6
factorial(4) = 4 * factorial(3) = 24
factorial(5) = 5 * factorial(4) = 120
Calculating factorial(5)
Calculating factorial(4)
Calculating factorial(3)
Calculating factorial(2)
Calculating factorial(1)
Base case: factorial(1) = 1
factorial(2) = 2 * factorial(1) = 2
factorial(3) = 3 * factorial(2) = 6
factorial(4) = 4 * factorial(3) = 24
factorial(5) = 5 * factorial(4) = 120
The factorial of 5 is 120

You can use heartrate to visualize your code execution in real-time through a browser interface. It shows line execution counts, recent activity with color-coded bars, and a live stacktrace without modifying your source code.

You only need to add two lines of code to use heartrate.

from time import sleep
import heartrate 
heartrate.trace(browser=True)

def factorial(x):
    if x == 1:
        sleep(1)
        return 1
    else:
        sleep(1)
        return (x * factorial(x-1))


if __name__ == "__main__":
    num = 5
    print(f"The factorial of {num} is {factorial(num)}")
Hide code cell output
The factorial of 5 is 120

You should see something similar to the below when opening the browser:

image

Explanation of the visualization:

  • Line hit counts on the left side

  • Visual bars showing recent line executions (longer = more hits, lighter = more recent)

  • Currently executing lines highlighted

Link to heartrate.

7.5.7. pyheat: Simplify Code Profiling with Heatmap Visualization#

Hide code cell content
!pip install py-heat 
Collecting py-heat
  Downloading py-heat-0.0.6.tar.gz (7.1 kB)
  Preparing metadata (setup.py) ... ?25l-^C
ERROR: Operation cancelled by user

Profiling your code is helpful, but looking at data in a table can be a real headache. Wouldnโ€™t it be nice if you could see your codeโ€™s time distribution as a heatmap? That is when pyheat comes in handy.

%%writefile pyheat_example.py
def factorial(x):
    if x == 1:
        return 1
    else:
        return (x * factorial(x-1))


if __name__ == "__main__":
    num = 5
    factorial(num)
Writing pyheat_example.py
from pyheat import PyHeat

ph = PyHeat("pyheat_example.py") # file you want to profile
ph.create_heatmap()
ph.show_heatmap()
../_images/175af9707f2f09011dd6cccaedbc6703c7c66a9d8460ed60e3b4109ea030d688.png

Link to pyheat.

7.5.8. snoop : Smart Print to Debug your Python Function#

Hide code cell content
!pip install snoop

If you want to figure out what is happening in your code without adding many print statements, try snoop.

To use snoop, simply add the @snoop decorator to a function you want to understand.

import snoop 

@snoop
def factorial(x):
    if x == 1:
        return 1
    else:
        return (x * factorial(x-1))


if __name__ == "__main__":
    num = 5
    print(f"The factorial of {num} is {factorial(num)}")
10:19:00.73 >>> Call to factorial in File "<ipython-input-2-57aff36d5f6d>", line 4
10:19:00.73 ...... x = 5
10:19:00.73    4 | def factorial(x):
10:19:00.73    5 |     if x == 1:
10:19:00.73    8 |         return (x * factorial(x-1))
    10:19:00.74 >>> Call to factorial in File "<ipython-input-2-57aff36d5f6d>", line 4
    10:19:00.74 ...... x = 4
    10:19:00.74    4 | def factorial(x):
    10:19:00.74    5 |     if x == 1:
    10:19:00.74    8 |         return (x * factorial(x-1))
        10:19:00.74 >>> Call to factorial in File "<ipython-input-2-57aff36d5f6d>", line 4
        10:19:00.74 ...... x = 3
        10:19:00.74    4 | def factorial(x):
        10:19:00.74    5 |     if x == 1:
        10:19:00.75    8 |         return (x * factorial(x-1))
            10:19:00.75 >>> Call to factorial in File "<ipython-input-2-57aff36d5f6d>", line 4
            10:19:00.75 ...... x = 2
            10:19:00.75    4 | def factorial(x):
            10:19:00.75    5 |     if x == 1:
            10:19:00.75    8 |         return (x * factorial(x-1))
                10:19:00.75 >>> Call to factorial in File "<ipython-input-2-57aff36d5f6d>", line 4
                10:19:00.75 ...... x = 1
                10:19:00.75    4 | def factorial(x):
                10:19:00.76    5 |     if x == 1:
                10:19:00.76    6 |         return 1
                10:19:00.76 <<< Return value from factorial: 1
            10:19:00.76    8 |         return (x * factorial(x-1))
            10:19:00.77 <<< Return value from factorial: 2
        10:19:00.77    8 |         return (x * factorial(x-1))
        10:19:00.77 <<< Return value from factorial: 6
    10:19:00.77    8 |         return (x * factorial(x-1))
    10:19:00.77 <<< Return value from factorial: 24
10:19:00.78    8 |         return (x * factorial(x-1))
10:19:00.78 <<< Return value from factorial: 120
The factorial of 5 is 120

7.5.9. Logging in Pandas Pipelines#

Hide code cell content
!pip install scikit-lego

When using pandas pipe, you might want to check whether each pipeline transforms your pandas DataFrame correctly. To automatically log the information of a pandas DataFrame after each pipeline, use the decorator sklego.pandas_utils.log_step.

import pandas as pd 
from sklego.pandas_utils import log_step 
import logging 
df = pd.DataFrame({"col1": [1, 2, 3], "col2": ["a", "b", "c"]})

To use log_step, simply use it as a decorator for functions being applied to your DataFrame.

@log_step(print_fn=logging.info)
def make_copy(df: pd.DataFrame):
    return df.copy()


@log_step(print_fn=logging.info)
def drop_column(df: pd.DataFrame):
    return df[["col2"]]


@log_step(print_fn=logging.info)
def encode_cat_variables(df: pd.DataFrame):
    df["col2"] = df["col2"].map({"a": 1, "b": 2, "c": 3})
    return df
df = df.pipe(make_copy).pipe(drop_column).pipe(encode_cat_variables)
INFO:root:[make_copy(df)] time=0:00:00.000239 n_obs=3, n_col=2
INFO:root:[drop_column(df)] time=0:00:00.002117 n_obs=3, n_col=1
INFO:root:[encode_cat_variables(df)] time=0:00:00.003217 n_obs=3, n_col=1

Find more ways to customize your logging here

7.5.10. Add Progress Bar to Your List Comprehension#

Hide code cell content
!pip install tqdm

If your for loop or list comprehension takes a long time to run, you might want to know which element is being processed. You can add clarity to your for-loop by using tqdm. Using tqdm with an iterable will show a progress bar.

from tqdm.notebook import tqdm
from time import sleep


def lower(word):
    sleep(1)
    print(f"Processing {word}")
    return word.lower()


words = tqdm(["Duck", "dog", "Flower", "fan"])

[lower(word) for word in words]
Processing Duck
Processing dog
Processing Flower
Processing fan
['duck', 'dog', 'flower', 'fan']

Link to tqdm.

7.5.11. Hyperfine: Compare the Speed of Two Commands#

If you want to compare the speed of two arbitrary commands, use hyperfine. hyperfine creates a statistical analysis across multiple runs and detects outliers.

In the code below, I use hyperfine to compare the execution speed of two Python files.

$ hyperfine 'python example1.py' 'python example2.py'

Link to hyperfine.

7.5.12. Timeline View of the Execution of Your Python Functions#

Have you ever wanted to see a timeline graph showing how the execution of your functions takes place in time? Prefect now allows you to do exactly that.

To use Prefect, simply add @task and @flow to your Python functions.

from prefect import task, flow
from time import sleep

@task
def task1(secs):
    sleep(secs)

@task 
def task2(secs):
    sleep(secs)

@flow
def my_flow():
    task1(2)
    task2.submit(10)
    task2.submit(2)

my_flow()

When running this code, you should see the following on your Prefect UI.

Check out the getting started tutorials for basic concepts of Prefect.

7.5.13. WAT: Your One-Stop Tool for Python Object Exploration#

Hide code cell content
!pip install wat-inspector

Inspecting object states and understanding their properties often requires tedious setup of print statements or frequent context switching between code and documentation.

With WAT, you can quickly examine an objectโ€™s type, formatted value, variables, methods, parent types, signature, and documentation - all in one view.

To use WAT, simply prepend wat/ to any object you wish to inspect.

import wat
import datetime

wat/datetime.datetime.now()
str: 2024-08-12 18:12:46.949190
repr: datetime.datetime(2024, 8, 12, 18, 12, 46, 949190)
type: datetime.datetime
parents: datetime.date

Public attributes:
  day: int = 12
  fold: int = 0
  hour: int = 18
  max: datetime.datetime = 9999-12-31 23:59:59.999999
  microsecond: int = 949190
  min: datetime.datetime = 0001-01-01 00:00:00
  minute: int = 12
  month: int = 8
  resolution: datetime.timedelta = 0:00:00.000001
  second: int = 46
  tzinfo: NoneType = None
  year: int = 2024

  def astimezone(โ€ฆ) # tz -> convert to local time in new timezone tz
  def combine(โ€ฆ) # date, time -> datetime with same date and time fields
  def ctime(โ€ฆ) # Return ctime() style string.
  def date(โ€ฆ) # Return date object with same year, month and day.
  def dst(โ€ฆ) # Return self.tzinfo.dst(self).
  def fromisocalendar(โ€ฆ) # int, int, int -> Construct a date from the ISO year, week number and weekday.โ€ฆ
  def fromisoformat(โ€ฆ) # string -> datetime from a string in most ISO 8601 formats
  def fromordinal(โ€ฆ) # int -> date corresponding to a proleptic Gregorian ordinal.
  def fromtimestamp(โ€ฆ) # timestamp[, tz] -> tz's local time from POSIX timestamp.
  def isocalendar(โ€ฆ) # Return a named tuple containing ISO year, week number, and weekday.
  def isoformat(โ€ฆ) # [sep] -> string in ISO 8601 format, YYYY-MM-DDT[HH[:MM[:SS[.mmm[uuu]]]]][+HH:MM].โ€ฆ
  def isoweekday(โ€ฆ) # Return the day of the week represented by the date.โ€ฆ
  def now(tz=None) # Returns new datetime object representing current time local to tz.โ€ฆ
  def replace(โ€ฆ) # Return datetime with new specified fields.
  def strftime(โ€ฆ) # format -> strftime() style string.
  def strptime(โ€ฆ) # string, format -> new datetime parsed from a string (like time.strptime()).
  def time(โ€ฆ) # Return time object with same time but with tzinfo=None.
  def timestamp(โ€ฆ) # Return POSIX timestamp as float.
  def timetuple(โ€ฆ) # Return time tuple, compatible with time.localtime().
  def timetz(โ€ฆ) # Return time object with same time and tzinfo.
  def today(โ€ฆ) # Current date or datetime:  same as self.__class__.fromtimestamp(time.time()).
  def toordinal(โ€ฆ) # Return proleptic Gregorian ordinal.  January 1 of year 1 is day 1.
  def tzname(โ€ฆ) # Return self.tzinfo.tzname(self).
  def utcfromtimestamp(โ€ฆ) # Construct a naive UTC datetime from a POSIX timestamp.
  def utcnow(โ€ฆ) # Return a new datetime representing UTC day and time.
  def utcoffset(โ€ฆ) # Return self.tzinfo.utcoffset(self).
  def utctimetuple(โ€ฆ) # Return UTC time tuple, compatible with time.localtime().
  def weekday(โ€ฆ) # Return the day of the week represented by the date.โ€ฆ

Link to WAT.