Pyntch

Static code analyzer for Python

Homepage Recent Changes

Last Modified: Thu Oct 29 00:29:04 JST 2009

Table of Contents:

What's It?
Download
Background
Install
How to Use
Limitations
How It Works
TODOs
Changes
Related Projects
Terms and Conditions

Download:
http://www.unixuser.org/~euske/python/pyntch/pyntch-20091028.tar.gz (50KBytes)

Discussion: (for questions and comments, post here)
http://groups.google.com/group/pyntch-users/

View the source:
http://code.google.com/p/pyntch/source/browse/trunk/pyntch

What's It?

Pyntch (pron. "pinch", originally means PYthoN Type CHecker) is a static code analyzer for Python programming language. It detects possible runtime errors before actually running a code. If you have been constantly bothered by a TypeError or AttributeError caused by giving a wrong type of objects, Pyntch is a tool for you. Pyntch examines a source code and infers all possible types of variables, attributes, function arguments, and return values of each function or method (take a look at a sample output below). Then it detects possible exceptions caused by type mismatch, attribute not found, or other types of exceptions raised from each function. Unlike other Python code checkers (such as Pychecker or Pyflakes), Pyntch does not address style issues. Pyntch normally runs pretty fast. It can perform checking for tens of thousands of lines of code within a minute.

Note: Pyntch currently supports Python 2.x only.

The following information can be gathered by static checking:

Possible types of objects that each variable, function argument or class attribte might potentially have.
Functions or instance methods that can be actually called at each function call (considering polymorphism).
Calling locations for each function or method.
Uncaught exceptions such as:
- Type mismatch (e.g. adding an integer and a string).
- Access to undefined attributes (e.g. obj.attr where obj does not have attribute attr).
- Subscript access to unsubscriptable objects (e.g. a[1] where a is not a sequence).
- Calling something uncallable (e.g. func(1) where func is not either function, method, or class).
- Iteration over non-iterable objects (e.g. sorted(x) where x is not an iterable object).

Sample Output:

(Original code)

import sys, os

# Count the total number of characters for each file in directory.
def countchars(directory):
  n = 0
  for name in os.listdir(directory):
    fp = open(name)
    for line in fp:
      n += line
    fp.fclose()
  return n

countchars(sys.argv[1])

(Annotated by Pyntch)

# os = <module os>
# sys = <module sys>
import sys, os

# Count the total number of characters for each file in directory.
def countchars(directory):
  # directory = <str>
  # fp = <file>
  # line = <str>
  # n = <int>
  # name = <str>
  # return <int>
  n = 0
  for name in os.listdir(directory):
    fp = open(name)
    for line in fp:
      # raise TypeError: not supported operand Add(int, str)
      n += line
    # raise AttributeError: attribute not found: @fp.fclose
    # raise AttributeError: attribute not found: <file>.fclose
    fp.fclose()
  return n

countchars(sys.argv[1])

Background

Have you had a TypeError caused by giving a wrong type of arguments, say, a string object to numeric functions? Or trying to access a nonexistent method of a wrongly passed class that would otherwise have such a method? One of the great advantages of scripting languages such as Python is its dynamicity. You can define any functions, variables and data structures whenever you want without elaborating the detailed type definitions. However, this feature comes with some cost: sometimes it is difficult to find potential errors that are caused by type mismatch before actually running the program.

In a language like Python, there is always a risk of uncaught runtime exceptions that a programmer could not foresee when s/he was writing the code, which causes sudden death of the program. This kind of behavior is particulary unfavorable for mission critical applications, so we want to catch these errors in advance. Unfortunately, as the program gets larger, it's getting hard to track these kinds of errors, and it's even harder to prevent them by infering which types/values can be passed or returned by each function.

Pyntch aims to help reducing these burdens by infering what kind of types can be assigned to variables/members/function arguments and what kind of types can be returned from a function at any time of execution, and what kind of exceptions might be raised. This is done by examining the code without executing it. The goal of Pyntch is to try to analyze every possible execution path and all possible combinations of data.

Sounds impossible? Well, I can show you at least this is partially possible, by using a technique called "typeflow analysis." For the details, see How it works? section. However, there's also a couple of drawbacks. Because the purpose of Pyntch is to catch as many obscure errors as possible before the code is acutally used in a production, it focuses on the coverage of the analysis at the expense of its accuracy. Sometimes Pyntch brings a lot of false positives in its result, which need to be further examined by human programmers.

Install

Install Python 2.4 or newer.
Download the Pyntch source.
Extract it.
Run setup.py to install:
```
# python setup.py install
```
Done!

How to Use

The main checking program is tchecker.py. It comes in two different running modes: summary mode and annotation mode.

In summary mode, Pyntch only shows the types of each variable and exceptions in a succinct text format. In annotation mode, there're two stages. First, the analyzer program produces an XML file that contains all information about a source code. Then another program uses that information to annotate the source texts.

Summary Mode

The basic use of summary mode is pretty simple and straightforward. Take this sample code:

$ cat -n sample.py
     1  import sys, os
     2
     3  # Count the total number of characters for each file in directory.
     4  def countchars(directory):
     5    n = 0
     6    for name in os.listdir(directory):
     7      fp = open(name)
     8      for line in fp:
     9        n += line
    10      fp.fclose()
    11    return n
    12
    13  countchars(sys.argv[1])

To check this code, simply run the tchecker.py against the source file:

$ tchecker.py sample.py
loading: 'sample.py' as 'sample'
...
total files=9, lines=1435 in 0.38sec .........................................[A]
[sample (sample.py)]
  os = <Module os (/home/euske/work/pyntch/pyntch/stub/os.pyi)>
  sys = <Module sys (/home/euske/work/pyntch/pyntch/stub/sys.pyi)>
  ### sample(4) ..............................................................[B]
  # called at sample(13) .....................................................[C]
  def countchars(directory=<str>): ...........................................[D]
    fp = <file> ..............................................................[D]
    line = <str> .............................................................[D]
    n = <int> ................................................................[D]
    name = <str> .............................................................[D]
    return = <int> ...........................................................[E]
    raises TypeError: not supported operand Add(int, str) at sample:9 ........[F]
    raises AttributeError: attribute not found: @fp.fclose at sample:10 ......[G]
    raises AttributeError: attribute not found: <file>.fclose at sample:10 ...[H]

The output shows several things:

[A]: The analyzer read 9 Python modules (1 target + 8 imported) that have 1,435 lines in total. The analysis took 0.38 seconds.
[B]: Function countchars starts from line 4 in sample1.py.
[C]: The function is called from line 13.
[D]: This shows the possible type of each variable.
[E]: The function returns an integer.
[F]: The function might raise a TypeError exception at line 10 by attempting to add an integer and string.
[G]: Attribute reference fp.fclose fails for any type of objects that fp could have contained.
[H]: A file object does not have fclose attribute. (It should have been close instead.)

Annotation Mode

In annotation mode, first you need to run the analyzer program to produce the result in an XML format. Then give the XML file to another program called annot.py. This program inserts the analysis results into appropriate places in the source code, producing an annotated source text.

Step 1:

$ tchecker -o output.xml sample.py
loading: 'sample.py' as 'sample'
...
total files=9, lines=1435 in 0.40sec

Now output.xml looks like this:

<output>
<module src="sample.py" name="sample">
  <var name="os"><compound id="0"><module name="os" /></compound></var>
  <var name="sys"><compound id="0"><module name="sys" /></compound></var>
  <function loc="sample:4:5" name="countchars">
    <caller loc="sample:13" />
    <arg name="directory"><compound id="0"><str /></compound></arg>
    <var name="fp"><compound id="0"><file /></compound></var>
    <var name="line"><compound id="0"><str /></compound></var>
    <var name="n"><compound id="0"><int /></compound></var>
    <var name="name"><compound id="0"><str /></compound></var>
    <return><compound id="0"><int /></compound></return>
    <raise msg="TypeError: not supported operand Add(int, str)" loc="sample:9" type="TypeError" />
    <raise msg="AttributeError: attribute not found: @fp.fclose" loc="sample:10" type="AttributeError" />
    <raise msg="AttributeError: attribute not found: &lt;file&gt;.fclose" loc="sample:10" type="AttributeError" />
  </function>
</module>
</output>

Step 2:

$ annot.py output.xml sample.py
# os = <module os>
# sys = <module sys>
import sys, os

# Count the total number of characters for each file in directory.
def countchars(directory):
  # directory = <str>
  # fp = <file>
  # line = <str>
  # n = <int>
  # name = <str>
  # return <int>
  n = 0
  for name in os.listdir(directory):
    fp = open(name)
    for line in fp:
      # raise TypeError: not supported operand Add(int, str)
      n += line
    # raise AttributeError: attribute not found: @fp.fclose
    # raise AttributeError: attribute not found: <file>.fclose
    fp.fclose()
  return n

countchars(sys.argv[1])

tchecker.py

tchecker.py is the main checking tool. It scans Python source codes and shows the result of analysis. The output can be later used for annotation purposes.

Syntax:

tchecker.py [options] file.py ...

Options:

-a: Shows the result of all modules that are imported, instead of the ones specified in the command line arguments.
-c config_file: Specifies a configuration file. A configuration file is a Python script that defines parameters for ErrorConfig settings.
-C key=value: Specifies a configuration parameter (described below).
-d: Increases the debug level.
-D: Ignore all the default Python module paths.
-f type: Specifies the output format. Currently txt (summary mode) or xml (annotation mode) is supported.
-o filename: Specifies an output file name. When it ends with ".xml" it automatically assumes the output format is XML.
-p >python_path: Adds a search path for Python modules. Multiple -p options are allowed.
-P stub_path: Adds a stub path for Python modules. Stub files in these directories are always used instead of a real Python module. Multiple -P options are allowed.
-q: Quiet mode. Suppresses to show the progress.

annot.py

annot.py is an annotation tool. It takes an XML output of tchecker.py and combines it with the source codes to produce annotated output. A source file can be specified either by an actual pathname or by a Python module name (as used with a "import" statement). When a pathname is used, the pathname string must be exactly the one contained in the "src" attribute in an XML output.

Syntax:

annot.py [options] output_xml file.py ...

annot.py [options] output_xml module_name ...

Options:

-p basepath: Specifies the base pathname of source files which is to be prepended to relative pathnames contained in the output_xml file.
-d: Increases the debug level.

Adding a Module Path

Pyntch can take module names instead of actual file names as input. Pyntch searches the Python search path that is specified by PYTHONPATH environment variable, as well as stub path (explained below). If you want to instruct Pyntch to look at different locations, use -p option to alter the module search path:

$ tchecker.py -p /path/to/your/modules mypackage.mymodule

Creating a Stub Module

Due to the nature of source level analysis, Pyntch cannot analyze a program that uses external modules, in which the behavior of the code is specified only in opaque binaries. In that case, a user can instruct Pyntch to use an alternative "stub" module which is written in Python and defines only the return type of each function. Python stub modules are similar to C headers, but a Python stub is a real Python code that basically does nothing than returning a particular type of objects that the "real" function would return. For example, if a Python function returns an integer and a string (depending on its input), its stub function looks like this:

def f(x):
  return 0
  return ''

Although this looks meaningless, it is a valid Python code, and since Pyntch ignores its execution order (see Limitations section), Pyntch recognizes this function as one returning an integer and/or a string.

Python stub files end with ".pyi" in their file names. They are usually placed in the default Python search path. When a Python stub and its real Python module both exist, the stub module is checked. Stub modules for several built-in modules such as sys or os.path are included in the current Pyntch distribution. They are normally placed in the Pyntch package directory (e.g. /usr/local/lib/python2.5/site-packages) and used by default instead of built-in Python modules.

Pyntch cannot correctly analyze a built-in function that returns different types of values depending on its parameters (notably struct.unpack).

Putting Extra Constraints

It is possible to inform Pyntch that a certain variable can only have a specific type. These constraints can be embedded in a source code using Python's assert statement, as in:

$ cat -n assert1.py
     1  def f(x):
     2    assert isinstance(x, (float,int))
     3    return x*2
     4
     5  f(1)
     6  f(2.3)
     7  f('foo')

$ tchecker.py assert1.py
[assert1 (assert1.py)]
  ### assert1(1)
  # called at assert1(5)
  # called at assert1(7)
  # called at assert1(6)
  def f(x=<float>|<int>):
    return = <float>|<int>
    raises TypeError: @x (<str>) must be int|float at assert1:2

Controlling warnings

Since Pyntch can produce a lot of information about a code, a user might be overwhelmed by the amount or complexity of its result. Pyntch offers a couple of ways for controlling the outputs in order to provide the desired information.

Configuration parameters:

These are defined in ErrorConfig class (config.py).

raise_uncertain (boolean): Include exceptions that might occur at runtime (such as IndexError or IOError).
ignore_none (boolean): Supress warnings for operations against a None object.
show_all_exceptions (boolean): Displays all the possible uncatched exceptions for each function, including exceptions that occur indirectly inside the internal function calls.

Limitations

One of the major drawbacks of typeflow analysis is its inability to take account of execution order (which is also true for dataflow analysis). The sequence of statements is simply ignored and all the possible order is considered. This is like considering every permutation of statements in a program and combining them into one. This sometimes brings inaccuracy to its result in exchange for a comprehensiveness of the checking. For example, consider the following two consecutive statements:

x = 1
x = 'a'

After executing these two statements, it is clear that variable x has always a string object, not an integer object. However, due to the lack of awareness of execution order, Pyntch reports this variable might have two possible types: an integer and string. Although we expect this kind of errors does not affect much to the overall usefulness of the report, we provide a way to supress this type of output. Also, Pyntch cannot detect UnboundLocalError.

Another limitation is that Pyntch assumes the scope of each namespace is statically defined, i.e. all the names (variables, functions, classes and attributes) are written down in the source code. Therefore a program that define or alter the namespace dynamically during execution cannot be correctly analyzed. Basically, a code has to meet the following conditions:

not using globals() or locals() function, nor refering to or altering __dict__ member.
not using getattr or setattr.
not using eval, compile or exec functions.
no metaclass programming.

How It Works

(This section is still way under construction.)

The basic mechanism of Pyntch is based on the idea of "typeflow analysis." This is similar to dataflow analysis, which gives the maximal set of possible data that are stored at each location (either variable or continuation) in a program. First, it constructs a big connected graph that represents the entire Python program. Every expression or statement is converted to a "node", which is an abstraact place where certain type(s) of data is stored or passed. Then it tries to figure out what type of data goes from one node to another.

Let us consider a trivial example:

A = 'xyz'
B = 2
C = a*b

Given the above statements, Pyntch constructs a graph shown in Fig. 1. A square shaped node is a "leaf" node, which represents a single type of Python object. A round shaped node is a "compound" node, which is a place that one or more types of objects can be potentially stored. Now, the data stored at the top two leaf nodes, which are a string and an integer object respectively, flow down to the lower nodes and each node passes the data according to the arrow. Both objects are "mixed" at the plus sign node, which produces a string object (because in Python multiplying a string and an integer gives a repreated string). Eventually, the object goes into variable c, which is the node at the bottom. This way, you can infer the possible type of each variable.

Fig 1. Simple expression

Handling Dynamicity

Now take an example that involves a function:

  def foo(x):
    return x+1
  def bar(y):
    return y*2
  f = foo
  z = f(1)
  f = bar
  z = f(2)

Fig 2. Handling function call

Preventing Combinatorial Explosion

(todo)

TODOs

Better representation of aggregate types (too many "..."s).
Basic execution path analysis.
Cross referencing variables/attributes.
Support special function checking (e.g. pack, unpack, array).
Add more test cases and stub modules.
Finer control of errors/warnings: ignore exceptions raised in certain class/functions.

Changes

2009/10/28: Another batch of bugfixes.
2009/10/18: Source code annotation support.
2009/09/27: Another bugfixes.
2009/09/06: More bugfixes.
2009/08/19: Bugfixes and improved speed.
2009/06/15: Several bugfixes.
2009/05/31: Initial release.
2008/03/01: Start the project.

Related Projects

Terms and Conditions

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Yusuke Shinyama