Vadikus December 27, 2009 at 05:02

OptionParser and UnitTest in python scripts

In this article I want to ask the public whether I am using these two wonderful Python modules correctly. Both have long been part of the Python standard library.

Background

A script is being written to process and plot some research data, but that is not the subject here. I would like to show how the OptionParser and UnitTest modules are used together and, along the way, learn how to improve the code and its readability. Python gurus, I will be very grateful for your criticism and suggestions.

Modules

Everyone has heard of Test Driven Development (TDD) at least once. I first came across an implementation of this approach for Python in Mark Pilgrim's book "Dive Into Python 3". In the ninth chapter, Mark describes in detail how to write unit tests for his Roman numeral conversion module. The core principle is to write a test that verifies the code's behavior before writing the code itself.
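The test-first order described above can be sketched as follows. This is only an illustration: the function parse_monitor_line and its behavior are hypothetical and are not part of UC.py.

```python
import unittest


# Step 1: the test is written first and states the desired behavior.
class ParseMonitorLineTest(unittest.TestCase):
    def test_data_line(self):
        self.assertEqual(parse_monitor_line("0.5 1.5 2.5\n"), [0.5, 1.5, 2.5])

    def test_comment_line(self):
        self.assertIsNone(parse_monitor_line("# MD time (ps), CV #1\n"))


# Step 2: only then is the (hypothetical) function implemented.
def parse_monitor_line(line):
    """Return the floats on a data line, or None for a '#' comment line."""
    if line.startswith("#"):
        return None
    return [float(field) for field in line.split()]
```

Saved to a file, the tests can be run with `python -m unittest <module name>`. Before step 2 exists they fail with a NameError, which is the expected starting point in TDD.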

I should add that I have known about this method for a long time but never used it, for a banal reason: the time it takes to write the code almost doubles. That makes the approach hard to justify for short scripts that perform one specific task. With such scripts an error in the code is usually obvious, because you know roughly what range of values to expect in the output (the data is processed statistically).

Designing the new task showed that the script will be quite complex, so writing unit tests for it is justified: subsequent refactoring and debugging will be much simpler.

Next in line is OptionParser, which I have been using for a long time and expect to keep using. It makes code more readable. There are several similar parsers, and at one time there were active holy wars about which one is better. OptionParser was accused of imposing its own philosophy on how options are organized and processed. Honestly, I have not noticed anything "strange" in that organization. In any case, how readable a given option turns out depends primarily on the programmer. So let us leave the holy war aside.

Source code

Let's go straight to the source code. So far the executable module contains only one working function, readin_monitor(monitor).

    #!/usr/bin/env python
    # -*- coding: utf-8 -*-

    version = '0.0.1'
    version_name = 'gamma'
    modify_date = '2009-12-26'

    from optparse import OptionParser
    import matplotlib
    import numpy as np
    import scipy.stats as stats
    import warnings
    warnings.filterwarnings('ignore', '', DeprecationWarning)  # turning off deprecation warning in python 2.6

    kB = 8.31441e-3 / 4.184

    def readin_monitor(monitor):
        '''Read in monitor file. Ignoring all strings starting with # symbol.
        Function returns all stored data from the strings as list of lists of floats.'''
        num = 0
        data = []
        for line in open(monitor, 'r'):
            try:
                if line[0] != '#':
                    data.append([float(i) for i in line.split()])
                    num = num + 1
            except:
                pass
        if options.verbose:
            print('Read in %i data points from monitor file %s' % (num, monitor))
        return data

    def main():
        return 0

    global options
    global args
    parser = OptionParser("usage: %prog [options] [monitors]",
                          version='%prog ' + version + ' from ' + modify_date)
    parser.add_option("-v", "--verbose", action="store_true", dest="verbose",
                      default=False,
                      help="Print status messages to stdout")
    parser.add_option("-C", "--combine", dest="combine", action="store",
                      default="",
                      help='Combine all monitor files passed as arguments '
                           'to the UC.py script to one COMBINE file')
    parser.add_option('-D', '--dimentions', dest='dimentions', default='1:2',
                      help='String of DIMENTIONS for monitor files to be '
                           'read in. (defaut = %default)')
    (options, args) = parser.parse_args()

    if __name__ == '__main__':
        main()

One feature of the code layout worth noting is that the parser options are defined at the end of the module, at module level. That is, this piece of code is always executed, even when the module is imported by another script. In that case the global variables options and args hold the default values and args is empty (provided the importing script is itself started without command-line arguments, since parse_args() reads sys.argv). Because they are global variables, they can be accessed from anywhere.
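For comparison, here is a minimal sketch of an alternative layout in which nothing is parsed at import time. The function names build_parser and parse_command_line and the file names run1.mon and run2.mon are assumptions made for this illustration. The only library behavior relied on is that OptionParser.parse_args() accepts an explicit argument list and falls back to sys.argv[1:] when given None.

```python
from optparse import OptionParser


def build_parser():
    """Create the option parser; nothing is parsed at import time."""
    parser = OptionParser("usage: %prog [options] [monitors]")
    parser.add_option("-v", "--verbose", action="store_true",
                      dest="verbose", default=False,
                      help="Print status messages to stdout")
    parser.add_option("-D", "--dimentions", dest="dimentions",
                      default="1:2",
                      help="String of DIMENTIONS for monitor files")
    return parser


def parse_command_line(argv=None):
    """Parse argv (a list of strings); None means sys.argv[1:]."""
    return build_parser().parse_args(argv)


# A test or an importing script can pass its own argument list:
options, args = parse_command_line(["-v", "run1.mon", "run2.mon"])
print(options.verbose, options.dimentions, args)
# prints: True 1:2 ['run1.mon', 'run2.mon']
```

With this layout the options object can be passed to functions such as readin_monitor as an ordinary argument, so a test does not need to patch a module-level global.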

Running the script with the -h option will give detailed help on using the options:

    Usage: UC.py [options] [monitors]

    Options:
      --version             show program's version number and exit
      -h, --help            show this help message and exit
      -v, --verbose         Print status messages to stdout
      -C COMBINE, --combine=COMBINE
                            Combine all monitor files passed as arguments to the
                            UC.py script to one COMBINE file. (defaut = out)
      -D DIMENTIONS, --dimentions=DIMENTIONS
                            String of DIMENTIONS for monitor files to be read in.
                            (defaut = 0:1:2)

Next, the unit tests themselves:

    #!/usr/bin/env python
    # -*- coding: utf-8 -*-
    '''Unit tests for UC.py module.'''

    import UC
    import unittest

    global monitor
    monitor = '''#
    # MD time (ps), CV #1, CV #2
    #
    0.9990 9.2349535263 7.7537518211
    1.9990 9.4331321327 7.9555258177
    2.9990 9.5368308183 8.1341402536
    3.9990 9.4468066031 7.9086253193
    4.9990 9.1565151681 8.0027457962
    5.9990 9.2310306859 7.9872398398
    6.9990 9.1540695183 7.5236796623
    7.9990 9.0727576308 7.8499035889
    8.9990 9.3113419250 8.1227557439
    9.9990 8.9597834513 8.3754973753
    10.9990 9.5761421491 8.3053224696
    11.9990 9.5178829977 8.1660258902'''

    class Combine_monitors(unittest.TestCase):
        def test_readin_monitor(self):
            with open('test_mon', 'w') as MON:
                MON.write(monitor)
            UC.options.verbose = False
            self.assertEqual([[0.999, 9.2349535263, 7.7537518210999998],
                              [1.9990000000000001, 9.4331321327000008, 7.9555258176999999],
                              [2.9990000000000001, 9.5368308183000003, 8.1341402536],
                              [3.9990000000000001, 9.4468066031000006, 7.9086253192999996],
                              [4.9989999999999997, 9.1565151681000003, 8.0027457961999993],
                              [5.9989999999999997, 9.2310306859000004, 7.9872398398],
                              [6.9989999999999997, 9.1540695183, 7.5236796623000002],
                              [7.9989999999999997, 9.0727576308, 7.8499035889000002],
                              [8.9990000000000006, 9.3113419250000007, 8.1227557439000009],
                              [9.9990000000000006, 8.9597834512999999, 8.3754973753000002],
                              [10.999000000000001, 9.5761421491000007, 8.3053224696000001],
                              [11.999000000000001, 9.5178829976999992, 8.1660258902000002]],
                             UC.readin_monitor('test_mon'))

    def main():
        unittest.main()
        return 0

    if __name__ == '__main__':
        main()

It is worth adding deletion of the temporary files to this test and, of course, increasing the number of tests as new functions are added to the script. Running the test script produces this output:

    $ ./test-UC.py
    .
    ----------------------------------------------------------------------
    Ran 1 test in 0.000s

    OK

To write this small piece of code, I had to cheat a little. The test was written after the readin_monitor() function in the main module. The result of the function was simply printed to stdout, and from there I pasted it into the source code of the test module.
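The temporary-file cleanup mentioned above could be sketched with setUp and tearDown, roughly as follows. The helper read_floats is a hypothetical stand-in for UC.readin_monitor so that the example is self-contained, and the sample data is invented for the illustration.

```python
import os
import tempfile
import unittest


def read_floats(path):
    """Hypothetical stand-in for UC.readin_monitor: skip '#' lines."""
    with open(path) as handle:
        return [[float(x) for x in line.split()]
                for line in handle if not line.startswith("#")]


class TempMonitorTest(unittest.TestCase):
    def setUp(self):
        # Create a uniquely named file so that runs do not collide.
        fd, self.path = tempfile.mkstemp(suffix=".mon")
        with os.fdopen(fd, "w") as handle:
            handle.write("# header\n0.5 1.5\n2.5 3.5\n")

    def tearDown(self):
        # Called after each test method, whether it passed or failed.
        os.remove(self.path)

    def test_read(self):
        self.assertEqual(read_floats(self.path), [[0.5, 1.5], [2.5, 3.5]])
```

Using tempfile.mkstemp instead of a fixed name such as test_mon also avoids overwriting a real file with the same name in the working directory.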

What I do not like is that we seem to be fooling ourselves: the code was written first and the test second, which violates the philosophy of TDD. Also, because of how floating-point numbers are represented, the printed values are not exact (for example, 5.9990 is printed as 5.9989999999999997). If you run the same unit test under a different version of Python, the test may fail. Under Python 3.1 the test passed, but such exact comparisons still worry me. You could, of course, round the data yourself to, say, the 5th decimal place and compare the rounded values. But that would make the code considerably heavier and, as a result, less readable.
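One possible way to compare with a tolerance without rounding by hand is unittest's assertAlmostEqual, which checks that two numbers agree to a given number of decimal places (7 by default). The sketch below is only an illustration: the helper name assertRowsAlmostEqual is an assumption, and the single data row is taken from the monitor text above.

```python
import unittest


class AlmostEqualRows(unittest.TestCase):
    def assertRowsAlmostEqual(self, expected, actual, places=7):
        """Compare two lists of float rows element by element."""
        self.assertEqual(len(expected), len(actual))
        for exp_row, act_row in zip(expected, actual):
            self.assertEqual(len(exp_row), len(act_row))
            for exp, act in zip(exp_row, act_row):
                self.assertAlmostEqual(exp, act, places=places)

    def test_rows(self):
        # Expected values are written as they appear in the monitor file.
        expected = [[0.999, 9.2349535263, 7.7537518211]]
        text = "0.9990 9.2349535263 7.7537518211"
        parsed = [[float(field) for field in text.split()]]
        self.assertRowsAlmostEqual(expected, parsed)
```

The helper keeps the tolerance in one place, so the test body stays as short as the original assertEqual call.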

Summary

Using two short scripts as an example, I have shown how the OptionParser and UnitTest modules can be used together. The purpose of the article was not to describe all of their capabilities, so the curious reader is left to explore them on their own, using the links from the beginning of the article.

Now to the main question: what can be improved in this code and in this approach? I look forward to your answers.

Thanks for your attention.
