particular revisions for benchmarks?

From: Neels J Hofmeyr <neels_at_elego.de>
Date: Thu, 05 Jul 2012 16:04:12 +0200

I've just installed the revamped benchmark scripts on our VM, which will
record all timings instead of throwing them away after being done. That way
we can one day have charts showing changes over a longer term.

Now I could run a few specific revisions manually, to populate the pretty
empty timings database. So I'd like to know if there are some particularly
interesting revisions that you'd like to see compared.

Thanks,
~Neels

On 2012-07-05 13:46, neels_at_apache.org wrote:
> Author: neels
> Date: Thu Jul 5 11:46:17 2012
> New Revision: 1357564
>
> URL: http://svn.apache.org/viewvc?rev=1357564&view=rev
> Log:
> Revamp benchmarks/suite1, as promised on the hackathon. A slight code bomb.
>
> Use an SQLite db to record all timings ever measured. Add capability to
> generate charts from recorded data. Commandline interface has changed,
> featuring a way to indicate slices of recorded timings to show/compare/chart.
>
> * tools/dev/benchmarks/suite1/benchmark.py: Revamp.
>
> * tools/dev/benchmarks/suite1/run:
> Adjust so that it does roughly the same, except, since all timings are now
> recorded, only run the 1.7.0 tag once instead of N times every week.
> Todo: Actually ask benchmark.py for charts and show them to the dev list.
>
> Modified:
> subversion/trunk/tools/dev/benchmarks/suite1/benchmark.py
> subversion/trunk/tools/dev/benchmarks/suite1/run
>
> Modified: subversion/trunk/tools/dev/benchmarks/suite1/benchmark.py
> URL: http://svn.apache.org/viewvc/subversion/trunk/tools/dev/benchmarks/suite1/benchmark.py?rev=1357564&r1=1357563&r2=1357564&view=diff
> ==============================================================================
> --- subversion/trunk/tools/dev/benchmarks/suite1/benchmark.py (original)
> +++ subversion/trunk/tools/dev/benchmarks/suite1/benchmark.py Thu Jul 5 11:46:17 2012
> @@ -17,42 +17,139 @@
> # specific language governing permissions and limitations
> # under the License.
>
> -"""
> -usage: benchmark.py run <run_file> <levels> <spread> [N]
> - benchmark.py show <run_file>
> - benchmark.py compare <run_file1> <run_file2>
> - benchmark.py combine <new_file> <run_file1> <run_file2> ...
> +"""Usage: benchmark.py run|list|compare|show|chart ...
>
> -Test data is written to run_file.
> -If a run_file exists, data is added to it.
> +RUN BENCHMARKS
> +
> + benchmark.py run <branch>@<revision>,<levels>x<spread> [N] [options]
> +
> +Test data is added to an sqlite database created automatically, by default
> +'benchmark.db' in the current working directory. To specify a different path,
> +use option -f <path_to_db>.
> +
> +<branch_name> is a label of the svn branch you're testing, e.g. "1.7.x".
> +<revision> is the last-changed-revision of above branch.
> <levels> is the number of directory levels to create
> <spread> is the number of child trees spreading off each dir level
> If <N> is provided, the run is repeated N times.
> -"""
> +
> +<branch_name> and <revision> are simply used for later reference. You
> +should enter labels matching the selected --svn-bin-dir.
> +
> +<levels> and <spread> control the way the tested working copy is structured:
> + <levels>: number of directory levels to create.
> + <spread>: number of files and subdirectories created in each dir.
> +
> +
> +LIST WHAT IS ON RECORD
> +
> + benchmark.py list [ <branch>@<rev>,<levels>x<spread> ]
> +
> +Find entries in the database for the given constraints. Any arguments can
> +be omitted. (To select only a rev, start with a '@', like '@123'; to select
> +only spread, start with an 'x', like "x100".)
> +
> +Omit all args to get a listing of all available distinct entries.
> +
> +
> +COMPARE TIMINGS
> +
> + benchmark.py compare B_at_R,LxS B_at_R,LxS
> +
> +Compare two kinds of timings (in text mode). Each B_at_R,LxS selects
> +timings from branch, revision, WC-levels and -spread by the same labels as
> +previously given for a 'run' call. Any elements can be omitted. For example:
> + benchmark.py compare 1.7.0 trunk_at_1349903
> + Compare the total timings of all combined '1.7.0' branch runs to
> + all combined runs of 'trunk'-at-revision-1349903.
> + benchmark.py compare 1.7.0,5x5 trunk_at_1349903,5x5
> + Same as above, but only compare the working copy types with 5 levels
> + and a spread of 5.
> +
> +
> +SHOW TIMINGS
> +
> + benchmark.py show <branch>@<rev>,<levels>x<spread>
> +
> +Print out a summary of the timings selected from the given constraints.
> +Any arguments can be omitted (like for the 'list' command).
> +
> +
> +GENERATE CHARTS
> +
> + benchmark.py chart compare B_at_R,LxS B_at_R,LxS [ B_at_R,LxS ... ]
> +
> +Produce a bar chart that compares any number of sets of timings. Timing sets
> +are supplied by B_at_R,LxS arguments (i.e. <branch>@<rev>,<levels>x<spread> as
> +provided for a 'run' call), where any number of elements may be omitted. The
> +less constraints you supply, the more timings are included (try it out with
> +the 'list' command). The first set is taken as a reference point for 100% and
> ++0 seconds. Each following dataset produces a set of labeled bar charts.
> +So, at least two constraint arguments must be provided.
> +
> +Use the -c option to limit charts to specific command names.
> +
> +
> +EXAMPLES
> +
> +# Run 3 benchmarks on svn 1.7.0. Timings are saved in benchmark.db.
> +# Provide label '1.7.0' and its Last-Changed-Rev for later reference.
> +# (You may also set your $PATH instead of using --svn-bin-dir.)
> +./benchmark.py run --svn-bin-dir ~/svn-prefix/1.7.0/bin 1.7.0_at_1181106,5x5 3
> +
> +# Record 3 benchmark runs on trunk, again naming its Last-Changed-Rev.
> +./benchmark.py run --svn-bin-dir ~/svn-prefix/trunk/bin trunk_at_1352725,5x5 3
> +
> +# Work with the results of above two runs
> +./benchmark.py list
> +./benchmark.py compare 1.7.0 trunk
> +./benchmark.py show 1.7.0 trunk
> +./benchmark.py chart compare 1.7.0 trunk
> +./benchmark.py chart compare 1.7.0 trunk -c "update,commit,TOTAL RUN"
> +
> +# Rebuild r1352598, run it and chart improvements since 1.7.0.
> +svn up -r1352598 ~/src/trunk
> +make -C ~/src/trunk dist-clean install
> +export PATH="$HOME/svn-prefix/trunk/bin:$PATH"
> +which svn
> +./benchmark.py run trunk_at_1352598,5x5 3
> +./benchmark.py chart compare 1.7.0 trunk_at_1352598 trunk_at_1352725 -o chart.svg
> +
> +
> +GLOBAL OPTIONS"""
>
> import os
> -import sys
> +import time
> +import datetime
> +import sqlite3
> +import optparse
> import tempfile
> import subprocess
> -import datetime
> import random
> import shutil
> -import cPickle
> -import optparse
> import stat
> +import string
>
> +IGNORE_COMMANDS = ('--version', )
> TOTAL_RUN = 'TOTAL RUN'
>
> -timings = None
> +j = os.path.join
>
> -def run_cmd(cmd, stdin=None, shell=False):
> +def time_str():
> + return time.strftime('%Y-%m-%d %H:%M:%S');
>
> - if shell:
> - printable_cmd = 'CMD: ' + cmd
> - else:
> - printable_cmd = 'CMD: ' + ' '.join(cmd)
> +def timedelta_to_seconds(td):
> + return ( float(td.seconds)
> + + float(td.microseconds) / (10**6)
> + + td.days * 24 * 60 * 60 )
> +
> +def run_cmd(cmd, stdin=None, shell=False, verbose=False):
> if options.verbose:
> - print printable_cmd
> + if shell:
> + printable_cmd = cmd
> + else:
> + printable_cmd = ' '.join(cmd)
> + print 'CMD:', printable_cmd
>
> if stdin:
> stdin_arg = subprocess.PIPE
> @@ -66,584 +163,1005 @@ def run_cmd(cmd, stdin=None, shell=False
> shell=shell)
> stdout,stderr = p.communicate(input=stdin)
>
> - if options.verbose:
> + if verbose:
> if (stdout):
> print "STDOUT: [[[\n%s]]]" % ''.join(stdout)
> if (stderr):
> print "STDERR: [[[\n%s]]]" % ''.join(stderr)
>
> - return stdout,stderr
> -
> -def timedelta_to_seconds(td):
> - return ( float(td.seconds)
> - + float(td.microseconds) / (10**6)
> - + td.days * 24 * 60 * 60 )
> -
> -
> -class Timings:
> -
> - def __init__(self, *ignore_svn_cmds):
> - self.timings = {}
> - self.current_name = None
> - self.tic_at = None
> - self.ignore = ignore_svn_cmds
> - self.name = None
> -
> - def tic(self, name):
> - if name in self.ignore:
> - return
> - self.toc()
> - self.current_name = name
> - self.tic_at = datetime.datetime.now()
> -
> - def toc(self):
> - if self.current_name and self.tic_at:
> - toc_at = datetime.datetime.now()
> - self.submit_timing(self.current_name,
> - timedelta_to_seconds(toc_at - self.tic_at))
> - self.current_name = None
> - self.tic_at = None
> -
> - def submit_timing(self, name, seconds):
> - times = self.timings.get(name)
> - if not times:
> - times = []
> - self.timings[name] = times
> - times.append(seconds)
> -
> - def min_max_avg(self, name):
> - ttimings = self.timings.get(name)
> - return ( min(ttimings),
> - max(ttimings),
> - reduce(lambda x,y: x + y, ttimings) / len(ttimings) )
> + return stdout, stderr
>
> - def summary(self):
> - s = []
> - if self.name:
> - s.append('Timings for %s' % self.name)
> - s.append(' N min max avg operation (unit is seconds)')
>
> - names = sorted(self.timings.keys())
> +_next_unique_basename_count = 0
>
> - for name in names:
> - timings = self.timings.get(name)
> - if not name or not timings: continue
> +def next_unique_basename(prefix):
> + global _next_unique_basename_count
> + _next_unique_basename_count += 1
> + return '_'.join((prefix, str(_next_unique_basename_count)))
>
> - tmin, tmax, tavg = self.min_max_avg(name)
>
> - s.append('%5d %7.2f %7.2f %7.2f %s' % (
> - len(timings),
> - tmin,
> - tmax,
> - tavg,
> - name))
> +def split_arg_once(l_r, sep):
> + if not l_r:
> + return (None, None)
> + if sep in l_r:
> + l, r = l_r.split(sep)
> + else:
> + l = l_r
> + r = None
> + if not l:
> + l = None
> + if not r:
> + r = None
> + return (l, r)
> +
> +RUN_KIND_SEPARATORS=('@', ',', 'x')
> +
> +class RunKind:
> + def __init__(self, b_r_l_s):
> + b_r, l_s = split_arg_once(b_r_l_s, RUN_KIND_SEPARATORS[1])
> + self.branch, self.revision = split_arg_once(b_r, RUN_KIND_SEPARATORS[0])
> + self.levels, self.spread = split_arg_once(l_s, RUN_KIND_SEPARATORS[2])
> + if self.levels: self.levels = int(self.levels)
> + if self.spread: self.spread = int(self.spread)
> +
> + label_parts = []
> + if self.branch:
> + label_parts.append(self.branch)
> + if self.revision:
> + label_parts.append(RUN_KIND_SEPARATORS[0])
> + label_parts.append(self.revision)
> + if self.levels or self.spread:
> + label_parts.append(RUN_KIND_SEPARATORS[1])
> + if self.levels:
> + label_parts.append(str(self.levels))
> + if self.spread:
> + label_parts.append(RUN_KIND_SEPARATORS[2])
> + label_parts.append(str(self.spread))
> + self.label = ''.join(label_parts)
> +
> + def args(self):
> + return (self.branch, self.revision, self.levels, self.spread)
> +
> +
> +PATHNAME_VALID_CHARS = "-_.,@%s%s" % (string.ascii_letters, string.digits)
> +def filesystem_safe_string(s):
> + return ''.join(c for c in s if c in PATHNAME_VALID_CHARS)
> +
> +def do_div(a, b):
> + if b:
> + return float(a) / float(b)
> + else:
> + return 0.0
>
> - return '\n'.join(s)
> +def do_diff(a, b):
> + return float(a) - float(b)
>
>
> - def compare_to(self, other, verbose):
> - def do_div(a, b):
> - if b:
> - return float(a) / float(b)
> - else:
> - return 0.0
> +# ------------------------- database -------------------------
>
> - def do_diff(a, b):
> - return float(a) - float(b)
> +class TimingsDb:
> + def __init__(self, db_path):
> + self.db_path = db_path;
> + self.conn = sqlite3.connect(db_path)
> + self.ensure_tables_created()
>
> - selfname = self.name
> - if not selfname:
> - selfname = 'unnamed'
> - othername = other.name
> - if not othername:
> - othername = 'the other'
> -
> - selftotal = self.min_max_avg(TOTAL_RUN)[2]
> - othertotal = other.min_max_avg(TOTAL_RUN)[2]
> -
> - s = ['COMPARE %s to %s' % (othername, selfname)]
> -
> - if TOTAL_RUN in self.timings and TOTAL_RUN in other.timings:
> - s.append(' %s timings: %5.1f seconds avg for %s'
> - % (TOTAL_RUN, othertotal, othername))
> - s.append(' %s %5.1f seconds avg for %s'
> - % (' ' * len(TOTAL_RUN), selftotal, selfname))
> + def ensure_tables_created(self):
> + c = self.conn.cursor()
>
> + c.execute("""SELECT name FROM sqlite_master WHERE type='table' AND
> + name='batch'""")
> + if c.fetchone():
> + # exists
> + return
>
> - if not verbose:
> - s.append(' avg operation')
> + print 'Creating database tables.'
> + c.executescript('''
> + CREATE TABLE batch (
> + batch_id INTEGER PRIMARY KEY AUTOINCREMENT,
> + started TEXT,
> + ended TEXT
> + );
> +
> + CREATE TABLE run_kind (
> + run_kind_id INTEGER PRIMARY KEY AUTOINCREMENT,
> + branch TEXT NOT NULL,
> + revision TEXT NOT NULL,
> + wc_levels INTEGER,
> + wc_spread INTEGER,
> + UNIQUE(branch, revision, wc_levels, wc_spread)
> + );
> +
> + CREATE TABLE run (
> + run_id INTEGER PRIMARY KEY AUTOINCREMENT,
> + batch_id INTEGER NOT NULL REFERENCES batch(batch_id),
> + run_kind_id INTEGER NOT NULL REFERENCES run_kind(run_kind_id),
> + started TEXT,
> + ended TEXT,
> + aborted INTEGER
> + );
> +
> + CREATE TABLE timings (
> + run_id INTEGER NOT NULL REFERENCES run(run_id),
> + command TEXT NOT NULL,
> + sequence INTEGER,
> + timing REAL
> + );'''
> + )
> + self.conn.commit()
> + c.close();
> +
> +
> +class Batch:
> + def __init__(self, db):
> + self.db = db
> + self.started = time_str()
> + c = db.conn.cursor()
> + c.execute("INSERT INTO batch (started) values (?)", (self.started,))
> + db.conn.commit()
> + self.id = c.lastrowid
> + c.close()
> +
> + def done(self):
> + conn = self.db.conn
> + c = conn.cursor()
> + c.execute("""
> + UPDATE batch
> + SET ended = ?
> + WHERE batch_id = ?""",
> + (time_str(), self.id))
> + conn.commit()
> + c.close()
> +
> +class Run:
> + def __init__(self, batch, run_kind):
> + self.batch = batch
> + conn = self.batch.db.conn
> + c = conn.cursor()
> +
> + c.execute("""
> + SELECT run_kind_id FROM run_kind
> + WHERE branch = ?
> + AND revision = ?
> + AND wc_levels = ?
> + AND wc_spread = ?""",
> + run_kind.args())
> + kind_ids = c.fetchone()
> + if kind_ids:
> + kind_id = kind_ids[0]
> else:
> - s.append(' min max avg operation')
> -
> - names = sorted(self.timings.keys())
> -
> - for name in names:
> - if not name in other.timings:
> - continue
> + c.execute("""
> + INSERT INTO run_kind (branch, revision, wc_levels, wc_spread)
> + VALUES (?, ?, ?, ?)""",
> + run_kind.args())
> + conn.commit()
> + kind_id = c.lastrowid
> +
> + self.started = time_str()
> +
> + c.execute("""
> + INSERT INTO run
> + (batch_id, run_kind_id, started)
> + VALUES
> + (?, ?, ?)""",
> + (self.batch.id, kind_id, self.started))
> + conn.commit()
> + self.id = c.lastrowid
> + c.close();
> + self.tic_at = None
> + self.current_command = None
> + self.timings = []
>
> + def tic(self, command):
> + if command in IGNORE_COMMANDS:
> + return
> + self.toc()
> + self.current_command = command
> + self.tic_at = datetime.datetime.now()
>
> - min_me, max_me, avg_me = self.min_max_avg(name)
> - min_other, max_other, avg_other = other.min_max_avg(name)
> + def toc(self):
> + if self.current_command and self.tic_at:
> + toc_at = datetime.datetime.now()
> + self.remember_timing(self.current_command,
> + timedelta_to_seconds(toc_at - self.tic_at))
> + self.current_command = None
> + self.tic_at = None
>
> - avg_str = '%7.2f|%+7.3f' % (do_div(avg_me, avg_other),
> - do_diff(avg_me, avg_other))
> + def remember_timing(self, command, seconds):
> + self.timings.append((command, seconds))
>
> - if not verbose:
> - s.append('%-16s %s' % (avg_str, name))
> + def submit_timings(self):
> + conn = self.batch.db.conn
> + c = conn.cursor()
> + print 'submitting...'
> +
> + c.executemany("""
> + INSERT INTO timings
> + (run_id, command, sequence, timing)
> + VALUES
> + (?, ?, ?, ?)""",
> + [(self.id, t[0], (i + 1), t[1]) for i,t in enumerate(self.timings)])
> +
> + conn.commit()
> + c.close()
> +
> + def done(self, aborted=False):
> + conn = self.batch.db.conn
> + c = conn.cursor()
> + c.execute("""
> + UPDATE run
> + SET ended = ?, aborted = ?
> + WHERE run_id = ?""",
> + (time_str(), aborted, self.id))
> + conn.commit()
> + c.close()
> +
> +
> +class TimingQuery:
> + def __init__(self, db, run_kind):
> + self.cursor = db.conn.cursor()
> + self.constraints = []
> + self.values = []
> + self.timings = None
> + self.FROM_WHERE = """
> + FROM batch AS b,
> + timings AS t,
> + run AS r,
> + run_kind as k
> + WHERE
> + t.run_id = r.run_id
> + AND k.run_kind_id = r.run_kind_id
> + AND b.batch_id = r.batch_id
> + AND r.aborted = 0
> + """
> + self.append_constraint('k', 'branch', run_kind.branch)
> + self.append_constraint('k', 'revision', run_kind.revision)
> + self.append_constraint('k', 'wc_levels', run_kind.levels)
> + self.append_constraint('k', 'wc_spread', run_kind.spread)
> + self.label = run_kind.label
> +
> + def append_constraint(self, table, name, val):
> + if val:
> + self.constraints.append('AND %s.%s = ?' % (table, name))
> + self.values.append(val)
> +
> + def remove_last_constraint(self):
> + del self.constraints[-1]
> + del self.values[-1]
> +
> + def get_sorted_X(self, x, n=1):
> + query = ['SELECT DISTINCT %s' % x,
> + self.FROM_WHERE ]
> + query.extend(self.constraints)
> + query.append('ORDER BY %s' % x)
> + c = db.conn.cursor()
> + try:
> + #print ' '.join(query)
> + c.execute(' '.join(query), self.values)
> + if n == 1:
> + return [tpl[0] for tpl in c.fetchall()]
> else:
> - min_str = '%7.2f|%+7.3f' % (do_div(min_me, min_other),
> - do_diff(min_me, min_other))
> - max_str = '%7.2f|%+7.3f' % (do_div(max_me, max_other),
> - do_diff(max_me, max_other))
> -
> - s.append('%-16s %-16s %-16s %s' % (min_str, max_str, avg_str, name))
> -
> - s.extend([
> - '(legend: "1.23|+0.45" means: slower by factor 1.23 and by 0.45 seconds;',
> - ' factor < 1 and difference < 0 means \'%s\' is faster than \'%s\')'
> - % (self.name, othername)])
> -
> - return '\n'.join(s)
> -
> -
> - def add(self, other):
> - for name, other_times in other.timings.items():
> - my_times = self.timings.get(name)
> - if not my_times:
> - my_times = []
> - self.timings[name] = my_times
> - my_times.extend(other_times)
> + return c.fetchall()
> + finally:
> + c.close()
>
> + def get_sorted_command_names(self):
> + return self.get_sorted_X('t.command')
>
> + def get_sorted_branches(self):
> + return self.get_sorted_X('k.branch')
>
> + def get_sorted_revisions(self):
> + return self.get_sorted_X('k.revision')
> +
> + def get_sorted_levels_spread(self):
> + return self.get_sorted_X('k.wc_levels,k.wc_spread', n = 2)
> +
> + def count_runs_batches(self):
> + query = ["""SELECT
> + count(DISTINCT r.run_id),
> + count(DISTINCT b.batch_id)""",
> + self.FROM_WHERE ]
> + query.extend(self.constraints)
> + c = db.conn.cursor()
> + try:
> + #print ' '.join(query)
> + c.execute(' '.join(query), self.values)
> + return c.fetchone()
> + finally:
> + c.close()
>
> -j = os.path.join
> + def get_command_timings(self, command):
> + query = ["""SELECT
> + count(t.timing),
> + min(t.timing),
> + max(t.timing),
> + avg(t.timing)""",
> + self.FROM_WHERE ]
> + self.append_constraint('t', 'command', command)
> + try:
> + query.extend(self.constraints)
> + c = db.conn.cursor()
> + try:
> + c.execute(' '.join(query), self.values)
> + return c.fetchone()
> + finally:
> + c.close()
> + finally:
> + self.remove_last_constraint()
>
> -_create_count = 0
> + def get_timings(self):
> + if self.timings:
> + return self.timings
> + self.timings = {}
> + for command_name in self.get_sorted_command_names():
> + self.timings[command_name] = self.get_command_timings(command_name)
> + return self.timings
> +
>
> -def next_name(prefix):
> - global _create_count
> - _create_count += 1
> - return '_'.join((prefix, str(_create_count)))
> +# ------------------------------------------------------------ run tests
>
> -def create_tree(in_dir, levels, spread=5):
> - try:
> - os.mkdir(in_dir)
> - except:
> - pass
> -
> - for i in range(spread):
> - # files
> - fn = j(in_dir, next_name('file'))
> - f = open(fn, 'w')
> - f.write('This is %s\n' % fn)
> - f.close()
>
> - # dirs
> - if (levels > 1):
> - dn = j(in_dir, next_name('dir'))
> - create_tree(dn, levels - 1, spread)
> +def perform_run(batch, run_kind,
> + svn_bin, svnadmin_bin, verbose):
>
> + run = Run(batch, run_kind)
>
> -def svn(*args):
> - name = args[0]
> + def create_tree(in_dir, _levels, _spread):
> + try:
> + os.mkdir(in_dir)
> + except:
> + pass
> +
> + for i in range(_spread):
> + # files
> + fn = j(in_dir, next_unique_basename('file'))
> + f = open(fn, 'w')
> + f.write('This is %s\n' % fn)
> + f.close()
> +
> + # dirs
> + if (_levels > 1):
> + dn = j(in_dir, next_unique_basename('dir'))
> + create_tree(dn, _levels - 1, _spread)
> +
> + def svn(*args):
> + name = args[0]
> +
> + cmd = [ svn_bin ]
> + cmd.extend( list(args) )
> + if verbose:
> + print 'svn cmd:', ' '.join(cmd)
> +
> + stdin = None
> + if stdin:
> + stdin_arg = subprocess.PIPE
> + else:
> + stdin_arg = None
>
> - ### options comes from the global namespace; it should be passed
> - cmd = [options.svn] + list(args)
> - if options.verbose:
> - print 'svn cmd:', ' '.join(cmd)
> + run.tic(name)
> + try:
> + p = subprocess.Popen(cmd,
> + stdin=stdin_arg,
> + stdout=subprocess.PIPE,
> + stderr=subprocess.PIPE,
> + shell=False)
> + stdout,stderr = p.communicate(input=stdin)
> + except OSError:
> + stdout = stderr = None
> + finally:
> + run.toc()
>
> - stdin = None
> - if stdin:
> - stdin_arg = subprocess.PIPE
> - else:
> - stdin_arg = None
> + if verbose:
> + if (stdout):
> + print "STDOUT: [[[\n%s]]]" % ''.join(stdout)
> + if (stderr):
> + print "STDERR: [[[\n%s]]]" % ''.join(stderr)
>
> - ### timings comes from the global namespace; it should be passed
> - timings.tic(name)
> - try:
> - p = subprocess.Popen(cmd,
> - stdin=stdin_arg,
> - stdout=subprocess.PIPE,
> - stderr=subprocess.PIPE,
> - shell=False)
> - stdout,stderr = p.communicate(input=stdin)
> - except OSError:
> - stdout = stderr = None
> - finally:
> - timings.toc()
> -
> - if options.verbose:
> - if (stdout):
> - print "STDOUT: [[[\n%s]]]" % ''.join(stdout)
> - if (stderr):
> - print "STDERR: [[[\n%s]]]" % ''.join(stderr)
> + return stdout,stderr
>
> - return stdout,stderr
>
> + def add(*args):
> + return svn('add', *args)
>
> -def add(*args):
> - return svn('add', *args)
> + def ci(*args):
> + return svn('commit', '-mm', *args)
>
> -def ci(*args):
> - return svn('commit', '-mm', *args)
> + def up(*args):
> + return svn('update', *args)
>
> -def up(*args):
> - return svn('update', *args)
> + def st(*args):
> + return svn('status', *args)
>
> -def st(*args):
> - return svn('status', *args)
> + def info(*args):
> + return svn('info', *args)
>
> -def info(*args):
> - return svn('info', *args)
> + _chars = [chr(x) for x in range(ord('a'), ord('z') +1)]
>
> -_chars = [chr(x) for x in range(ord('a'), ord('z') +1)]
> + def randstr(len=8):
> + return ''.join( [random.choice(_chars) for i in range(len)] )
>
> -def randstr(len=8):
> - return ''.join( [random.choice(_chars) for i in range(len)] )
> + def _copy(path):
> + dest = next_unique_basename(path + '_copied')
> + svn('copy', path, dest)
>
> -def _copy(path):
> - dest = next_name(path + '_copied')
> - svn('copy', path, dest)
> + def _move(path):
> + dest = path + '_moved'
> + svn('move', path, dest)
>
> -def _move(path):
> - dest = path + '_moved'
> - svn('move', path, dest)
> + def _propmod(path):
> + so, se = svn('proplist', path)
> + propnames = [line.strip() for line in so.strip().split('\n')[1:]]
>
> -def _propmod(path):
> - so, se = svn('proplist', path)
> - propnames = [line.strip() for line in so.strip().split('\n')[1:]]
> + # modify?
> + if len(propnames):
> + svn('ps', propnames[len(propnames) / 2], randstr(), path)
>
> - # modify?
> - if len(propnames):
> - svn('ps', propnames[len(propnames) / 2], randstr(), path)
> + # del?
> + if len(propnames) > 1:
> + svn('propdel', propnames[len(propnames) / 2], path)
>
> - # del?
> - if len(propnames) > 1:
> - svn('propdel', propnames[len(propnames) / 2], path)
> + def _propadd(path):
> + # set a new one.
> + svn('propset', randstr(), randstr(), path)
>
> + def _mod(path):
> + if os.path.isdir(path):
> + return _propmod(path)
>
> -def _propadd(path):
> - # set a new one.
> - svn('propset', randstr(), randstr(), path)
> + f = open(path, 'a')
> + f.write('\n%s\n' % randstr())
> + f.close()
>
> + def _add(path):
> + if os.path.isfile(path):
> + return _mod(path)
> +
> + if random.choice((True, False)):
> + # create a dir
> + svn('mkdir', j(path, next_unique_basename('new_dir')))
> + else:
> + # create a file
> + new_path = j(path, next_unique_basename('new_file'))
> + f = open(new_path, 'w')
> + f.write(randstr())
> + f.close()
> + svn('add', new_path)
> +
> + def _del(path):
> + svn('delete', path)
> +
> + _mod_funcs = (_mod, _add, _propmod, _propadd, )#_copy,) # _move, _del)
> +
> + def modify_tree(in_dir, fraction):
> + child_names = os.listdir(in_dir)
> + for child_name in child_names:
> + if child_name[0] == '.':
> + continue
> + if random.random() < fraction:
> + path = j(in_dir, child_name)
> + random.choice(_mod_funcs)(path)
>
> -def _mod(path):
> - if os.path.isdir(path):
> - return _propmod(path)
> + for child_name in child_names:
> + if child_name[0] == '.': continue
> + path = j(in_dir, child_name)
> + if os.path.isdir(path):
> + modify_tree(path, fraction)
>
> - f = open(path, 'a')
> - f.write('\n%s\n' % randstr())
> - f.close()
> + def propadd_tree(in_dir, fraction):
> + for child_name in os.listdir(in_dir):
> + if child_name[0] == '.': continue
> + path = j(in_dir, child_name)
> + if random.random() < fraction:
> + _propadd(path)
> + if os.path.isdir(path):
> + propadd_tree(path, fraction)
> +
> +
> + def rmtree_onerror(func, path, exc_info):
> + """Error handler for ``shutil.rmtree``.
> +
> + If the error is due to an access error (read only file)
> + it attempts to add write permission and then retries.
> +
> + If the error is for another reason it re-raises the error.
> +
> + Usage : ``shutil.rmtree(path, onerror=onerror)``
> + """
> + if not os.access(path, os.W_OK):
> + # Is the error an access error ?
> + os.chmod(path, stat.S_IWUSR)
> + func(path)
> + else:
> + raise
>
> -def _add(path):
> - if os.path.isfile(path):
> - return _mod(path)
> + base = tempfile.mkdtemp()
>
> - if random.choice((True, False)):
> - # create a dir
> - svn('mkdir', j(path, next_name('new_dir')))
> - else:
> - # create a file
> - new_path = j(path, next_name('new_file'))
> - f = open(new_path, 'w')
> - f.write(randstr())
> - f.close()
> - svn('add', new_path)
> + # ensure identical modifications for every run
> + random.seed(0)
>
> -def _del(path):
> - svn('delete', path)
> + aborted = True
>
> -_mod_funcs = (_mod, _add, _propmod, _propadd, )#_copy,) # _move, _del)
> + try:
> + repos = j(base, 'repos')
> + repos = repos.replace('\\', '/')
> + wc = j(base, 'wc')
> + wc2 = j(base, 'wc2')
>
> -def modify_tree(in_dir, fraction):
> - child_names = os.listdir(in_dir)
> - for child_name in child_names:
> - if child_name[0] == '.':
> - continue
> - if random.random() < fraction:
> - path = j(in_dir, child_name)
> - random.choice(_mod_funcs)(path)
> + if repos.startswith('/'):
> + file_url = 'file://%s' % repos
> + else:
> + file_url = 'file:///%s' % repos
>
> - for child_name in child_names:
> - if child_name[0] == '.': continue
> - path = j(in_dir, child_name)
> - if os.path.isdir(path):
> - modify_tree(path, fraction)
> + print '\nRunning svn benchmark in', base
> + print 'dir levels: %s; new files and dirs per leaf: %s' %(
> + run_kind.levels, run_kind.spread)
>
> -def propadd_tree(in_dir, fraction):
> - for child_name in os.listdir(in_dir):
> - if child_name[0] == '.': continue
> - path = j(in_dir, child_name)
> - if random.random() < fraction:
> - _propadd(path)
> - if os.path.isdir(path):
> - propadd_tree(path, fraction)
> + started = datetime.datetime.now()
>
> + try:
> + run_cmd([svnadmin_bin, 'create', repos])
> + svn('checkout', file_url, wc)
>
> -def rmtree_onerror(func, path, exc_info):
> - """Error handler for ``shutil.rmtree``.
> + trunk = j(wc, 'trunk')
> + create_tree(trunk, run_kind.levels, run_kind.spread)
> + add(trunk)
> + st(wc)
> + ci(wc)
> + up(wc)
> + propadd_tree(trunk, 0.5)
> + ci(wc)
> + up(wc)
> + st(wc)
> + info('-R', wc)
> +
> + trunk_url = file_url + '/trunk'
> + branch_url = file_url + '/branch'
> +
> + svn('copy', '-mm', trunk_url, branch_url)
> + st(wc)
> +
> + up(wc)
> + st(wc)
> + info('-R', wc)
> +
> + svn('checkout', trunk_url, wc2)
> + st(wc2)
> + modify_tree(wc2, 0.5)
> + st(wc2)
> + ci(wc2)
> + up(wc2)
> + up(wc)
> +
> + svn('switch', branch_url, wc2)
> + modify_tree(wc2, 0.5)
> + st(wc2)
> + info('-R', wc2)
> + ci(wc2)
> + up(wc2)
> + up(wc)
> +
> + modify_tree(trunk, 0.5)
> + st(wc)
> + ci(wc)
> + up(wc2)
> + up(wc)
> +
> + svn('merge', '--accept=postpone', trunk_url, wc2)
> + st(wc2)
> + info('-R', wc2)
> + svn('resolve', '--accept=mine-conflict', wc2)
> + st(wc2)
> + svn('resolved', '-R', wc2)
> + st(wc2)
> + info('-R', wc2)
> + ci(wc2)
> + up(wc2)
> + up(wc)
> +
> + svn('merge', '--accept=postpone', '--reintegrate', branch_url, trunk)
> + st(wc)
> + svn('resolve', '--accept=mine-conflict', wc)
> + st(wc)
> + svn('resolved', '-R', wc)
> + st(wc)
> + ci(wc)
> + up(wc2)
> + up(wc)
> +
> + svn('delete', j(wc, 'branch'))
> + ci(wc)
> + up(wc)
>
> - If the error is due to an access error (read only file)
> - it attempts to add write permission and then retries.
> + aborted = False
>
> - If the error is for another reason it re-raises the error.
> + finally:
> + stopped = datetime.datetime.now()
> + print '\nDone with svn benchmark in', (stopped - started)
>
> - Usage : ``shutil.rmtree(path, onerror=onerror)``
> - """
> - if not os.access(path, os.W_OK):
> - # Is the error an access error ?
> - os.chmod(path, stat.S_IWUSR)
> - func(path)
> - else:
> - raise
> + run.remember_timing(TOTAL_RUN,
> + timedelta_to_seconds(stopped - started))
> + finally:
> + run.done(aborted)
> + run.submit_timings()
> + shutil.rmtree(base, onerror=rmtree_onerror)
>
> + return aborted
>
> -def run(levels, spread, N):
> - for i in range(N):
> - base = tempfile.mkdtemp()
>
> - # ensure identical modifications for every run
> - random.seed(0)
> +# ---------------------------------------------------------------------
>
> - try:
> - repos = j(base, 'repos')
> - repos = repos.replace('\\', '/')
> - wc = j(base, 'wc')
> - wc2 = j(base, 'wc2')
> +
> +def cmdline_run(db, options, run_kind_str, N=1):
> + run_kind = RunKind(run_kind_str)
> + N = int(N)
>
> - if repos.startswith('/'):
> - file_url = 'file://%s' % repos
> - else:
> - file_url = 'file:///%s' % repos
> + print 'Hi, going to run a Subversion benchmark series of %d runs...' % N
> + print 'Label is %s' % run_kind.label
>
> - so, se = svn('--version')
> - if not so:
> - ### options comes from the global namespace; it should be passed
> - print "Can't find svn at", options.svn
> - exit(1)
> - version = ', '.join([s.strip() for s in so.split('\n')[:2]])
> -
> - print '\nRunning svn benchmark in', base
> - print 'dir levels: %s; new files and dirs per leaf: %s; run %d of %d' %(
> - levels, spread, i + 1, N)
> + # can we run the svn binaries?
> + svn_bin = j(options.svn_bin_dir, 'svn')
> + svnadmin_bin = j(options.svn_bin_dir, 'svnadmin')
> +
> + for b in (svn_bin, svnadmin_bin):
> + so,se = run_cmd([b, '--version'])
> + if not so:
> + print "Can't run", b
> + exit(1)
>
> - print version
> - started = datetime.datetime.now()
> + print ', '.join([s.strip() for s in so.split('\n')[:2]])
>
> - try:
> - run_cmd(['svnadmin', 'create', repos])
> - svn('checkout', file_url, wc)
> + batch = Batch(db)
>
> - trunk = j(wc, 'trunk')
> - create_tree(trunk, levels, spread)
> - add(trunk)
> - st(wc)
> - ci(wc)
> - up(wc)
> - propadd_tree(trunk, 0.5)
> - ci(wc)
> - up(wc)
> - st(wc)
> - info('-R', wc)
> -
> - trunk_url = file_url + '/trunk'
> - branch_url = file_url + '/branch'
> -
> - svn('copy', '-mm', trunk_url, branch_url)
> - st(wc)
> -
> - up(wc)
> - st(wc)
> - info('-R', wc)
> -
> - svn('checkout', trunk_url, wc2)
> - st(wc2)
> - modify_tree(wc2, 0.5)
> - st(wc2)
> - ci(wc2)
> - up(wc2)
> - up(wc)
> -
> - svn('switch', branch_url, wc2)
> - modify_tree(wc2, 0.5)
> - st(wc2)
> - info('-R', wc2)
> - ci(wc2)
> - up(wc2)
> - up(wc)
> -
> - modify_tree(trunk, 0.5)
> - st(wc)
> - ci(wc)
> - up(wc2)
> - up(wc)
> -
> - svn('merge', '--accept=postpone', trunk_url, wc2)
> - st(wc2)
> - info('-R', wc2)
> - svn('resolve', '--accept=mine-conflict', wc2)
> - st(wc2)
> - svn('resolved', '-R', wc2)
> - st(wc2)
> - info('-R', wc2)
> - ci(wc2)
> - up(wc2)
> - up(wc)
> -
> - svn('merge', '--accept=postpone', '--reintegrate', branch_url, trunk)
> - st(wc)
> - svn('resolve', '--accept=mine-conflict', wc)
> - st(wc)
> - svn('resolved', '-R', wc)
> - st(wc)
> - ci(wc)
> - up(wc2)
> - up(wc)
> -
> - svn('delete', j(wc, 'branch'))
> - ci(wc)
> - up(wc2)
> - up(wc)
> + for i in range(N):
> + print 'Run %d of %d' % (i + 1, N)
> + perform_run(batch, run_kind,
> + svn_bin, svnadmin_bin, options.verbose)
> +
> + batch.done()
> +
> +
> +def cmdline_list(db, options, run_kind_str=None):
> + run_kind = RunKind(run_kind_str)
> +
> + constraints = []
> + def add_if_not_none(name, val):
> + if val:
> + constraints.append(' %s = %s' % (name, val))
> + add_if_not_none('branch', run_kind.branch)
> + add_if_not_none('revision', run_kind.revision)
> + add_if_not_none('levels', run_kind.levels)
> + add_if_not_none('spread', run_kind.spread)
> + if constraints:
> + print 'For\n', '\n'.join(constraints)
> + print 'I found:'
> +
> + d = TimingQuery(db, run_kind)
> +
> + cmd_names = d.get_sorted_command_names()
> + if cmd_names:
> + print '\n%d command names:\n ' % len(cmd_names), '\n '.join(cmd_names)
> +
> + branches = d.get_sorted_branches()
> + if branches and (len(branches) > 1 or branches[0] != run_kind.branch):
> + print '\n%d branches:\n ' % len(branches), '\n '.join(branches)
> +
> + revisions = d.get_sorted_revisions()
> + if revisions and (len(revisions) > 1 or revisions[0] != run_kind.revision):
> + print '\n%d revisions:\n ' % len(revisions), '\n '.join(revisions)
> +
> + levels_spread = d.get_sorted_levels_spread()
> + if levels_spread and (
> + len(levels_spread) > 1
> + or levels_spread[0] != (run_kind.levels, run_kind.spread)):
> + print '\n%d kinds of levels x spread:\n ' % len(levels_spread), '\n '.join(
> + [ ('%dx%d' % (l, s)) for l,s in levels_spread ])
> +
> + print "\n%d runs in %d batches.\n" % (d.count_runs_batches())
> +
> +
> +def cmdline_show(db, options, *run_kind_strings):
> + for run_kind_str in run_kind_strings:
> + run_kind = RunKind(run_kind_str)
>
> + q = TimingQuery(db, run_kind)
> + timings = q.get_timings()
>
> - finally:
> - stopped = datetime.datetime.now()
> - print '\nDone with svn benchmark in', (stopped - started)
> + s = []
> + s.append('Timings for %s' % run_kind.label)
> + s.append(' N min max avg operation (unit is seconds)')
>
> - ### timings comes from the global namespace; it should be passed
> - timings.submit_timing(TOTAL_RUN,
> - timedelta_to_seconds(stopped - started))
> -
> - # rename ps to prop mod
> - if timings.timings.get('ps'):
> - has = timings.timings.get('prop mod')
> - if not has:
> - has = []
> - timings.timings['prop mod'] = has
> - has.extend( timings.timings['ps'] )
> - del timings.timings['ps']
> + for command_name in q.get_sorted_command_names():
> + if options.command_names and command_name not in options.command_names:
> + continue
> + n, tmin, tmax, tavg = timings[command_name]
>
> - print timings.summary()
> - finally:
> - shutil.rmtree(base, onerror=rmtree_onerror)
> + s.append('%5d %7.2f %7.2f %7.2f %s' % (
> + n,
> + tmin,
> + tmax,
> + tavg,
> + command_name))
>
> + print '\n'.join(s)
>
> -def read_from_file(file_path):
> - f = open(file_path, 'rb')
> - try:
> - instance = cPickle.load(f)
> - instance.name = os.path.basename(file_path)
> - finally:
> - f.close()
> - return instance
>
> +def cmdline_compare(db, options, left_str, right_str):
> + left_kind = RunKind(left_str)
> + right_kind = RunKind(right_str)
>
> -def write_to_file(file_path, instance):
> - f = open(file_path, 'wb')
> - cPickle.dump(instance, f)
> - f.close()
> + leftq = TimingQuery(db, left_kind)
> + left = leftq.get_timings()
> + if not left:
> + print "No timings for", left_kind.label
> + exit(1)
>
> -def cmd_compare(path1, path2):
> - t1 = read_from_file(path1)
> - t2 = read_from_file(path2)
> + rightq = TimingQuery(db, right_kind)
> + right = rightq.get_timings()
> + if not right:
> + print "No timings for", right_kind.label
> + exit(1)
>
> - if options.verbose:
> - print t1.summary()
> - print '---'
> - print t2.summary()
> - print '---'
> - print t2.compare_to(t1, options.verbose)
> -
> -def cmd_combine(dest, *paths):
> - total = Timings('--version');
> -
> - for path in paths:
> - t = read_from_file(path)
> - total.add(t)
> -
> - print total.summary()
> - write_to_file(dest, total)
> -
> -def cmd_run(timings_path, levels, spread, N=1):
> - levels = int(levels)
> - spread = int(spread)
> - N = int(N)
> + label = 'Compare %s to %s' % (left_kind.label, right_kind.label)
>
> - print '\n\nHi, going to run a Subversion benchmark series of %d runs...' % N
> + s = [label]
>
> - ### UGH! should pass to run()
> - ### neels: Today I contemplated doing that, but at the end of the day
> - ### it merely blows up the code without much benefit. If this
> - ### ever becomes part of an imported python package, call again.
> - global timings
> -
> - if os.path.isfile(timings_path):
> - print 'Going to add results to existing file', timings_path
> - timings = read_from_file(timings_path)
> + verbose = options.verbose
> + if not verbose:
> + s.append(' N avg operation')
> else:
> - print 'Going to write results to new file', timings_path
> - timings = Timings('--version')
> + s.append(' N min max avg operation')
>
> - run(levels, spread, N)
> + command_names = [name for name in leftq.get_sorted_command_names()
> + if name in right]
> + if options.command_names:
> + command_names = [name for name in command_names
> + if name in options.command_names]
> +
> + for command_name in command_names:
> + left_N, left_min, left_max, left_avg = left[command_name]
> + right_N, right_min, right_max, right_avg = right[command_name]
> +
> + N_str = '%d/%d' % (left_N, right_N)
> + avg_str = '%7.2f|%+7.3f' % (do_div(left_avg, right_avg),
> + do_diff(left_avg, right_avg))
>
> - write_to_file(timings_path, timings)
> -
> -def cmd_show(*paths):
> - for timings_path in paths:
> - timings = read_from_file(timings_path)
> - print '---\n%s' % timings_path
> - print timings.summary()
> + if not verbose:
> + s.append('%9s %-16s %s' % (N_str, avg_str, command_name))
> + else:
> + min_str = '%7.2f|%+7.3f' % (do_div(left_min, right_min),
> + do_diff(left_min, right_min))
> + max_str = '%7.2f|%+7.3f' % (do_div(left_max, right_max),
> + do_diff(left_max, right_max))
> +
> + s.append('%9s %-16s %-16s %-16s %s' % (N_str, min_str, max_str, avg_str,
> + command_name))
> +
> + s.extend([
> + '(legend: "1.23|+0.45" means: slower by factor 1.23 and by 0.45 seconds;',
> + ' factor < 1 and difference < 0 means \'%s\' is faster.'
> + % left_kind.label,
> + ' "2/3" means: \'%s\' has 2 timings on record, the other has 3.)'
> + % left_kind.label
> + ])
> +
> +
> + print '\n'.join(s)
> +
> +
> +# ------------------------------------------------------- charts
> +
> +def cmdline_chart_compare(db, options, *args):
> + import numpy as np
> + import matplotlib.pyplot as plt
> +
> + labels = []
> + timing_sets = []
> + command_names = None
> +
> + for arg in args:
> + run_kind = RunKind(arg)
> + query = TimingQuery(db, run_kind)
> + timings = query.get_timings()
> + if not timings:
> + print "No timings for", run_kind.label
> + exit(1)
> + labels.append(run_kind.label)
> + timing_sets.append(timings)
>
> + if command_names:
> + for i in range(len(command_names)):
> + if not command_names[i] in timings:
> + del command_names[i]
> + else:
> + command_names = query.get_sorted_command_names()
>
> -def usage():
> - print __doc__
> + if options.command_names:
> + command_names = [name for name in command_names
> + if name in options.command_names]
> +
> + chart_path = options.chart_path
> + if not chart_path:
> + chart_path = 'compare_' + '_'.join(
> + [ filesystem_safe_string(l) for l in labels ]
> + ) + '.svg'
> +
> + print '\nwriting chart file:', chart_path
> +
> + N = len(command_names)
> + M = len(timing_sets) - 1
> +
> + ind = np.arange(N) # the x locations for the groups
> + width = 1. / (1.2 + M) # the width of the bars
> + dist = 0.15
> +
> + fig = plt.figure(figsize=(0.33*N*M,12))
> + plot1 = fig.add_subplot(211)
> + plot2 = fig.add_subplot(212)
> +
> + # invisible lines that make sure the scale doesn't get minuscule
> + plot1.axhline(y=101, color='white', linewidth=0.01)
> + plot1.axhline(y=95.0, color='white', linewidth=0.01)
> + plot2.axhline(y=0.1, color='white', linewidth=0.01)
> + plot2.axhline(y=-0.5, color='white', linewidth=0.01)
> +
> + reference = timing_sets[0]
> +
> + ofs = 0
> +
> + for label_i in range(1, len(labels)):
> + timings = timing_sets[label_i]
> + divs = []
> + diffs = []
> + divs_color = []
> + deviations = []
> + for command_name in command_names:
> + ref_N, ref_min, ref_max, ref_avg = reference[command_name]
> + this_N, this_min, this_max, this_avg = timings[command_name]
> +
> + val = 100. * (do_div(ref_avg, this_avg) - 1.0)
> + if val < 0:
> + col = '#55dd55'
> + else:
> + col = '#dd5555'
> + divs.append(val)
> + divs_color.append(col)
> + diffs.append( do_diff(ref_avg, this_avg) )
> + deviations.append(this_max / this_min)
> +
> + rects = plot1.bar(ind + ofs, divs, width * (1.0 - dist),
> + color=divs_color, bottom=100.0, edgecolor='none')
> +
> + for i in range(len(rects)):
> + x = rects[i].get_x() + width / 2.2
> + div = divs[i]
> + label = labels[label_i]
> +
> + plot1.text(x, 100.,
> + ' %+5.1f%% %s' % (div,label),
> + ha='center', va='top', size='small',
> + rotation=-90, family='monospace')
> +
> + rects = plot2.bar(ind + ofs, diffs, width * 0.9,
> + color=divs_color, bottom=0.0, edgecolor='none')
> +
> + for i in range(len(rects)):
> + x = rects[i].get_x() + width / 2.2
> + diff = diffs[i]
> + label = labels[label_i]
> +
> + plot2.text(x, 0.,
> + ' %+5.2fs %s' % (diff,label),
> + ha='center', va='top', size='small',
> + rotation=-90, family='monospace')
> +
> + ofs += width
> +
> + plot1.set_title('Speed change compared to %s [%%]' % labels[0])
> + plot1.set_xticks(ind + (width / 2.))
> + plot1.set_xticklabels(command_names, rotation=-55,
> + horizontalalignment='left',
> + size='x-small', weight='bold')
> + plot1.axhline(y=100.0, color='#555555', linewidth=0.2)
> + plot2.set_title('[seconds]')
> + plot2.set_xticks(ind + (width / 2.))
> + plot2.set_xticklabels(command_names, rotation=-55,
> + horizontalalignment='left',
> + size='medium', weight='bold')
> + plot2.axhline(y=0.0, color='#555555', linewidth=0.2)
> +
> + margin = 1.5/(N*M)
> + fig.subplots_adjust(bottom=0.1, top=0.97,
> + left=margin,
> + right=1.0-(margin / 2.))
> +
> + #plot1.legend( (rects1[0], rects2[0]), (left_label, right_label) )
> +
> + #plt.show()
> + plt.savefig(chart_path)
> +
> +# ------------------------------------------------------------ main
> +
> +
> +# Custom option formatter, keeping newlines in the description.
> +# adapted from:
> +# http://groups.google.com/group/comp.lang.python/msg/09f28e26af0699b1
> +import textwrap
> +class IndentedHelpFormatterWithNL(optparse.IndentedHelpFormatter):
> + def format_description(self, description):
> + if not description: return ""
> + desc_width = self.width - self.current_indent
> + indent = " "*self.current_indent
> + bits = description.split('\n')
> + formatted_bits = [
> + textwrap.fill(bit,
> + desc_width,
> + initial_indent=indent,
> + subsequent_indent=indent)
> + for bit in bits]
> + result = "\n".join(formatted_bits) + "\n"
> + return result
>
> if __name__ == '__main__':
> - parser = optparse.OptionParser()
> + parser = optparse.OptionParser(formatter=IndentedHelpFormatterWithNL())
> # -h is automatically added.
> ### should probably expand the help for that. and see about -?
> parser.add_option('-v', '--verbose', action='store_true', dest='verbose',
> help='Verbose operation')
> - parser.add_option('--svn', action='store', dest='svn', default='svn',
> - help='Specify Subversion executable to use')
> + parser.add_option('-b', '--svn-bin-dir', action='store', dest='svn_bin_dir',
> + default='',
> + help='Specify directory to find Subversion binaries in')
> + parser.add_option('-f', '--db-path', action='store', dest='db_path',
> + default='benchmark.db',
> + help='Specify path to SQLite database file')
> + parser.add_option('-o', '--chart-path', action='store', dest='chart_path',
> + help='Supply a path for chart output.')
> + parser.add_option('-c', '--command-names', action='store',
> + dest='command_names',
> + help='Comma separated list of command names to limit to.')
> +
> + parser.set_description(__doc__)
> + parser.set_usage('')
>
> - ### should start passing this, but for now: make it global
> - global options
>
> options, args = parser.parse_args()
>
> + def usage(msg=None):
> + parser.print_help()
> + if msg:
> + print
> + print msg
> + exit(1)
> +
> # there should be at least one arg left: the sub-command
> if not args:
> - usage()
> - exit(1)
> + usage('No command argument supplied.')
>
> cmd = args[0]
> del args[0]
>
> - if cmd == 'compare':
> - if len(args) != 2:
> - usage()
> - exit(1)
> - cmd_compare(*args)
> + db = TimingsDb(options.db_path)
>
> - elif cmd == 'combine':
> - if len(args) < 3:
> + if cmd == 'run':
> + if len(args) < 1 or len(args) > 2:
> usage()
> - exit(1)
> - cmd_combine(*args)
> + cmdline_run(db, options, *args)
>
> - elif cmd == 'run':
> - if len(args) < 3 or len(args) > 4:
> + elif cmd == 'compare':
> + if len(args) < 2:
> usage()
> - exit(1)
> - cmd_run(*args)
> + cmdline_compare(db, options, *args)
> +
> + elif cmd == 'list':
> + cmdline_list(db, options, *args)
>
> elif cmd == 'show':
> - if not args:
> + cmdline_show(db, options, *args)
> +
> + elif cmd == 'chart':
> + if 'compare'.startswith(args[0]):
> + cmdline_chart_compare(db, options, *args[1:])
> + else:
> usage()
> - exit(1)
> - cmd_show(*args)
>
> else:
> - usage()
> + usage('Unknown command argument: %s' % cmd)
>
> Modified: subversion/trunk/tools/dev/benchmarks/suite1/run
> URL: http://svn.apache.org/viewvc/subversion/trunk/tools/dev/benchmarks/suite1/run?rev=1357564&r1=1357563&r2=1357564&view=diff
> ==============================================================================
> --- subversion/trunk/tools/dev/benchmarks/suite1/run (original)
> +++ subversion/trunk/tools/dev/benchmarks/suite1/run Thu Jul 5 11:46:17 2012
> @@ -17,31 +17,51 @@
> # specific language governing permissions and limitations
> # under the License.
>
> -# Where are the svn binaries you want to benchmark?
> -SVN_A_NAME="1.7.x"
> -SVN_A="$HOME/pat/bench/prefix/bin/svn"
> +# debug? Just uncomment.
> +DEBUG=DEBUG_
> +
> +# Subversion bin-dir used for maintenance of working copies
> +SVN_STABLE="$HOME/pat/stable/prefix/bin/"
> +
> +# Where to find the svn binaries you want to benchmark, what are their labels
> +# and Last Changed Revisions?
> +# side A
> +SVN_A_NAME="1.7.0"
> +SVN_A="$HOME/pat/bench/prefix/bin"
> +SVN_A_REV="$("$SVN_STABLE"/svnversion -c "$HOME/pat/bench/src" | sed 's/.*://')"
> +SVN_A_LABEL="$SVN_A_NAME@$SVN_A_REV"
> +
> +# side B
> SVN_B_NAME="trunk"
> -SVN_B="$HOME/pat/trunk/prefix/bin/svn"
> +SVN_B="$HOME/pat/trunk/prefix/bin"
> +SVN_B_REV="$("$SVN_STABLE"/svnversion -c "$HOME/pat/trunk/src" | sed 's/.*://')"
> +SVN_B_LABEL="$SVN_B_NAME@$SVN_B_REV"
>
> -benchmark="$PWD/benchmark.py"
> +echo "$SVN_A_LABEL vs. $SVN_B_LABEL"
>
> -parent="$(date +"%Y%m%d-%H%M%S")"
> -inital_workdir="$PWD"
> -mkdir "$parent"
> -cd "$parent"
> -pwd
> +# benchmark script and parameters...
> +benchmark="$PWD/benchmark.py"
>
> +db="$PWD/${DEBUG}benchmark.db"
>
> batch(){
> levels="$1"
> spread="$2"
> N="$3"
> - pre="${levels}x${spread}_"
> - "$benchmark" "--svn=$SVN_A" run "${pre}$SVN_A_NAME" $levels $spread $N >/dev/null
> - "$benchmark" "--svn=$SVN_B" run "${pre}$SVN_B_NAME" $levels $spread $N >/dev/null
> +
> + # SVN_A is a fixed tag, say 1.7.0. For each call, run this once.
> + # It will be called again and again for each trunk build being tested,
> + # that's why we don't really need to run it $N times every time.
> + "$benchmark" "--db-path=$db" "--svn-bin-dir=$SVN_A" \
> + run "$SVN_A_NAME@$SVN_A_REV,${levels}x$spread" 1 >/dev/null
> +
> + # SVN_B is a branch, i.e. the moving target, benchmarked at a specific
> + # point in history each time this script is called. Run this $N times.
> + "$benchmark" "--db-path=$db" "--svn-bin-dir=$SVN_B" \
> + run "$SVN_B_NAME@$SVN_B_REV,${levels}x$spread" $N >/dev/null
> }
>
> -N=6
> +N=3
> al=5
> as=5
> bl=100
> @@ -49,15 +69,16 @@ bs=1
> cl=1
> cs=100
>
> -##DEBUG
> -#N=1
> -#al=1
> -#as=1
> -#bl=2
> -#bs=1
> -#cl=1
> -#cs=2
> -##DEBUG
> +if [ -n "$DEBUG" ]; then
> + echo "DEBUG"
> + N=1
> + al=1
> + as=1
> + bl=2
> + bs=1
> + cl=1
> + cs=2
> +fi
>
>
> {
> @@ -65,7 +86,7 @@ started="$(date)"
> echo "Started at $started"
>
> echo "
> -*Disclaimer:* this tests only file://-URL access on a GNU/Linux VM.
> +*Disclaimer* - This tests only file://-URL access on a GNU/Linux VM.
> This is intended to measure changes in performance of the local working
> copy layer, *only*. These results are *not* generally true for everyone."
>
> @@ -73,14 +94,12 @@ batch $al $as $N
> batch $bl $bs $N
> batch $cl $cs $N
>
> -"$benchmark" combine "total_$SVN_A_NAME" *x*"_$SVN_A_NAME" >/dev/null
> -"$benchmark" combine "total_$SVN_B_NAME" *x*"_$SVN_B_NAME" >/dev/null
> -
> echo ""
> echo "Averaged-total results across all runs:"
> echo "---------------------------------------"
> echo ""
> -"$benchmark" compare "total_$SVN_A_NAME" "total_$SVN_B_NAME"
> +"$benchmark" "--db-path=$db" \
> + compare "$SVN_A_NAME" "$SVN_B_NAME@$SVN_B_REV"
>
> echo ""
> echo ""
> @@ -88,8 +107,9 @@ echo "Above totals split into separate <
> echo "----------------------------------------------------------------"
> echo ""
>
> -for pre in "${al}x${as}_" "${bl}x${bs}_" "${cl}x${cs}_"; do
> - "$benchmark" compare "${pre}$SVN_A_NAME" "${pre}$SVN_B_NAME"
> +for lvlspr in "${al}x${as}" "${bl}x${bs}" "${cl}x${cs}"; do
> + "$benchmark" "--db-path=$db" \
> + compare "$SVN_A_NAME,$lvlspr" "$SVN_B_NAME@$SVN_B_REV,$lvlspr"
> echo ""
> done
>
> @@ -99,8 +119,13 @@ echo "More detail:"
> echo "------------"
> echo ""
>
> -for pre in "${al}x${as}_" "${bl}x${bs}_" "${cl}x${cs}_" "total_"; do
> - "$benchmark" compare -v "${pre}$SVN_A_NAME" "${pre}$SVN_B_NAME"
> +for lvlspr in "${al}x${as}" "${bl}x${bs}" "${cl}x${cs}" "" ; do
> + "$benchmark" "--db-path=$db" show "$SVN_A_NAME,$lvlspr"
> + echo --
> + "$benchmark" "--db-path=$db" show "$SVN_B_NAME,$lvlspr"
> + echo --
> + "$benchmark" "--db-path=$db" \
> + compare -v "$SVN_A_NAME,$lvlspr" "$SVN_B_NAME@$SVN_B_REV,$lvlspr"
> echo ""
> echo ""
> done
> @@ -111,7 +136,3 @@ echo " done at $(date)"
> pwd
> } 2>&1 | tee results.txt
>
> -cd "$inital_workdir"
> -if [ -f "$parent/total_trunk" ]; then
> - rm -rf "$parent"
> -fi
>
>

application/pgp-signature attachment: OpenPGP digital signature

Received on 2012-07-05 16:04:50 CEST

This message: [ Message body ]
Next message: neels_at_apache.org: "[svnbench] Revision: 1357620 compiled Jul 5 2012, 14:10:32"
Previous message: C. Michael Pilato: "Re: Incorrect LastChangedRev in working copy after committing directory move + modified file in subdir"
Next in thread: Daniel Shahaf: "Re: particular revisions for benchmarks?"
Reply: Daniel Shahaf: "Re: particular revisions for benchmarks?"

Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ] [ by messages with attachments ]