View on GitHub

dumbpm

A pretty dumb PM

dumbpm

Installation
Prioritize projects
- Projects format
- Example
Estimate project duration without historical data
- Input format
- Example
Estimate project duration based on historical data
- Input format
- Example

A pretty dumb PM.

The whole philosophy behind dumbpm is that PMs (project managers, program managers, product managers, people managers, pokemon masters, etc.) all do some tasks that could use some automation, but at the same time this automation should be as dumb as possible. There are so many changing factors that influence such tasks that you better hire a PM (or a team of researchers to do the automation) to do the clever stuff, not a random software on the internet. In addition, if we keep it dumb, people can just read the code and understand what is going on, if they really want to.

If you have any suggestions for something (but nothing clever!) that you would like dumbpm to do for you, open an issue and let me know.

Installation

pip install dumbpm

Tested on both Linux and Mac OS. Windows might work.

NOTE Python 3.9+ required.

Prioritize projects

Giving a table of projects defined as below, it outputs a list of projects in order of priority within the optionally specified budget (prioritization as “data problem”).

The prioritized list is modelled as the exact solution of a Knapsack Problem with the following value function: norm(value) / (norm(cost) + norm(duration) + norm(risk)). Pretty dumb, indeed.

If --budget isn’t specified, the list will simply be sorted by the result of the above value function. Budget is relative to the cost parameter.

If you are expressing cost as cost per unit of duration (e.g. developers per week or sprint), you’ll have to specify the --cost-per-duration option, so that total cost = cost * duration and budget is measured against total cost (duration will also disappear from the value function not to count it twice).

By setting the pick parameter you can also “cheat” and tell the solver to always include a project in the solution. This can be useful if you want to see how the solution would change based on that choice.

$ dumbpm prioritize --help
usage: dumbpm prioritize [-h] [--budget [BUDGET]] filename

positional arguments:
  filename           CSV file with projects definition

optional arguments:
  -h, --help         show this help message and exit
  --budget [BUDGET]  Max budget allowed
  --cost-per-duration  Cost is to be assumed per unit of duration. Budget =
                       (cost * duration)

Projects format

Project definition happens in a CSV file with the following structure:

Project: [required] name of the project
Value: [required] value of the project
Cost: [required] cost of the project
Duration: [optional] duration of the project expressed in unit of times
Risk: [optional] risk of failure of the project
Pick: [optional, default = FALSE] force the inclusion of the project
Alternatives: [optional, empty field = empty list] comma separated list of projects that are incompatible with this one (e.g. make lunch vs buy lunch)

There is a bit of slack on the headers of the columns (e.g. Project, Projects, project, etc. are all alright). Notable mentions: alts and alternatives work the same; PQ can be used instead of cost if that’s your thing.

Project	Value	Cost	Duration	Risk	Pick	Alternatives
Buy a better espresso machine	5	4	2	1	TRUE
Buy ambient parfume for the back	1	1	1	1
Find and remove source of bad smell	5	2	4	4	TRUE
Find better coffee vendors	4	2	6	3
Buy smart component for roaster	3	5	2	3		Build in-house roasting notification system
Introduce a periodical suggestion survey for treats	3	2	6	1
Buy a more modern sign	3	3	1	1
Contact a designer to re-think the front	5	5	6	3
Buy beds for powernaps	1	3	1	1
Import treats from France	2	4	2	2
Build in-house roasting notification system	3	5	6	5		Buy smart component for roaster

Example

$ cat projects.csv
Project,Value,Cost,Duration,Pick,Alternatives
Buy a better espresso machine,5,4,2,TRUE,
Buy ambient parfume for the back,1,1,1,,
Find and remove source of bad smell,5,2,4,TRUE,
Find better coffee vendors,4,2,6,,
Buy smart component for roaster,3,5,2,,Build in-house roasting notification system
Introduce a periodical suggestion survey for treats,3,2,6,,
Buy a more modern sign,3,3,1,,
Contact a designer to re-think the front,5,5,6,,
Buy beds for powernaps,1,3,1,,
Import treats from France,2,4,2,,
Build in-house roasting notification system,3,5,6,,Buy smart component for roaster

$ dumbpm prioritize projects.csv
01 Buy ambient parfume for the back
02 Buy a more modern sign
03 Find and remove source of bad smell
04 Buy a better espresso machine
05 Find better coffee vendors
06 Buy beds for powernaps
07 Buy smart component for roaster
08 Import treats from France
09 Introduce a periodical suggestion survey for treats
10 Contact a designer to re-think the front

$ dumbpm prioritize projects.csv --budget 10
01 Buy ambient parfume for the back
02 Buy a more modern sign
03 Find and remove source of bad smell
04 Buy a better espresso machine

Estimate project duration without historical data

Giving a list of tasks or milestones the project can be broken into and (best case, expected, worst case) estimates for the duration of such tasks, it outputs an estimate of the project duration in the form of a probability distribution (median, variance, percentiles). You can use these numbers to formulate guesstimates like “I am 75% confident that we will complete the project in 38 weeks”. Instead of using a unit of time (e.g. days, weeks), you can instead specify story points for each tasks and so estimate the project size in story point instead of its duration.

The estimation is based on a Monte Carlo simulation of the following equation: project_duration = sum(duration, over=tasks). Tasks duration for each iteration are derived from a modified-PERT distribution interpolating the (best case, expected, worst case) estimates for the task. Pretty dumb, indeed.

The command is called guesstimate to highlight that the result of this method should be considered with lower confidence than the one produced by the estimate command, which is based on historical data. You can switch to estimate after you conclude a few sprints, while relying on guesstimate for the initial estimates during the planning period.

If it is taking too long to perform the estimation on your computer, set --simulations to something lower than 10000.

$ dumbpm guesstimate --help
usage: dumbpm guesstimate [-h] [--simulations [SIMULATIONS]] filename

positional arguments:
  filename              CSV file with tasks estimates

optional arguments:
  -h, --help            show this help message and exit
  --simulations [SIMULATIONS]
                        Number of simulations to run

Input format

Estimates for tasks or milestones have to be defined in a CSV file with the following structure:

Task: [required] name of the task (Milestone works as header, too)
Best: [required] best case estimate for the task duration
Expected: [required] most likely estimate for the task duration
Worst: [required] worst case estimate for the task duration

Task	Best	Expected	Worst
Task A	5	10	20
Task B	6	12	40
Task C	1	13	24
Task D	10	13	15
Task E	5	7	12
Task F	12	25	34

Example

$ cat tasks.csv
Task,Best,Expected,Worst
Task A,5,10,20
Task B,6,12,40
Task C,1,13,24
Task D,10,13,15
Task E,5,7,12
Task F,12,25,34

$ dumbpm guesstimate tasks.csv
            Duration
count   10000.000000
mean        7.761430
std         0.993793
min         5.000000
50%         8.000000
5%         8.000000
90%         9.000000
99%        10.000000
max        12.000000

Estimate project duration based on historical data

Giving a list of past sprint velocities and (optionally) a list of scope changes for the project in story points defined as below, it outputs an estimate of the project duration in the form of a probability distribution (median, variance, percentiles). You can use these numbers to formulate guesstimates like “I am 75% confident that we will complete the project in 38 weeks”.

Please note that the estimate is measured in sprints, so you’ll have to multiply that for the duration of your sprint and project that on your working calendar to account for holidays and anything else which could affect your schedule.

The estimation is based on a Monte Carlo simulation of the following inequation: scope + sum(scope_change, over=sprints) <= sum(velocity, over=sprints). Pretty dumb, indeed.

By default, velocity and scope change for each iteration are picked at random following a uniform probability distribution from the provided historical data. If --normal is specified, the input will be modelled as normal distribution from which velocity and scope changes will be derived.

If it is taking too long to perform the estimation on your computer, set --simulations to something lower than 10000.

$ dumbpm estimate --help
usage: dumbpm estimate [-h] [--normal] [--simulations [SIMULATIONS]] filename scope

positional arguments:
  filename              CSV file with velocity and scope change datapoints
  scope                 Remaining scope in story points for the project

optional arguments:
  -h, --help            show this help message and exit
  --normal              Use a normal distribution for the input data
  --simulations [SIMULATIONS]
                        Number of simulations to run

Input format

Historical data has to be defined in a CSV file with the following structure:

Velocity: [required] velocity for each past sprint
Change: [optional] project scope change for each past sprint (zero and negative values are allowed)

Velocity	Change
17	5
19	1
10	0
12	0
21	1
7	-3
15	-2
12	5
12	0
14	2
18	-4

Example

$ cat sprints.csv
Velocity,Change
17,5
19,1
10,0
12,0
21,1
7,-3
15,-2
12,5
12,0
14,2
18,-4

$ dumbpm estimate sprints.csv 100
            Duration
count   10000.000000
mean        7.761430
std         0.993793
min         5.000000
50%         8.000000
75%         8.000000
90%         9.000000
99%        10.000000
max        12.000000

$ dumbpm prioritize projects.csv 100 --normal
           Duration
count   10000.00000
mean        7.75637
std         0.99810
min         4.00000
50%         8.00000
75%         8.00000
90%         9.00000
99%        10.00000
max        13.00000