Search — ruffus 2.6.2 documentation

ruffus

Installation
Ruffus Manual: List of Chapters and Example code
Chapter 1: An introduction to basic Ruffus syntax
Chapter 2: Transforming data in a pipeline with @transform
Chapter 3: More on @transform-ing data
Chapter 4: Creating files with @originate
- Simplifying our example with @originate
Chapter 5: Understanding how your pipeline works with pipeline_printout(...)
Chapter 6: Running Ruffus from the command line with ruffus.cmdline
Chapter 7: Displaying the pipeline visually with pipeline_printout_graph(...)
Chapter 8: Specifying output file names with formatter() and regex()
Chapter 9: Preparing directories for output with @mkdir()
- Overview
- Creating directories after string substitution in a zoo...
Chapter 10: Checkpointing: Interrupted Pipelines and Exceptions
Chapter 11: Pipeline topologies and a compendium of Ruffus decorators
Chapter 12: Splitting up large tasks / files with @split
Chapter 13: @merge multiple input into a single result
Chapter 14: Multiprocessing, drmaa and Computation Clusters
Chapter 15: Logging progress through a pipeline
Chapter 16: @subdivide tasks to run efficiently and regroup with @collate
Chapter 17: @combinations, @permutations and all versus all @product
Chapter 18: Turning parts of the pipeline on and off at runtime with @active_if
- Overview
- @active_if controls the state of tasks
Chapter 19: Signal the completion of each stage of our pipeline with @posttask
- Overview
Chapter 20: Manipulating task inputs via string substitution using inputs() and add_inputs()
Chapter 21: Esoteric: Generating parameters on the fly with @files
Chapter 22: Esoteric: Running jobs in parallel without files using @parallel
- @parallel
Chapter 23: Esoteric: Writing custom functions to decide which jobs are up to date with @check_if_uptodate
- @check_if_uptodate : Manual dependency checking
Appendix 1: Flow Chart Colours with pipeline_printout_graph(...)
- Flowchart colours
Appendix 2: How dependency is checked
- Overview
Appendix 3: Exceptions thrown inside pipelines
Appendix 4: Names exported from Ruffus
- Ruffus Names
Appendix 5: @files: Deprecated syntax
Appendix 6: @files_re: Deprecated syntax using regular expressions
- Overview

Chapter 1: Python Code for An introduction to basic Ruffus syntax
- Your first Ruffus script
- Resulting Output
Chapter 1: Python Code for Transforming data in a pipeline with @transform
- Your first Ruffus script
- Resulting Output
Chapter 3: Python Code for More on @transform-ing data
Chapter 4: Python Code for Creating files with @originate
- Using @originate
- Resulting Output
Chapter 5: Python Code for Understanding how your pipeline works with pipeline_printout(...)
Chapter 7: Python Code for Displaying the pipeline visually with pipeline_printout_graph(...)
- Code
- Resulting Flowcharts
Chapter 8: Python Code for Specifying output file names with formatter() and regex()
Chapter 9: Python Code for Preparing directories for output with @mkdir()
- Code for formatter() Zoo example
- Code for regex() Zoo example
Chapter 10: Python Code for Checkpointing: Interrupted Pipelines and Exceptions
- Code for the “Interrupting tasks” example
Chapter 12: Python Code for Splitting up large tasks / files with @split
- Splitting large jobs
- Resulting Output
Chapter 13: Python Code for @merge multiple input into a single result
- Splitting large jobs
- Resulting Output
Chapter 14: Python Code for Multiprocessing, drmaa and Computation Clusters
- @jobs_limit
- Using ruffus.drmaa_wrapper
Chapter 15: Python Code for Logging progress through a pipeline
- Rotating set of file logs
Chapter 16: Python Code for @subdivide tasks to run efficiently and regroup with @collate
- @subdivide and regroup with @collate example
Chapter 17: Python Code for @combinations, @permutations and all versus all @product
Chapter 20: Python Code for Manipulating task inputs via string substitution using inputs() and add_inputs()
- Example code for adding additional input prerequisites per job with add_inputs()
- Example code for replacing all input parameters with inputs()
Chapter 21: Esoteric: Python Code for Generating parameters on the fly with @files
Appendix 1: Python code for Flow Chart Colours with pipeline_printout_graph(...)
- Code

Cheat Sheet
Pipeline functions
drmaa functions
- run_job
Installation
Design & Architecture
Major Features added to Ruffus
Fixed Bugs
New Object orientated syntax for Ruffus in Version 2.6
Worked Example for New Object orientated syntax for Ruffus in Version 2.6
- Worked example
Python Code for: New Object orientated syntax for Ruffus in Version 2.6
Where I see Ruffus going
In up coming release:
Future Changes to Ruffus
Planned Improvements to Ruffus
Implementation Tips
Implementation notes
FAQ
Glossary
Hall of Fame: User contributed flowcharts
Why Ruffus?

Construction of a simple pipeline to run BLAST jobs
Part 2: A slightly more practical pipeline to run blasts jobs
Ruffus code
Ruffus code
Example code for FAQ Good practices: "What is the best way of handling data in file pairs (or triplets etc.)?"

Ruffus Decorators
Indicator Objects
- formatter
- suffix
- regex
- add_inputs
- inputs
- mkdir
- touch_file
- output_from
- combine

@originate ( output, [extras,...] )
@split ( input, output, [extras,...] )
@transform( input, filter, output, [extras,...] )
@merge ( input, output, [extras,...] )

@subdivide
- @subdivide ( input, regex(matching_regex) | formatter(matching_formatter), [ inputs (input_pattern_or_glob) | add_inputs (input_pattern_or_glob) ], output, [extras,...] )
@transform( input, filter, replace_inputs | add_inputs, output, [extras,...] )
@collate( input, filter, output, [extras,...] )
@collate( input, filter, replace_inputs | add_inputs, output, [extras,...] )
@graphviz
- @graphviz ( graphviz_parameters,...] )
@mkdir( input, filter, output )
@jobs_limit
- @jobs_limit ( maximum_num_of_jobs, [ name ])
@posttask
- @posttask (function | touch_file(file_name))
@active_if
- @active_if(on_or_off1, [on_or_off2,...])
@follows
- @follows(task | “task_name” | mkdir (directory_name), [more_tasks, ...])

@product( input, filter, [input2, filter2, ...], output, [extras,...] )
@permutations( input, filter, tuple_size, output, [extras,...] )
@combinations( input, filter, tuple_size, output, [extras,...] )
@combinations_with_replacement( input, filter, tuple_size, output, [extras,...] )

Generating parameters on the fly for @files
- @files (custom_function)
@check_if_uptodate
- @check_if_uptodate (dependency_checking_function)
@parallel
- @parallel ( [ [job_params, ...], [job_params, ...]...] | parameter_generating_function)

@files
- @files (input1, output1, [extra_parameters1, ...])
- @files ( (( input, output, [extra_parameters,...] ), (...), ...) )
@files_re
- @files_re (tasks_or_file_names, matching_regex, [input_pattern], output_pattern, [extra_parameters,...])

ruffus.Task
ruffus.proxy_logger

ruffus

Docs »
Edit on GitHub

Built with Sphinx using a theme provided by Read the Docs.