batch_ltrseq.pl


NAME

batch_ltrseq.pl - Run the LTR_seq program in batch mode.


VERSION

This documentation refers to program version $Rev: 559 $


SYNOPSIS

Usage

    batch_ltrseq.pl -i in_dir -o out_dir

Required Options

    -i        # Intput directory of files to process
              # These must be in fasta file format
    -o        # Base output directory


DESCRIPTION

This program runs the LTR_seq program for each file in the input directory. An optional configuartion file can also be used to run the LTR_seq program for multiple LTR_seq parameter combinations. If no configuration file is provided, batch_ltrseq.pl will run LTR_seq using default parameters.


REQUIRED ARGUMENTS

-i,--indir

Path of the directory containing the sequences to process.

-o,--outdir

Path of the directory to place the program output.


OPTIONS

-c,--config

Path to a configuration file. This config file will allow the batch_ltrseq.pl program to run the LTR_seq program for multiple parameter combinations for every fasta file in the input sequence directory. If no configuartion file is used, the LTR_seq program will be run with default parameters.

The configuration file will be a two column, tab delimited text file with the following options:

An example configuration file is shown below:

    def     DEF
    long    long_ltr.cfg
    old     lod_ltr.cfg

This configuration file would run three sets of parameter files for every sequence in the input directory.

--test

Run the batch_ltrseq.pl program in test mode. This will create the required directory structure, test for existence of files and other program functions. However, this will not actually run the LTR_seq program.

--usage

Short overview of how to use program from command line.

--help

Show program usage with summary of options.

--version

Show program version.

--man

Show the full program manual. This uses the perldoc command to print the POD documentation for the program.

-q,--quiet

Run the program with minimal output.


DIAGNOSTICS

ERROR: Could not create root name dir

The program could not create the directory needed to hold the output for the sequence file. It is possible that you not have the proper access to create a directory in the path that you specified. It is also possible that the directory that you are trying to create this subdirectory in does not exist.


CONFIGURATION AND ENVIRONMENT

Configuration File

This program can optionally make use of a configuration file. The path to configuration file is declared with the --config option at the command line. This config file will allow the batch_ltrseq.pl program to run the LTR_seq program for multiple parameter combinations for every fasta file in the input sequence directory. If no configuartion file is used, the LTR_seq program will be run with default parameters.

The configuration file will be a two column, tab delimited text file with the following options:

An example configuration file is shown below:

    def     DEF
    long    long_ltr.cfg
    old     old_ltr.cfg

This configuration file would run three sets of LTR_seq configurations for every sequence in the input directory. These configurations would include (1) the default parameter set, (2) the parameter set name long that is specified by the LTR_seq config file long_ltr.cfg, and (3) the parameter set name old that is specified by the parameter set old_ltr.cfg.

Please see the LTR_seq documentation for specific information on the LTR_seq configuration files. The configuration files in LTR_seq allow you to specifiy the following parameters:

An example LTR_seq config file is shown below:

    window  12
    Dmin 600
    Dmax 15000
    LTRmax 2000
    match 2
    mismatch -5
    gap -1
    hgap -6
    AlignmentWithN -5
    MaxScoreRatioThreshold  15
    LTRmin 100
    TSR_len 6
    LTRminExactMatch 30

Environment

This program does not make use of variables set in the user environment.


DEPENDENCIES

Required Software

The following software is required for the batch_ltrseq.pl program to run properly.

Required Perl Modules


BUGS AND LIMITATIONS

Bugs

Limitations


REFERENCE

A manuscript is being submitted describing the DAWGPAWS program. Until this manuscript is published, please refer to the DAWGPAWS SourceForge website when describing your use of this program:

JC Estill and JL Bennetzen. 2009. The DAWGPAWS Pipeline for the Annotation of Genes and Transposable Elements in Plant Genomes. http://dawgpaws.sourceforge.net/


LICENSE

GNU GENERAL PUBLIC LICENSE, VERSION 3

http://www.gnu.org/licenses/gpl.html

THIS SOFTWARE COMES AS IS, WITHOUT ANY EXPRESS OR IMPLIED WARRANTY. USE AT YOUR OWN RISK.


AUTHOR

James C. Estill <JamesEstill at gmail.com>


HISTORY

STARTED: 09/06/2007

UPDATED: 03/24/2009

VERSION: $Rev: 559 $

 batch_ltrseq.pl