ChIP-Seq auxiliary Perl applications
============================================================================
This directory collects Perl scripts for format conversion and other auxiliary tasks, such as read-count filtering and SGA file compression.
The main ChIP-Seq programs use SGA (Simplified Genome Annotation), a simplified BED-like format in which records are sorted by sequence name and position.
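
As a rough illustration of the sort order described above, the following Python sketch sorts SGA-like records by sequence name and then numerically by position. The five tab-separated fields (sequence name, feature, position, strand, count), the helper names, and the sample records are assumptions made for this example and are not taken from this README; the SGA format documentation remains the authoritative reference.

```python
# Sketch only: the field layout (name, feature, position, strand, count)
# is an assumption for illustration.
def sga_sort_key(line):
    """Return (sequence name, numeric position) for one SGA-like line."""
    fields = line.rstrip("\n").split("\t")
    return (fields[0], int(fields[2]))


def sort_sga(lines):
    """Sort SGA-like lines by sequence name, then by position."""
    return sorted(lines, key=sga_sort_key)


# Hypothetical records, deliberately out of order.
records = [
    "NC_000002.11\tCTCF\t500\t+\t1",
    "NC_000001.10\tCTCF\t1200\t-\t2",
    "NC_000001.10\tCTCF\t300\t+\t1",
]
sorted_records = sort_sga(records)
```

The position is compared as an integer, so that 300 sorts before 1200; a plain text sort would place them the other way round.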

In a typical data analysis pipeline, the SGA file is generated from one of several richer formats, such as the Solexa genome mapping format, BAM, BED, or FPS (Functional Position Set). FPS is the format used by the Signal Search Analysis (SSA) programs at SIB.
We therefore provide simple, fast tools to convert SGA files to other formats, in particular BED, WIG (Wiggle Track Format), and FPS, and vice versa.
WIG and BED files are used to view ChIP-Seq data and results in the UCSC Genome Browser.
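
To give an idea of what a BED-to-SGA conversion involves, here is a minimal Python sketch. It is not the code of any script in this directory. The SGA field layout, the strand-dependent choice of position (start + 1 on the plus strand, end on the minus strand), the "0" strand for unoriented features, the default feature name, and the fixed count of 1 are all assumptions made for illustration. The sketch also leaves the chromosome name unchanged rather than translating it to an NCBI identifier.

```python
# Sketch only: output layout (name, feature, position, strand, count)
# and the position convention are assumptions for illustration.
def bed_to_sga(bed_line, feature="ChIP"):
    """Convert one tab-separated BED line into one SGA-like line."""
    fields = bed_line.rstrip("\n").split("\t")
    chrom, start, end = fields[0], int(fields[1]), int(fields[2])
    # BED column 6 holds the strand; treat a missing strand as unoriented.
    strand = fields[5] if len(fields) > 5 else "0"
    # BED starts are 0-based and ends are exclusive, so the 1-based
    # 5' end is start + 1 on "+" and end on "-".
    position = end if strand == "-" else start + 1
    return "\t".join([chrom, feature, str(position), strand, "1"])


plus_line = bed_to_sga("chr1\t99\t150\tread1\t0\t+")
minus_line = bed_to_sga("chr1\t99\t150\tread2\t0\t-")
```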

The binary file chro_idx.nstorage contains a Perl hash table that stores, for each supported assembly, pairs of chromosome numbers and NCBI identifiers, as well as chromosome lengths indexed by NCBI identifier.
Most conversion scripts use this file. When it is required, specify its location with the --db <path> option.
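
The kind of lookup described above can be pictured with the following Python sketch. It does not read chro_idx.nstorage, which is a binary Perl structure. The nested layout, the key names ("ncbi_id", "length"), and the function names are hypothetical stand-ins chosen for this example, and the actual file may be organised differently. The hg19 values shown are the public GRCh37 identifiers and lengths for chromosomes 1 and 2.

```python
# Hypothetical in-memory stand-in for the index; the real layout may differ.
CHRO_IDX = {
    "hg19": {
        # chromosome number -> NCBI identifier
        "ncbi_id": {"1": "NC_000001.10", "2": "NC_000002.11"},
        # NCBI identifier -> chromosome length in base pairs
        "length": {"NC_000001.10": 249250621, "NC_000002.11": 243199373},
    },
}


def chrom_to_ncbi(index, assembly, chrom_number):
    """Look up the NCBI identifier for a chromosome number."""
    return index[assembly]["ncbi_id"][chrom_number]


def ncbi_length(index, assembly, ncbi_id):
    """Look up the chromosome length for an NCBI identifier."""
    return index[assembly]["length"][ncbi_id]


ncbi = chrom_to_ncbi(CHRO_IDX, "hg19", "1")
length = ncbi_length(CHRO_IDX, "hg19", ncbi)
```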
