Dataset

The set of files to use during this course is available at: seq.space/dataset.zip.1)

To download and expand it in your home, the list of commands is:

cd
wget "http://seq.space/dataset.zip"
unzip dataset.zip
rm dataset.zip

The archive contains four directories: ecoli, with a set of files from E. coli K-122), GenBank, with two GenBank records, misc, with a couple of files, test with a full set of files.

├── ecoli
│   ├── ecoli.genes.fa
│   ├── ecoli.genome.fa
│   ├── ecoli.gff3
│   └── ecoli.proteins.fa
├── GenBank
│   ├── E.coli.genbank
│   └── Y.pestis.genbank
├── misc
│   ├── excel_data.csv
│   └── oligos.txt
└── test
    ├── cat.jpg
    ├── data.txt
    ├── dna.fa
    ├── dog.jpg
    ├── goldfish.jpg
    ├── motifs.fa
    ├── Music
    ├── Pictures
    ├── proteins.fa
    ├── README.txt
    ├── Sequences
    ├── song1.mp3
    ├── song2.mp3
    ├── Text
    └── todo.txt
1)
The data included is partly taken from Unix and Perl Course