diff --git a/.travis.yml b/.travis.yml
index d4ea00e6e4e..a3b305de373 100644
--- a/.travis.yml
+++ b/.travis.yml
@@ -6,6 +6,8 @@ python: 2.7
env:
- CHUNK=0
- CHUNK=1
+ - CHUNK=2
+ - CHUNK=3
before_install:
- export GALAXY_REPO=https://github.com/galaxyproject/galaxy
@@ -32,7 +34,7 @@ install:
planemo ci_find_repos --exclude_from .tt_blacklist \
--exclude packages --exclude data_managers \
--changed_in_commit_range "$TRAVIS_COMMIT_RANGE" \
- --chunk_count 2 --chunk "${CHUNK}" \
+ --chunk_count 4 --chunk "${CHUNK}" \
--output changed_repositories_chunk.list
- cat changed_repositories_chunk.list
diff --git a/tool_collections/kraken/README.rst b/tool_collections/kraken/README.rst
new file mode 100644
index 00000000000..4acd6859923
--- /dev/null
+++ b/tool_collections/kraken/README.rst
@@ -0,0 +1,92 @@
+Introduction
+============
+
+`Kraken `__ is a taxonomic sequence
+classifier that assigns taxonomic labels to short DNA reads. It does
+this by examining the :math:`k`-mers within a read and querying a
+database with those :math:`k`-mers. This database contains a mapping of
+every :math:`k`-mer in
+`Kraken `__'s genomic library to
+the lowest common ancestor (LCA) in a taxonomic tree of all genomes that
+contain that :math:`k`-mer. The set of LCA taxa that correspond to the
+:math:`k`-mers in a read are then analyzed to create a single taxonomic
+label for the read; this label can be any of the nodes in the taxonomic
+tree. `Kraken `__ is designed to be
+rapid, sensitive, and highly precise. Our tests on various real and
+simulated data have shown
+`Kraken `__ to have sensitivity
+slightly lower than Megablast with precision being slightly higher. On a
+set of simulated 100 bp reads,
+`Kraken `__ processed over 1.3
+million reads per minute on a single core in normal operation, and over
+4.1 million reads per minute in quick operation.
+
+The latest released version of Kraken will be available at the `Kraken
+website `__, and the latest updates
+to the Kraken source code are available at the `Kraken GitHub
+repository `__.
+
+If you use `Kraken `__ in your
+research, please cite the `Kraken
+paper `__. Thank you!
+
+System Requirements
+===================
+
+Note: Users concerned about the disk or memory requirements should read
+the paragraph about MiniKraken, below.
+
+- **Disk space**: Construction of Kraken's standard database will
+ require at least 160 GB of disk space. Customized databases may
+ require more or less space. Disk space used is linearly proportional
+ to the number of distinct :math:`k`-mers; as of Feb. 2015, Kraken's
+ default database contains just under 6 billion (6e9) distinct
+ :math:`k`-mers.
+
+ In addition, the disk used to store the database should be
+ locally-attached storage. Storing the database on a network
+ filesystem (NFS) partition can cause Kraken's operation to be very
+ slow, or to be stopped completely. As NFS accesses are much slower
+ than local disk accesses, both preloading and database building will
+ be slowed by use of NFS.
+
+- **Memory**: To run efficiently, Kraken requires enough free memory to
+ hold the database in RAM. While this can be accomplished using a
+ ramdisk, Kraken supplies a utility for loading the database into RAM
+ via the OS cache. The default database size is 75 GB (as of Feb.
+ 2015), and so you will need at least that much RAM if you want to
+ build or run with the default database.
+
+- **Dependencies**: Kraken currently makes extensive use of Linux
+ utilities such as sed, find, and wget. Many scripts are written using
+ the Bash shell, and the main scripts are written using Perl. Core
+ programs needed to build the database and run the classifier are
+ written in C++, and need to be compiled using g++. Multithreading is
+ handled using OpenMP. Downloads of NCBI data are performed by wget
+ and in some cases, by rsync. Most Linux systems that have any sort of
+ development package installed will have all of the above listed
+ programs and libraries available.
+
+ Finally, if you want to build your own database, you will need to
+ install the
+ `Jellyfish `__
+ :math:`k`-mer counter. Note that Kraken only supports use of
+ Jellyfish version 1. Jellyfish version 2 is not yet compatible with
+ Kraken.
+
+- **Network connectivity**: Kraken's standard database build and
+ download commands expect unfettered FTP and rsync access to the NCBI
+ FTP server. If you're working behind a proxy, you may need to set
+ certain environment variables (such as ``ftp_proxy`` or
+ ``RSYNC_PROXY``) in order to get these commands to work properly.
+
+- **MiniKraken**: To allow users with low-memory computing environments
+ to use Kraken, we supply a reduced standard database that can be
+ downloaded from the Kraken web site. When Kraken is run with a
+ reduced database, we call it MiniKraken.
+
+ The database we make available is only 4 GB in size, and should run
+ well on computers with as little as 8 GB of RAM. Disk space required
+ for this database is also only 4 GB.
+
+
diff --git a/tool_collections/kraken/database.idx b/tool_collections/kraken/database.idx
new file mode 100644
index 00000000000..1a06ff64f27
Binary files /dev/null and b/tool_collections/kraken/database.idx differ
diff --git a/tool_collections/kraken/database.kdb b/tool_collections/kraken/database.kdb
new file mode 100644
index 00000000000..35447d3771e
Binary files /dev/null and b/tool_collections/kraken/database.kdb differ
diff --git a/tool_collections/kraken/kraken/.shed.yml b/tool_collections/kraken/kraken/.shed.yml
new file mode 100644
index 00000000000..00699d0b194
--- /dev/null
+++ b/tool_collections/kraken/kraken/.shed.yml
@@ -0,0 +1,16 @@
+categories:
+- Metagenomics
+description: Kraken for taxonomic designation.
+homepage_url: http://ccb.jhu.edu/software/kraken/
+long_description: |
+ Kraken is a system for assigning taxonomic labels to short DNA
+ sequences, usually obtained through metagenomic studies. Previous attempts by other
+ bioinformatics software to accomplish this task have often used sequence alignment
+ or machine learning techniques that were quite slow, leading to the development
+ of less sensitive but much faster abundance estimation programs. Kraken aims to
+ achieve high sensitivity and high speed by utilizing exact alignments of k-mers
+ and a novel classification algorithm.
+name: kraken
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/blob/master/tool_collections/kraken/kraken/
+type: unrestricted
diff --git a/tool_collections/kraken/kraken/README.rst b/tool_collections/kraken/kraken/README.rst
new file mode 120000
index 00000000000..89a0106941f
--- /dev/null
+++ b/tool_collections/kraken/kraken/README.rst
@@ -0,0 +1 @@
+../README.rst
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/kraken.xml b/tool_collections/kraken/kraken/kraken.xml
new file mode 100644
index 00000000000..10e5f3039e7
--- /dev/null
+++ b/tool_collections/kraken/kraken/kraken.xml
@@ -0,0 +1,149 @@
+
+
+
+ assign taxonomic labels to sequencing reads
+
+
+ macros.xml
+
+
+
+
+ "${output}"
+ ##kraken-translate --db ${kraken_database.fields.name} "${output}" > "${translated}"
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+ (split_reads)
+
+
+ (split_reads)
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/tool_collections/kraken/kraken/macros.xml b/tool_collections/kraken/kraken/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/kraken/kraken/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/test-data/kraken_test1.fa b/tool_collections/kraken/kraken/test-data/kraken_test1.fa
new file mode 100644
index 00000000000..dfc5125a6a6
--- /dev/null
+++ b/tool_collections/kraken/kraken/test-data/kraken_test1.fa
@@ -0,0 +1,70 @@
+>gi|145231|gb|M33724.1|ECOALPHOA Escherichia coli K-12 truncated PhoA (phoA) gene, partial cds; and transposon Mu dI, partial sequence
+CAAAGCTCCGGGCCTCACCCAGGCGCTAAATACCAAAGATGGCGCAGTGATGGTGATGAGTTACGGGAAC
+TCCGAAGAGGATTCACAAGAACATACCGGCAGTCAGTTGCGTATTGCGGCGTATGGCCCGCATGCCGCCA
+ATGAAGCGGCGCACGAAAAACGCGAAAGCGT
+
+>gi|145232|gb|M33725.1|ECOALPHOB Escherichia coli K12 phoA pseudogene and transposon Mu dl-R, partial sequence
+CTGTCATAAAGTTGTCACGGCCGAGACTTATAGTCGCTTTGTTTTTATTTTTTAATGTATTTGTACATGG
+AGAAAATAAAGTGAAACAAAGCACTATTGCACTGGCACTCTTACCGTTACTGTTTACCCCTGTGACAAAA
+GCCCGGACACCAGTGAAGCGGCGCACGAAAAACGCGAAAGCGT
+
+>gi|145234|gb|M33727.1|ECOALPHOE Escherichia coli K12 upstream sequence of psiA5::Mu dI. is identical to psiA30 upstream sequence; putative (phoA) pseudogene and transposon Mu dl-R, partial sequence
+TTGTTTTTATTTTTTAATGTATTTGTACATGGAGAAAATAAAGTGAAACAAAGCACTATTGCACTGGTGA
+AGCGGCGCACGAAAAACGCGAAAGCGT
+
+>gi|146195|gb|J01619.1|ECOGLTA Eschericia coli gltA gene, sdhCDAB operon and sucABCD operons, complete sequence
+GAATTCGACCGCCATTGCGCAAGGCATCGCCATGACCAGGCAGGATACAAAAGAGAGTCGATAAATATTC
+ACGGTGTCCATACCTGATAAATATTTTATGAAAGGCGGCGATGATGCCGCAAAATAATACTTATTTATAA
+TCCAGCACGTAGGTTGCGTTAGCGGTTACTTCACCTGCCGTGACATCGACTGCATTATCAATTTGTTCCA
+TCCAGGCGAAAAAGTTCAGCGTCTGTTCTGATGAGCTTGCATCCAGGTCAAGATCTGGCGCGGCTGAACC
+TAATACGATGTTACCGTCATTTTTGTCCATCAGTCGTACACCGACCCCAGTTGCTTCGCCTGCACTGGTG
+TTGCTCAACAAAGGCGTAGCACCAGTTGTCTTAGCCGTGCTATCGAAGGTTACGCCAAACTTTGGATACC
+GGCATTCCGCTACCGTTGTCAGAAGCAGGCAGATCACAGTTGATCAAGCGAATGTCGACGGCCACTTTAT
+TGCTATGATGCTCCCGGTTTATATGGGTTGTCGTGACTTGTCCAAGATCTATGTTTTTATCAATATCTTC
+TGGATGAATTTCACAAGGTGCTTCAATAACCTCCCCCTTAAAGTGAATTTCGCCAGAACCTTCATCAGCA
+GCATAAACAGGTGCAGTGAACAGCAGAGATACGGCCAGTGCGGCCAATGTTTTTTGTCCTTTAAACATAA
+CAGAGTCCTTTAAGGATATAGAATAGGGGTATAGCTACGCCAGAATATCGTATTTGATTATTGCTAGTTT
+TTAGTTTTGCTTAAAAAATATTGTTAGTTTTATTAAATTGGAAAACTAAATTATTGGTATCATGAATTGT
+TGTATGATGATAAATATAGGGGGGATATGATAGACGTCATTTTCATAGGGTTATAAAATGCGACTACCAT
+GAAGTTTTTAATTCAAAGTATTGGGTTGCTGATAATTTGAGCTGTTCTATTCTTTTTAAATATCTATATA
+GGTCTGTTAATGGATTTTATTTTTACAAGTTTTTTGTGTTTAGGCATATAAAAATCAAGCCCGCCATATG
+AACGGCGGGTTAAAATATTTACAACTTAGCAATCGAACCATTAACGCTTGATATCGCTTTTAAAGTCGCG
+TTTTTCATATCCTGTATACAGCTGACGCGGACGGGCAATCTTCATACCGTCACTGTGCATTTCGCTCCAG
+TGGGCGATCCAGCCAACGGTACGTGCCATTGCGAAAATGACGGTGAACATGGAAGACGGAATACCCATCG
+CTTTCAGGATGATACCAGAGTAGAAATCGACGTTCGGGTACAGTTTCTTCTCGATAAAGTACGGGTCGTT
+CAGCGCGATGTTTTCCAGCTCCATAGCCACTTCCAGCAGGTCATCCTTCGTGCCCAGCTCTTTCAGCACT
+TCATGGCAGGTTTCACGCATTACGGTGGCGCGCGGGTCGTAATTTTTGTACACGCGGTGACCGAAGCCCA
+TCAGGCGGAAAGAATCATTTTTGTCTTTCGCACGACGAAAAAATTCCGGAATGTGTTTAACGGAGCTGAT
+TTCTTCCAGCATTTTCAGCGCCGCTTCGTTAGCACCGCCGTGCGCAGGTCCCCACAGTGAAGCAATACCT
+GCTGCGATACAGGCAAACGGGTTCGCACCCGAAGAGCCAGCGGTACGCACGGTGGAGGTAGAGGCGTTCT
+GTTCATGGTCAGCGTGCAGGATCAGAATACGGTCCATAGCACGTTCCAGAATCGGATTAACTTCATACGG
+TTCGCACGGCGTGGAGAACATCATATTCAGGAAGTTACCGGCGTAGGAGAGATCGTTGCGCGGGTAAACA
+AATGGCTGACCAATGGAATACTTGTAACACATCGCGGCCATGGTCGGCATTTTCGACAGCAGGCGGAACG
+CGGCAATTTCACGGTGACGAGGATTGTTAACATCCAGCGAGTCGTGATAGAACGCCGCCAGCGCGCCGGT
+AATACCACACATGACTGCCATTGGATGCGAGTCGCGACGGAAAGCATGGAACAGACGGGTAATCTGCTCG
+TGGATCATGGTATGACGGGTCACCGTAGTTTTAAATTCGTCATACTGTTCCTGAGTCGGTTTTTCACCAT
+TCAGCAGGATGTAACAAACTTCCAGGTAGTTAGAATCGGTCGCCAGCTGATCGATCGGGAAACCGCGGTG
+CAGCAAAATACCTTCATCACCATCAATAAAAGTAATTTTAGATTCGCAGGATGCGGTTGAAGTGAAGCCT
+GGGTCAAAGGTGAACACACCTTTTGAACCGAGAGTACGGATATCAATAACATCTTGACCCAGCGTGCCTT
+TCAGCACATCCAGTTCAACAGCTGTATCCCCGTTGAGGGTGAGTTTTGCTTTTGTATCAGCCATTTAAGG
+TCTCCTTAGCGCCTTATTGCGTAAGACTGCCGGAACTTAAATTTGCCTTCGCACATCAACCTGGCTTTAC
+CCGTTTTTTATTTGGCTCGCCGCTCTGTGAAAGAGGGGAAAACCTGGGTACAGAGCTCTGGGCGCTTGCA
+GGTAAAGGATCCATTGATGACGAATAAATGGCGAATCAAGTACTTAGCAATCCGAATTATTAAACTTGTC
+TACCACTAATAACTGTCCCGAATGAATTGGTCAATACTCCACACTGTTACATAAGTTAATCTTAGGTGAA
+ATACCGACTTCATAACTTTTACGCATTATATGCTTTTCCTGGTAATGTTTGTAACAACTTTGTTGAATGA
+TTGTCAAATTAGATGATTAAAAATTAAATAAATGTTGTTATCGTGACCTGGATCACTGTTCAGGATAAAA
+CCCGACAAACTATATGTAGGTTAATTGTAATGATTTTGTGAACAGCCTATACTGCCGCCAGTCTCCGGAA
+CACCCTGCAATCCCGAGCCACCCAGCGTTGTAACGTGTCGTTTTCGCATCTGGAAGCAGTGTTTTGCATG
+ACGCGCAGTTATAGAAAGGACGCTGTCTGACCCGCAAGCAGACCGGAGGAAGGAAATCCCGACGTCTCCA
+GGTAACAGAAAGTTAACCTCTGTGCCCGTAGTCCCCAGGGAATAATAAGAACAGCATGTGGGCGTTATTC
+ATGATAAGAAATGTGAAAAAACAAAGACCTGTTAATCTGGACCTACAGACCATCCGGTTCCCCATCACGG
+CGATAGCGTCCATTCTCCATCGCGTTTCCGGTGTGATCACCTTTGTTGCAGTGGGCATCCTGCTGTGGCT
+TCTGGGTACCAGCCTCTCTTCCCCTGAAGGTTTCGAGCAAGCTTCCGCGATTATGGGCAGCTTCTTCGTC
+AAATTTATCATGTGGGGCATCCTTACCGCTCTGGCGTATCACGTCGTCGTAGGTATTCGCCACATGATGA
+TGGATTTTGGCTATCTGGAAGAAACATTCGAAGCGGGTAAACGCTCCGCCAAAATCTCCTTTGTTATTAC
+TGTCGTGCTTTCACTTCTCGCAGGAGTCCTCGTATGGTAAGCAACGCCTCCGCATTAGGACGCAATGGCG
+TACATGATTTCATCCTCGTTCGCGCTACCGCTATCGTCCTGACGCTCTACATCATTTATATGGTCGGTTT
+TTTCGCTACCAGTGGCGAGCTGACATATGAAGTCTGGATCGGTTTCTTCGCCTCTGCGTTCACCAAAGTG
+TTCACCCTGCTGGCGCTGTTTTCTATCTTGATCCATGCCTGGATCGGCATGTGGCAGGTGTTGACCGACT
+ACGTTAAACCGCTGGCTTTGCGCCTGATGCTGCAACTGGTGATTGTCGTTGCACTGGTGGTTTACGTGAT
+TTATGGATTCGTTGTGGTGTGGGGTGTGTGATGAAATTGCCAGTCAGAGAATTTGATGCAGTTGTGATTG
diff --git a/tool_collections/kraken/kraken/test-data/kraken_test1_output.tab b/tool_collections/kraken/kraken/test-data/kraken_test1_output.tab
new file mode 100644
index 00000000000..2aa9e1ea8d5
--- /dev/null
+++ b/tool_collections/kraken/kraken/test-data/kraken_test1_output.tab
@@ -0,0 +1,4 @@
+C gi|145231|gb|M33724.1|ECOALPHOA 83333 171 83333:162
+C gi|145232|gb|M33725.1|ECOALPHOB 83333 183 83333:174
+C gi|145234|gb|M33727.1|ECOALPHOE 83333 97 83333:88
+C gi|146195|gb|J01619.1|ECOGLTA 83333 3850 83333:3841
diff --git a/tool_collections/kraken/kraken/test-data/test_database.loc b/tool_collections/kraken/kraken/test-data/test_database.loc
new file mode 120000
index 00000000000..03ec4336e45
--- /dev/null
+++ b/tool_collections/kraken/kraken/test-data/test_database.loc
@@ -0,0 +1 @@
+../../test_database.loc
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/test-data/test_db/database.idx b/tool_collections/kraken/kraken/test-data/test_db/database.idx
new file mode 120000
index 00000000000..9251273d383
--- /dev/null
+++ b/tool_collections/kraken/kraken/test-data/test_db/database.idx
@@ -0,0 +1 @@
+../../../database.idx
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/test-data/test_db/database.kdb b/tool_collections/kraken/kraken/test-data/test_db/database.kdb
new file mode 120000
index 00000000000..64efac5ce08
--- /dev/null
+++ b/tool_collections/kraken/kraken/test-data/test_db/database.kdb
@@ -0,0 +1 @@
+../../../database.kdb
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/test-data/test_db/taxonomy/names.dmp b/tool_collections/kraken/kraken/test-data/test_db/taxonomy/names.dmp
new file mode 120000
index 00000000000..f4585471fc2
--- /dev/null
+++ b/tool_collections/kraken/kraken/test-data/test_db/taxonomy/names.dmp
@@ -0,0 +1 @@
+../../../../names.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/test-data/test_db/taxonomy/nodes.dmp b/tool_collections/kraken/kraken/test-data/test_db/taxonomy/nodes.dmp
new file mode 120000
index 00000000000..a65d7d41f87
--- /dev/null
+++ b/tool_collections/kraken/kraken/test-data/test_db/taxonomy/nodes.dmp
@@ -0,0 +1 @@
+../../../../nodes.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/tool-data/kraken_databases.loc.sample b/tool_collections/kraken/kraken/tool-data/kraken_databases.loc.sample
new file mode 120000
index 00000000000..eba8983e9ea
--- /dev/null
+++ b/tool_collections/kraken/kraken/tool-data/kraken_databases.loc.sample
@@ -0,0 +1 @@
+../../kraken_databases.loc.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/tool_data_table_conf.xml.sample b/tool_collections/kraken/kraken/tool_data_table_conf.xml.sample
new file mode 120000
index 00000000000..332baf440e9
--- /dev/null
+++ b/tool_collections/kraken/kraken/tool_data_table_conf.xml.sample
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken/tool_data_table_conf.xml.test b/tool_collections/kraken/kraken/tool_data_table_conf.xml.test
new file mode 120000
index 00000000000..f8050b61701
--- /dev/null
+++ b/tool_collections/kraken/kraken/tool_data_table_conf.xml.test
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.test
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_databases.loc.sample b/tool_collections/kraken/kraken_databases.loc.sample
new file mode 100644
index 00000000000..e69de29bb2d
diff --git a/tool_collections/kraken/kraken_filter/.shed.yml b/tool_collections/kraken/kraken_filter/.shed.yml
new file mode 100644
index 00000000000..37aedb65a83
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/.shed.yml
@@ -0,0 +1,16 @@
+categories:
+- Metagenomics
+description: Kraken's confidence filter.
+homepage_url: http://ccb.jhu.edu/software/kraken/
+long_description: |
+ Kraken is a system for assigning taxonomic labels to short DNA
+ sequences, usually obtained through metagenomic studies. Previous attempts by other
+ bioinformatics software to accomplish this task have often used sequence alignment
+ or machine learning techniques that were quite slow, leading to the development
+ of less sensitive but much faster abundance estimation programs. Kraken aims to
+ achieve high sensitivity and high speed by utilizing exact alignments of k-mers
+ and a novel classification algorithm.
+name: kraken_filter
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/blob/master/tool_collections/kraken/kraken_filter/
+type: unrestricted
diff --git a/tool_collections/kraken/kraken_filter/README.rst b/tool_collections/kraken/kraken_filter/README.rst
new file mode 120000
index 00000000000..89a0106941f
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/README.rst
@@ -0,0 +1 @@
+../README.rst
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/kraken-filter.xml b/tool_collections/kraken/kraken_filter/kraken-filter.xml
new file mode 100644
index 00000000000..706cdb528a4
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/kraken-filter.xml
@@ -0,0 +1,66 @@
+
+ filter classification by confidence score
+
+ macros.xml
+
+
+
+ '$filtered_output'
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/tool_collections/kraken/kraken_filter/macros.xml b/tool_collections/kraken/kraken_filter/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/test-data/kraken_filter_test1.tab b/tool_collections/kraken/kraken_filter/test-data/kraken_filter_test1.tab
new file mode 100644
index 00000000000..2aa9e1ea8d5
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/kraken_filter_test1.tab
@@ -0,0 +1,4 @@
+C gi|145231|gb|M33724.1|ECOALPHOA 83333 171 83333:162
+C gi|145232|gb|M33725.1|ECOALPHOB 83333 183 83333:174
+C gi|145234|gb|M33727.1|ECOALPHOE 83333 97 83333:88
+C gi|146195|gb|J01619.1|ECOGLTA 83333 3850 83333:3841
diff --git a/tool_collections/kraken/kraken_filter/test-data/kraken_filter_test1_output.tab b/tool_collections/kraken/kraken_filter/test-data/kraken_filter_test1_output.tab
new file mode 100644
index 00000000000..9050d543951
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/kraken_filter_test1_output.tab
@@ -0,0 +1,4 @@
+C gi|145231|gb|M33724.1|ECOALPHOA 83333 171 P=1.000 83333:162
+C gi|145232|gb|M33725.1|ECOALPHOB 83333 183 P=1.000 83333:174
+C gi|145234|gb|M33727.1|ECOALPHOE 83333 97 P=1.000 83333:88
+C gi|146195|gb|J01619.1|ECOGLTA 83333 3850 P=1.000 83333:3841
diff --git a/tool_collections/kraken/kraken_filter/test-data/kraken_test1.fa b/tool_collections/kraken/kraken_filter/test-data/kraken_test1.fa
new file mode 100644
index 00000000000..dfc5125a6a6
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/kraken_test1.fa
@@ -0,0 +1,70 @@
+>gi|145231|gb|M33724.1|ECOALPHOA Escherichia coli K-12 truncated PhoA (phoA) gene, partial cds; and transposon Mu dI, partial sequence
+CAAAGCTCCGGGCCTCACCCAGGCGCTAAATACCAAAGATGGCGCAGTGATGGTGATGAGTTACGGGAAC
+TCCGAAGAGGATTCACAAGAACATACCGGCAGTCAGTTGCGTATTGCGGCGTATGGCCCGCATGCCGCCA
+ATGAAGCGGCGCACGAAAAACGCGAAAGCGT
+
+>gi|145232|gb|M33725.1|ECOALPHOB Escherichia coli K12 phoA pseudogene and transposon Mu dl-R, partial sequence
+CTGTCATAAAGTTGTCACGGCCGAGACTTATAGTCGCTTTGTTTTTATTTTTTAATGTATTTGTACATGG
+AGAAAATAAAGTGAAACAAAGCACTATTGCACTGGCACTCTTACCGTTACTGTTTACCCCTGTGACAAAA
+GCCCGGACACCAGTGAAGCGGCGCACGAAAAACGCGAAAGCGT
+
+>gi|145234|gb|M33727.1|ECOALPHOE Escherichia coli K12 upstream sequence of psiA5::Mu dI. is identical to psiA30 upstream sequence; putative (phoA) pseudogene and transposon Mu dl-R, partial sequence
+TTGTTTTTATTTTTTAATGTATTTGTACATGGAGAAAATAAAGTGAAACAAAGCACTATTGCACTGGTGA
+AGCGGCGCACGAAAAACGCGAAAGCGT
+
+>gi|146195|gb|J01619.1|ECOGLTA Eschericia coli gltA gene, sdhCDAB operon and sucABCD operons, complete sequence
+GAATTCGACCGCCATTGCGCAAGGCATCGCCATGACCAGGCAGGATACAAAAGAGAGTCGATAAATATTC
+ACGGTGTCCATACCTGATAAATATTTTATGAAAGGCGGCGATGATGCCGCAAAATAATACTTATTTATAA
+TCCAGCACGTAGGTTGCGTTAGCGGTTACTTCACCTGCCGTGACATCGACTGCATTATCAATTTGTTCCA
+TCCAGGCGAAAAAGTTCAGCGTCTGTTCTGATGAGCTTGCATCCAGGTCAAGATCTGGCGCGGCTGAACC
+TAATACGATGTTACCGTCATTTTTGTCCATCAGTCGTACACCGACCCCAGTTGCTTCGCCTGCACTGGTG
+TTGCTCAACAAAGGCGTAGCACCAGTTGTCTTAGCCGTGCTATCGAAGGTTACGCCAAACTTTGGATACC
+GGCATTCCGCTACCGTTGTCAGAAGCAGGCAGATCACAGTTGATCAAGCGAATGTCGACGGCCACTTTAT
+TGCTATGATGCTCCCGGTTTATATGGGTTGTCGTGACTTGTCCAAGATCTATGTTTTTATCAATATCTTC
+TGGATGAATTTCACAAGGTGCTTCAATAACCTCCCCCTTAAAGTGAATTTCGCCAGAACCTTCATCAGCA
+GCATAAACAGGTGCAGTGAACAGCAGAGATACGGCCAGTGCGGCCAATGTTTTTTGTCCTTTAAACATAA
+CAGAGTCCTTTAAGGATATAGAATAGGGGTATAGCTACGCCAGAATATCGTATTTGATTATTGCTAGTTT
+TTAGTTTTGCTTAAAAAATATTGTTAGTTTTATTAAATTGGAAAACTAAATTATTGGTATCATGAATTGT
+TGTATGATGATAAATATAGGGGGGATATGATAGACGTCATTTTCATAGGGTTATAAAATGCGACTACCAT
+GAAGTTTTTAATTCAAAGTATTGGGTTGCTGATAATTTGAGCTGTTCTATTCTTTTTAAATATCTATATA
+GGTCTGTTAATGGATTTTATTTTTACAAGTTTTTTGTGTTTAGGCATATAAAAATCAAGCCCGCCATATG
+AACGGCGGGTTAAAATATTTACAACTTAGCAATCGAACCATTAACGCTTGATATCGCTTTTAAAGTCGCG
+TTTTTCATATCCTGTATACAGCTGACGCGGACGGGCAATCTTCATACCGTCACTGTGCATTTCGCTCCAG
+TGGGCGATCCAGCCAACGGTACGTGCCATTGCGAAAATGACGGTGAACATGGAAGACGGAATACCCATCG
+CTTTCAGGATGATACCAGAGTAGAAATCGACGTTCGGGTACAGTTTCTTCTCGATAAAGTACGGGTCGTT
+CAGCGCGATGTTTTCCAGCTCCATAGCCACTTCCAGCAGGTCATCCTTCGTGCCCAGCTCTTTCAGCACT
+TCATGGCAGGTTTCACGCATTACGGTGGCGCGCGGGTCGTAATTTTTGTACACGCGGTGACCGAAGCCCA
+TCAGGCGGAAAGAATCATTTTTGTCTTTCGCACGACGAAAAAATTCCGGAATGTGTTTAACGGAGCTGAT
+TTCTTCCAGCATTTTCAGCGCCGCTTCGTTAGCACCGCCGTGCGCAGGTCCCCACAGTGAAGCAATACCT
+GCTGCGATACAGGCAAACGGGTTCGCACCCGAAGAGCCAGCGGTACGCACGGTGGAGGTAGAGGCGTTCT
+GTTCATGGTCAGCGTGCAGGATCAGAATACGGTCCATAGCACGTTCCAGAATCGGATTAACTTCATACGG
+TTCGCACGGCGTGGAGAACATCATATTCAGGAAGTTACCGGCGTAGGAGAGATCGTTGCGCGGGTAAACA
+AATGGCTGACCAATGGAATACTTGTAACACATCGCGGCCATGGTCGGCATTTTCGACAGCAGGCGGAACG
+CGGCAATTTCACGGTGACGAGGATTGTTAACATCCAGCGAGTCGTGATAGAACGCCGCCAGCGCGCCGGT
+AATACCACACATGACTGCCATTGGATGCGAGTCGCGACGGAAAGCATGGAACAGACGGGTAATCTGCTCG
+TGGATCATGGTATGACGGGTCACCGTAGTTTTAAATTCGTCATACTGTTCCTGAGTCGGTTTTTCACCAT
+TCAGCAGGATGTAACAAACTTCCAGGTAGTTAGAATCGGTCGCCAGCTGATCGATCGGGAAACCGCGGTG
+CAGCAAAATACCTTCATCACCATCAATAAAAGTAATTTTAGATTCGCAGGATGCGGTTGAAGTGAAGCCT
+GGGTCAAAGGTGAACACACCTTTTGAACCGAGAGTACGGATATCAATAACATCTTGACCCAGCGTGCCTT
+TCAGCACATCCAGTTCAACAGCTGTATCCCCGTTGAGGGTGAGTTTTGCTTTTGTATCAGCCATTTAAGG
+TCTCCTTAGCGCCTTATTGCGTAAGACTGCCGGAACTTAAATTTGCCTTCGCACATCAACCTGGCTTTAC
+CCGTTTTTTATTTGGCTCGCCGCTCTGTGAAAGAGGGGAAAACCTGGGTACAGAGCTCTGGGCGCTTGCA
+GGTAAAGGATCCATTGATGACGAATAAATGGCGAATCAAGTACTTAGCAATCCGAATTATTAAACTTGTC
+TACCACTAATAACTGTCCCGAATGAATTGGTCAATACTCCACACTGTTACATAAGTTAATCTTAGGTGAA
+ATACCGACTTCATAACTTTTACGCATTATATGCTTTTCCTGGTAATGTTTGTAACAACTTTGTTGAATGA
+TTGTCAAATTAGATGATTAAAAATTAAATAAATGTTGTTATCGTGACCTGGATCACTGTTCAGGATAAAA
+CCCGACAAACTATATGTAGGTTAATTGTAATGATTTTGTGAACAGCCTATACTGCCGCCAGTCTCCGGAA
+CACCCTGCAATCCCGAGCCACCCAGCGTTGTAACGTGTCGTTTTCGCATCTGGAAGCAGTGTTTTGCATG
+ACGCGCAGTTATAGAAAGGACGCTGTCTGACCCGCAAGCAGACCGGAGGAAGGAAATCCCGACGTCTCCA
+GGTAACAGAAAGTTAACCTCTGTGCCCGTAGTCCCCAGGGAATAATAAGAACAGCATGTGGGCGTTATTC
+ATGATAAGAAATGTGAAAAAACAAAGACCTGTTAATCTGGACCTACAGACCATCCGGTTCCCCATCACGG
+CGATAGCGTCCATTCTCCATCGCGTTTCCGGTGTGATCACCTTTGTTGCAGTGGGCATCCTGCTGTGGCT
+TCTGGGTACCAGCCTCTCTTCCCCTGAAGGTTTCGAGCAAGCTTCCGCGATTATGGGCAGCTTCTTCGTC
+AAATTTATCATGTGGGGCATCCTTACCGCTCTGGCGTATCACGTCGTCGTAGGTATTCGCCACATGATGA
+TGGATTTTGGCTATCTGGAAGAAACATTCGAAGCGGGTAAACGCTCCGCCAAAATCTCCTTTGTTATTAC
+TGTCGTGCTTTCACTTCTCGCAGGAGTCCTCGTATGGTAAGCAACGCCTCCGCATTAGGACGCAATGGCG
+TACATGATTTCATCCTCGTTCGCGCTACCGCTATCGTCCTGACGCTCTACATCATTTATATGGTCGGTTT
+TTTCGCTACCAGTGGCGAGCTGACATATGAAGTCTGGATCGGTTTCTTCGCCTCTGCGTTCACCAAAGTG
+TTCACCCTGCTGGCGCTGTTTTCTATCTTGATCCATGCCTGGATCGGCATGTGGCAGGTGTTGACCGACT
+ACGTTAAACCGCTGGCTTTGCGCCTGATGCTGCAACTGGTGATTGTCGTTGCACTGGTGGTTTACGTGAT
+TTATGGATTCGTTGTGGTGTGGGGTGTGTGATGAAATTGCCAGTCAGAGAATTTGATGCAGTTGTGATTG
diff --git a/tool_collections/kraken/kraken_filter/test-data/test_database.loc b/tool_collections/kraken/kraken_filter/test-data/test_database.loc
new file mode 120000
index 00000000000..03ec4336e45
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/test_database.loc
@@ -0,0 +1 @@
+../../test_database.loc
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/test-data/test_db/database.idx b/tool_collections/kraken/kraken_filter/test-data/test_db/database.idx
new file mode 120000
index 00000000000..9251273d383
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/test_db/database.idx
@@ -0,0 +1 @@
+../../../database.idx
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/test-data/test_db/database.kdb b/tool_collections/kraken/kraken_filter/test-data/test_db/database.kdb
new file mode 120000
index 00000000000..64efac5ce08
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/test_db/database.kdb
@@ -0,0 +1 @@
+../../../database.kdb
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/test-data/test_db/taxonomy/names.dmp b/tool_collections/kraken/kraken_filter/test-data/test_db/taxonomy/names.dmp
new file mode 120000
index 00000000000..f4585471fc2
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/test_db/taxonomy/names.dmp
@@ -0,0 +1 @@
+../../../../names.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/test-data/test_db/taxonomy/nodes.dmp b/tool_collections/kraken/kraken_filter/test-data/test_db/taxonomy/nodes.dmp
new file mode 120000
index 00000000000..a65d7d41f87
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/test-data/test_db/taxonomy/nodes.dmp
@@ -0,0 +1 @@
+../../../../nodes.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/tool-data/kraken_databases.loc.sample b/tool_collections/kraken/kraken_filter/tool-data/kraken_databases.loc.sample
new file mode 120000
index 00000000000..eba8983e9ea
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/tool-data/kraken_databases.loc.sample
@@ -0,0 +1 @@
+../../kraken_databases.loc.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/tool_data_table_conf.xml.sample b/tool_collections/kraken/kraken_filter/tool_data_table_conf.xml.sample
new file mode 120000
index 00000000000..332baf440e9
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/tool_data_table_conf.xml.sample
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_filter/tool_data_table_conf.xml.test b/tool_collections/kraken/kraken_filter/tool_data_table_conf.xml.test
new file mode 120000
index 00000000000..f8050b61701
--- /dev/null
+++ b/tool_collections/kraken/kraken_filter/tool_data_table_conf.xml.test
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.test
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/.shed.yml b/tool_collections/kraken/kraken_report/.shed.yml
new file mode 100644
index 00000000000..344d485e952
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/.shed.yml
@@ -0,0 +1,16 @@
+categories:
+- Metagenomics
+description: Kraken report.
+homepage_url: http://ccb.jhu.edu/software/kraken/
+long_description: |
+ Kraken is a system for assigning taxonomic labels to short DNA
+ sequences, usually obtained through metagenomic studies. Previous attempts by other
+ bioinformatics software to accomplish this task have often used sequence alignment
+ or machine learning techniques that were quite slow, leading to the development
+ of less sensitive but much faster abundance estimation programs. Kraken aims to
+ achieve high sensitivity and high speed by utilizing exact alignments of k-mers
+ and a novel classification algorithm.
+name: kraken_report
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/blob/master/tool_collections/kraken/kraken_report/
+type: unrestricted
diff --git a/tool_collections/kraken/kraken_report/README.rst b/tool_collections/kraken/kraken_report/README.rst
new file mode 120000
index 00000000000..89a0106941f
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/README.rst
@@ -0,0 +1 @@
+../README.rst
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/kraken-mpa-report.xml b/tool_collections/kraken/kraken_report/kraken-mpa-report.xml
new file mode 100644
index 00000000000..8b24450c4ef
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/kraken-mpa-report.xml
@@ -0,0 +1,81 @@
+
+
+ view report of classification for multiple samples
+
+ macros.xml
+
+
+
+ '$output_report'
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/tool_collections/kraken/kraken_report/kraken-report.xml b/tool_collections/kraken/kraken_report/kraken-report.xml
new file mode 100644
index 00000000000..062c09897c4
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/kraken-report.xml
@@ -0,0 +1,56 @@
+
+ view sample report of a classification
+
+
+ macros.xml
+
+
+
+ '$output_report'
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/tool_collections/kraken/kraken_report/macros.xml b/tool_collections/kraken/kraken_report/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_input1.tab b/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_input1.tab
new file mode 100644
index 00000000000..2aa9e1ea8d5
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_input1.tab
@@ -0,0 +1,4 @@
+C gi|145231|gb|M33724.1|ECOALPHOA 83333 171 83333:162
+C gi|145232|gb|M33725.1|ECOALPHOB 83333 183 83333:174
+C gi|145234|gb|M33727.1|ECOALPHOE 83333 97 83333:88
+C gi|146195|gb|J01619.1|ECOGLTA 83333 3850 83333:3841
diff --git a/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_input2.tab b/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_input2.tab
new file mode 100644
index 00000000000..2aa9e1ea8d5
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_input2.tab
@@ -0,0 +1,4 @@
+C gi|145231|gb|M33724.1|ECOALPHOA 83333 171 83333:162
+C gi|145232|gb|M33725.1|ECOALPHOB 83333 183 83333:174
+C gi|145234|gb|M33727.1|ECOALPHOE 83333 97 83333:88
+C gi|146195|gb|J01619.1|ECOGLTA 83333 3850 83333:3841
diff --git a/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_test1_output.tab b/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_test1_output.tab
new file mode 100644
index 00000000000..ac03dc264ef
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/kraken_mpa_report_test1_output.tab
@@ -0,0 +1,8 @@
+#Sample ID kraken_mpa_report_input1.tab kraken_mpa_report_input2.tab
+d__Bacteria 4 4
+d__Bacteria|p__Proteobacteria 4 4
+d__Bacteria|p__Proteobacteria|c__Gammaproteobacteria 4 4
+d__Bacteria|p__Proteobacteria|c__Gammaproteobacteria|o__Enterobacteriales 4 4
+d__Bacteria|p__Proteobacteria|c__Gammaproteobacteria|o__Enterobacteriales|f__Enterobacteriaceae 4 4
+d__Bacteria|p__Proteobacteria|c__Gammaproteobacteria|o__Enterobacteriales|f__Enterobacteriaceae|g__Escherichia 4 4
+d__Bacteria|p__Proteobacteria|c__Gammaproteobacteria|o__Enterobacteriales|f__Enterobacteriaceae|g__Escherichia|s__Escherichia_coli 4 4
diff --git a/tool_collections/kraken/kraken_report/test-data/kraken_report_test1.tab b/tool_collections/kraken/kraken_report/test-data/kraken_report_test1.tab
new file mode 100644
index 00000000000..2aa9e1ea8d5
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/kraken_report_test1.tab
@@ -0,0 +1,4 @@
+C gi|145231|gb|M33724.1|ECOALPHOA 83333 171 83333:162
+C gi|145232|gb|M33725.1|ECOALPHOB 83333 183 83333:174
+C gi|145234|gb|M33727.1|ECOALPHOE 83333 97 83333:88
+C gi|146195|gb|J01619.1|ECOGLTA 83333 3850 83333:3841
diff --git a/tool_collections/kraken/kraken_report/test-data/kraken_report_test1_output.tab b/tool_collections/kraken/kraken_report/test-data/kraken_report_test1_output.tab
new file mode 100644
index 00000000000..1fcaa3e8cd7
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/kraken_report_test1_output.tab
@@ -0,0 +1,11 @@
+ 0.00 0 0 U 0 unclassified
+100.00 4 0 - 1 root
+100.00 4 0 - 131567 cellular organisms
+100.00 4 0 D 2 Bacteria
+100.00 4 0 P 1224 Proteobacteria
+100.00 4 0 C 1236 Gammaproteobacteria
+100.00 4 0 O 91347 Enterobacteriales
+100.00 4 0 F 543 Enterobacteriaceae
+100.00 4 0 G 561 Escherichia
+100.00 4 0 S 562 Escherichia coli
+100.00 4 4 - 83333 Escherichia coli K-12
diff --git a/tool_collections/kraken/kraken_report/test-data/test_database.loc b/tool_collections/kraken/kraken_report/test-data/test_database.loc
new file mode 120000
index 00000000000..03ec4336e45
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/test_database.loc
@@ -0,0 +1 @@
+../../test_database.loc
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/test-data/test_db/database.idx b/tool_collections/kraken/kraken_report/test-data/test_db/database.idx
new file mode 120000
index 00000000000..9251273d383
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/test_db/database.idx
@@ -0,0 +1 @@
+../../../database.idx
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/test-data/test_db/database.kdb b/tool_collections/kraken/kraken_report/test-data/test_db/database.kdb
new file mode 120000
index 00000000000..64efac5ce08
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/test_db/database.kdb
@@ -0,0 +1 @@
+../../../database.kdb
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/test-data/test_db/taxonomy/names.dmp b/tool_collections/kraken/kraken_report/test-data/test_db/taxonomy/names.dmp
new file mode 120000
index 00000000000..f4585471fc2
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/test_db/taxonomy/names.dmp
@@ -0,0 +1 @@
+../../../../names.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/test-data/test_db/taxonomy/nodes.dmp b/tool_collections/kraken/kraken_report/test-data/test_db/taxonomy/nodes.dmp
new file mode 120000
index 00000000000..a65d7d41f87
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/test-data/test_db/taxonomy/nodes.dmp
@@ -0,0 +1 @@
+../../../../nodes.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/tool-data/kraken_databases.loc.sample b/tool_collections/kraken/kraken_report/tool-data/kraken_databases.loc.sample
new file mode 120000
index 00000000000..eba8983e9ea
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/tool-data/kraken_databases.loc.sample
@@ -0,0 +1 @@
+../../kraken_databases.loc.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/tool_data_table_conf.xml.sample b/tool_collections/kraken/kraken_report/tool_data_table_conf.xml.sample
new file mode 120000
index 00000000000..332baf440e9
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/tool_data_table_conf.xml.sample
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_report/tool_data_table_conf.xml.test b/tool_collections/kraken/kraken_report/tool_data_table_conf.xml.test
new file mode 120000
index 00000000000..f8050b61701
--- /dev/null
+++ b/tool_collections/kraken/kraken_report/tool_data_table_conf.xml.test
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.test
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/.shed.yml b/tool_collections/kraken/kraken_translate/.shed.yml
new file mode 100644
index 00000000000..6e54a30ca82
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/.shed.yml
@@ -0,0 +1,16 @@
+homepage_url: http://ccb.jhu.edu/software/kraken/
+categories:
+- Metagenomics
+description: Kraken translate command.
+long_description: |
+ Kraken is a system for assigning taxonomic labels to short DNA
+ sequences, usually obtained through metagenomic studies. Previous attempts by other
+ bioinformatics software to accomplish this task have often used sequence alignment
+ or machine learning techniques that were quite slow, leading to the development
+ of less sensitive but much faster abundance estimation programs. Kraken aims to
+ achieve high sensitivity and high speed by utilizing exact alignments of k-mers
+ and a novel classification algorithm.
+name: kraken_translate
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/blob/master/tool_collections/kraken/kraken_translate/
+type: unrestricted
diff --git a/tool_collections/kraken/kraken_translate/README.rst b/tool_collections/kraken/kraken_translate/README.rst
new file mode 120000
index 00000000000..89a0106941f
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/README.rst
@@ -0,0 +1 @@
+../README.rst
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/kraken-translate.xml b/tool_collections/kraken/kraken_translate/kraken-translate.xml
new file mode 100644
index 00000000000..44ef1375deb
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/kraken-translate.xml
@@ -0,0 +1,61 @@
+
+ convert taxonomy IDs to names
+
+ macros.xml
+
+
+
+ '${translated}'
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/tool_collections/kraken/kraken_translate/macros.xml b/tool_collections/kraken/kraken_translate/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/test-data/kraken_translate_test1.tab b/tool_collections/kraken/kraken_translate/test-data/kraken_translate_test1.tab
new file mode 100644
index 00000000000..2aa9e1ea8d5
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/test-data/kraken_translate_test1.tab
@@ -0,0 +1,4 @@
+C gi|145231|gb|M33724.1|ECOALPHOA 83333 171 83333:162
+C gi|145232|gb|M33725.1|ECOALPHOB 83333 183 83333:174
+C gi|145234|gb|M33727.1|ECOALPHOE 83333 97 83333:88
+C gi|146195|gb|J01619.1|ECOGLTA 83333 3850 83333:3841
diff --git a/tool_collections/kraken/kraken_translate/test-data/kraken_translate_test1_output.tab b/tool_collections/kraken/kraken_translate/test-data/kraken_translate_test1_output.tab
new file mode 100644
index 00000000000..e187cd5469c
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/test-data/kraken_translate_test1_output.tab
@@ -0,0 +1,4 @@
+gi|145231|gb|M33724.1|ECOALPHOA root;cellular organisms;Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacteriales;Enterobacteriaceae;Escherichia;Escherichia coli;Escherichia coli K-12
+gi|145232|gb|M33725.1|ECOALPHOB root;cellular organisms;Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacteriales;Enterobacteriaceae;Escherichia;Escherichia coli;Escherichia coli K-12
+gi|145234|gb|M33727.1|ECOALPHOE root;cellular organisms;Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacteriales;Enterobacteriaceae;Escherichia;Escherichia coli;Escherichia coli K-12
+gi|146195|gb|J01619.1|ECOGLTA root;cellular organisms;Bacteria;Proteobacteria;Gammaproteobacteria;Enterobacteriales;Enterobacteriaceae;Escherichia;Escherichia coli;Escherichia coli K-12
diff --git a/tool_collections/kraken/kraken_translate/test-data/test_database.loc b/tool_collections/kraken/kraken_translate/test-data/test_database.loc
new file mode 120000
index 00000000000..03ec4336e45
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/test-data/test_database.loc
@@ -0,0 +1 @@
+../../test_database.loc
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/test-data/test_db/database.idx b/tool_collections/kraken/kraken_translate/test-data/test_db/database.idx
new file mode 120000
index 00000000000..9251273d383
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/test-data/test_db/database.idx
@@ -0,0 +1 @@
+../../../database.idx
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/test-data/test_db/database.kdb b/tool_collections/kraken/kraken_translate/test-data/test_db/database.kdb
new file mode 120000
index 00000000000..64efac5ce08
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/test-data/test_db/database.kdb
@@ -0,0 +1 @@
+../../../database.kdb
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/test-data/test_db/taxonomy/names.dmp b/tool_collections/kraken/kraken_translate/test-data/test_db/taxonomy/names.dmp
new file mode 120000
index 00000000000..f4585471fc2
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/test-data/test_db/taxonomy/names.dmp
@@ -0,0 +1 @@
+../../../../names.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/test-data/test_db/taxonomy/nodes.dmp b/tool_collections/kraken/kraken_translate/test-data/test_db/taxonomy/nodes.dmp
new file mode 120000
index 00000000000..a65d7d41f87
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/test-data/test_db/taxonomy/nodes.dmp
@@ -0,0 +1 @@
+../../../../nodes.dmp
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/tool-data/kraken_databases.loc.sample b/tool_collections/kraken/kraken_translate/tool-data/kraken_databases.loc.sample
new file mode 120000
index 00000000000..eba8983e9ea
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/tool-data/kraken_databases.loc.sample
@@ -0,0 +1 @@
+../../kraken_databases.loc.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/tool_data_table_conf.xml.sample b/tool_collections/kraken/kraken_translate/tool_data_table_conf.xml.sample
new file mode 120000
index 00000000000..332baf440e9
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/tool_data_table_conf.xml.sample
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.sample
\ No newline at end of file
diff --git a/tool_collections/kraken/kraken_translate/tool_data_table_conf.xml.test b/tool_collections/kraken/kraken_translate/tool_data_table_conf.xml.test
new file mode 120000
index 00000000000..f8050b61701
--- /dev/null
+++ b/tool_collections/kraken/kraken_translate/tool_data_table_conf.xml.test
@@ -0,0 +1 @@
+../tool_data_table_conf.xml.test
\ No newline at end of file
diff --git a/tool_collections/kraken/macros.xml b/tool_collections/kraken/macros.xml
new file mode 100644
index 00000000000..1a47b71a032
--- /dev/null
+++ b/tool_collections/kraken/macros.xml
@@ -0,0 +1,27 @@
+
+
+
+
+ kraken
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+ 10.1186/gb-2014-15-3-r46
+
+
+ --db ${kraken_database.fields.name}
+ export KRAKEN_DB_PATH="${kraken_database.fields.path}"
+
diff --git a/tool_collections/kraken/names.dmp b/tool_collections/kraken/names.dmp
new file mode 100644
index 00000000000..fa5e7ef33d4
--- /dev/null
+++ b/tool_collections/kraken/names.dmp
@@ -0,0 +1,74 @@
+83333 | Escherichia coli K-12 | | scientific name |
+83333 | Escherichia coli K12 | | equivalent name |
+562 | "Bacillus coli" Migula 1895 | | authority |
+562 | "Bacterium coli commune" Escherich 1885 | | authority |
+562 | "Bacterium coli" (Migula 1895) Lehmann and Neumann 1896 | | authority |
+562 | ATCC 11775 | | type material |
+562 | Bacillus coli | | synonym |
+562 | Bacterium coli | | synonym |
+562 | Bacterium coli commune | | synonym |
+562 | CCUG 24 | | type material |
+562 | CCUG 29300 | | type material |
+562 | CIP 54.8 | | type material |
+562 | DSM 30083 | | type material |
+562 | Enterococcus coli | | synonym |
+562 | Escherchia coli | | misspelling |
+562 | Escherichia coli | | scientific name |
+562 | Escherichia coli (Migula 1895) Castellani and Chalmers 1919 | | authority |
+562 | Escherichia sp. MAR | | includes |
+562 | Escherichia/Shigella coli | | equivalent name |
+562 | Eschericia coli | | misspelling |
+562 | JCM 1649 | | type material |
+562 | LMG 2092 | | type material |
+562 | NBRC 102203 | | type material |
+562 | NCCB 54008 | | type material |
+562 | NCTC 9001 | | type material |
+562 | bacterium 10a | | includes |
+562 | bacterium E3 | | includes |
+561 | Escherchia | | misspelling |
+561 | Escherichia | | scientific name |
+561 | Escherichia Castellani and Chalmers 1919 | | authority |
+543 | Enterobacteraceae | | synonym |
+543 | Enterobacteraceae (ex Lapage 1979) Lapage 1982, fam. nov., nom. rev. | | synonym |
+543 | Enterobacteriaceae | | scientific name |
+543 | Enterobacteriaceae (ex Rahn 1937) Ewing et al. 1980, fam. nov., nom. rev. | | synonym |
+543 | Enterobacteriaceae Rahn 1937 | | synonym |
+543 | gamma-3 proteobacteria | gamma-3 proteobacteria <#1> | in-part |
+91347 | 'Enterobacteriales' | | synonym |
+91347 | Enterobacteriaceae and related endosymbionts | | synonym |
+91347 | Enterobacteriaceae group | | synonym |
+91347 | Enterobacteriales | | scientific name |
+91347 | enterobacteria | enterobacteria | blast name |
+91347 | gamma-3 proteobacteria | gamma-3 proteobacteria <#5> | in-part |
+1236 | Gammaproteobacteria | | scientific name |
+1236 | Gammaproteobacteria Garrity et al. 2005 | | synonym |
+1236 | Proteobacteria gamma subdivision | | synonym |
+1236 | Purple bacteria, gamma subdivision | | synonym |
+1236 | g-proteobacteria | gamma proteos | blast name |
+1236 | gamma proteobacteria | | synonym |
+1236 | gamma subdivision | | synonym |
+1236 | gamma subgroup | | synonym |
+1224 | Proteobacteria | | scientific name |
+1224 | Proteobacteria Garrity et al. 2005 | | authority |
+1224 | Proteobacteria [class] Stackebrandt et al. 1988 | | authority |
+1224 | not Proteobacteria Cavalier-Smith 2002 | | authority |
+1224 | proteobacteria | proteobacteria | blast name |
+1224 | purple bacteria | | common name |
+1224 | purple bacteria and relatives | | common name |
+1224 | purple non-sulfur bacteria | | common name |
+1224 | purple photosynthetic bacteria | | common name |
+1224 | purple photosynthetic bacteria and relatives | | common name |
+2 | Bacteria | Bacteria | scientific name |
+2 | Monera | Monera | in-part |
+2 | Procaryotae | Procaryotae | in-part |
+2 | Prokaryota | Prokaryota | in-part |
+2 | Prokaryotae | Prokaryotae | in-part |
+2 | bacteria | bacteria | blast name |
+2 | eubacteria | | genbank common name |
+2 | not Bacteria Haeckel 1894 | | synonym |
+2 | prokaryote | prokaryote | in-part |
+2 | prokaryotes | prokaryotes | in-part |
+1 | all | | synonym |
+1 | root | | scientific name |
+131567 | biota | | synonym |
+131567 | cellular organisms | | scientific name |
diff --git a/tool_collections/kraken/nodes.dmp b/tool_collections/kraken/nodes.dmp
new file mode 100644
index 00000000000..4e292c27101
--- /dev/null
+++ b/tool_collections/kraken/nodes.dmp
@@ -0,0 +1,10 @@
+83333 | 562 | no rank | | 0 | 1 | 11 | 1 | 0 | 1 | 1 | 0 | |
+562 | 561 | species | EC | 0 | 1 | 11 | 1 | 0 | 1 | 1 | 0 | |
+561 | 543 | genus | | 0 | 1 | 11 | 1 | 0 | 1 | 0 | 0 | |
+543 | 91347 | family | | 0 | 1 | 11 | 1 | 0 | 1 | 0 | 0 | |
+91347 | 1236 | order | | 0 | 1 | 11 | 1 | 0 | 1 | 0 | 0 | |
+1236 | 1224 | class | | 0 | 1 | 11 | 1 | 0 | 1 | 0 | 0 | |
+1224 | 2 | phylum | | 0 | 1 | 11 | 1 | 0 | 1 | 0 | 0 | |
+2 | 131567 | superkingdom | | 0 | 0 | 11 | 0 | 0 | 0 | 0 | 0 | |
+131567 | 1 | no rank | | 8 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | |
+1 | 1 | no rank | | 8 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | |
diff --git a/tool_collections/kraken/test_database.loc b/tool_collections/kraken/test_database.loc
new file mode 100644
index 00000000000..8441f152d69
--- /dev/null
+++ b/tool_collections/kraken/test_database.loc
@@ -0,0 +1 @@
+test_db test_db ${__HERE__}
\ No newline at end of file
diff --git a/tool_collections/kraken/tool_data_table_conf.xml.sample b/tool_collections/kraken/tool_data_table_conf.xml.sample
new file mode 100644
index 00000000000..c7763ac28af
--- /dev/null
+++ b/tool_collections/kraken/tool_data_table_conf.xml.sample
@@ -0,0 +1,8 @@
+
+
+
+
+
\ No newline at end of file
diff --git a/tool_collections/samtools/samtools_bedcov/.shed.yml b/tool_collections/samtools/samtools_bedcov/.shed.yml
new file mode 100644
index 00000000000..dd880132524
--- /dev/null
+++ b/tool_collections/samtools/samtools_bedcov/.shed.yml
@@ -0,0 +1,11 @@
+categories:
+- SAM
+- Sequence Analysis
+description: Calculate read depth on BAM files
+long_description: |
+ This tool uses the SAMtools toolkit to produce read depth per BED region.
+name: samtools_bedcov
+owner: devteam
+homepage_url: http://www.htslib.org/
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/samtools/samtools_bedcov
+type: unrestricted
diff --git a/tool_collections/samtools/samtools_bedcov/macros.xml b/tool_collections/samtools/samtools_bedcov/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/samtools/samtools_bedcov/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/samtools/samtools_bedcov/samtools_bedcov.xml b/tool_collections/samtools/samtools_bedcov/samtools_bedcov.xml
new file mode 100644
index 00000000000..2172e5d2e00
--- /dev/null
+++ b/tool_collections/samtools/samtools_bedcov/samtools_bedcov.xml
@@ -0,0 +1,52 @@
+
+ calculate read depth for a set of genomic intervals
+
+ macros.xml
+
+
+
+
+ '${output}'
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+**What it does**
+
+Calculates read depth for regions listed in a BED dataset using ``samtools bedcov`` command::
+
+ samtools bedcov [INPUT BED] [INPUT BAM1] ... [INPUT BAMn] > [OUTPUT]
+
+
+
+
+
diff --git a/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.1.bed b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.1.bed
new file mode 100644
index 00000000000..d25dd55f082
--- /dev/null
+++ b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.1.bed
@@ -0,0 +1,3 @@
+eboVir3 500 1500
+eboVir3 1500 2000
+eboVir3 1500 3000
diff --git a/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.2.bam b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.2.bam
new file mode 100644
index 00000000000..2fdd7354b8b
Binary files /dev/null and b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.2.bam differ
diff --git a/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.2.bed b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.2.bed
new file mode 100644
index 00000000000..d25dd55f082
--- /dev/null
+++ b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.2.bed
@@ -0,0 +1,3 @@
+eboVir3 500 1500
+eboVir3 1500 2000
+eboVir3 1500 3000
diff --git a/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.bam b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.bam
new file mode 100644
index 00000000000..c0e64957d92
Binary files /dev/null and b/tool_collections/samtools/samtools_bedcov/test-data/eboVir3.bam differ
diff --git a/tool_collections/samtools/samtools_bedcov/test-data/samtools_bedcov_out1.tab b/tool_collections/samtools/samtools_bedcov/test-data/samtools_bedcov_out1.tab
new file mode 100644
index 00000000000..ce32b596e0b
--- /dev/null
+++ b/tool_collections/samtools/samtools_bedcov/test-data/samtools_bedcov_out1.tab
@@ -0,0 +1,3 @@
+eboVir3 500 1500 7919
+eboVir3 1500 2000 3009
+eboVir3 1500 3000 9986
diff --git a/tool_collections/samtools/samtools_bedcov/test-data/samtools_bedcov_out2.tab b/tool_collections/samtools/samtools_bedcov/test-data/samtools_bedcov_out2.tab
new file mode 100644
index 00000000000..4a12f060dfe
--- /dev/null
+++ b/tool_collections/samtools/samtools_bedcov/test-data/samtools_bedcov_out2.tab
@@ -0,0 +1,3 @@
+eboVir3 500 1500 7919 2094
+eboVir3 1500 2000 3009 1064
+eboVir3 1500 3000 9986 3455
diff --git a/tool_collections/samtools/samtools_calmd/.shed.yml b/tool_collections/samtools/samtools_calmd/.shed.yml
new file mode 100644
index 00000000000..5a33b501fc9
--- /dev/null
+++ b/tool_collections/samtools/samtools_calmd/.shed.yml
@@ -0,0 +1,10 @@
+categories:
+- SAM
+description: recalculate MD/NM tags
+long_description: |
+ Generates the MD tag using ``samtools calmd`` command.
+name: samtools_calmd
+owner: devteam
+homepage_url: http://www.htslib.org/
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/samtools/samtools_calmd
+type: unrestricted
diff --git a/tool_collections/samtools/samtools_calmd/macros.xml b/tool_collections/samtools/samtools_calmd/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/samtools/samtools_calmd/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/samtools/samtools_calmd/samtools_calmd.xml b/tool_collections/samtools/samtools_calmd/samtools_calmd.xml
new file mode 100644
index 00000000000..c7eb1450694
--- /dev/null
+++ b/tool_collections/samtools/samtools_calmd/samtools_calmd.xml
@@ -0,0 +1,103 @@
+
+ recalculate MD/NM tags
+
+ macros.xml
+
+
+
+
+ "$calmd_output"
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+**What it does**
+
+Generates the MD tag using ``samtools calmd`` command. If the MD tag (see SAM format refernce below for explanation of SAM/BAM tags) is already present, this command will give a warning if the MD tag generated is different from the existing tag. Outputs a BAM file. The command has the following options::
+
+ -e change identical bases to '='
+ -A modify the quality string
+ -r compute the BQ tag (without -A) or cap baseQ by BAQ (with -A)
+ -E extended BAQ for better sensitivity but lower specificity
+
+-----
+
+**NM and MD tags**
+
+From SAM format specification::
+
+ MD (string) String for mismatching positions. Regex : [0-9]+(([A-Z]|\^[A-Z]+)[0-9]+)*7
+ NM (indeger) Edit distance to the reference, including ambiguous bases but excluding clipping
+
+See refernces for more information about SAM format tags.
+
+
+
+
diff --git a/tool_collections/samtools/samtools_calmd/test-data/phiX.bam b/tool_collections/samtools/samtools_calmd/test-data/phiX.bam
new file mode 100644
index 00000000000..1375552cf36
Binary files /dev/null and b/tool_collections/samtools/samtools_calmd/test-data/phiX.bam differ
diff --git a/tool_collections/samtools/samtools_calmd/test-data/phiX.fasta b/tool_collections/samtools/samtools_calmd/test-data/phiX.fasta
new file mode 100644
index 00000000000..53df885dc48
--- /dev/null
+++ b/tool_collections/samtools/samtools_calmd/test-data/phiX.fasta
@@ -0,0 +1,79 @@
+>phiX174
+GAGTTTTATCGCTTCCATGACGCAGAAGTTAACACTTTCGGATATTTCTGATGAGTCGAAAAATTATCTT
+GATAAAGCAGGAATTACTACTGCTTGTTTACGAATTAAATCGAAGTGGACTGCTGGCGGAAAATGAGAAA
+ATTCGACCTATCCTTGCGCAGCTCGAGAAGCTCTTACTTTGCGACCTTTCGCCATCAACTAACGATTCTG
+TCAAAAACTGACGCGTTGGATGAGGAGAAGTGGCTTAATATGCTTGGCACGTTCGTCAAGGACTGGTTTA
+GATATGAGTCACATTTTGTTCATGGTAGAGATTCTCTTGTTGACATTTTAAAAGAGCGTGGATTACTATC
+TGAGTCCGATGCTGTTCAACCACTAATAGGTAAGAAATCATGAGTCAAGTTACTGAACAATCCGTACGTT
+TCCAGACCGCTTTGGCCTCTATTAAGCTCATTCAGGCTTCTGCCGTTTTGGATTTAACCGAAGATGATTT
+CGATTTTCTGACGAGTAACAAAGTTTGGATTGCTACTGACCGCTCTCGTGCTCGTCGCTGCGTTGAGGCT
+TGCGTTTATGGTACGCTGGACTTTGTGGGATACCCTCGCTTTCCTGCTCCTGTTGAGTTTATTGCTGCCG
+TCATTGCTTATTATGTTCATCCCGTCAACATTCAAACGGCCTGTCTCATCATGGAAGGCGCTGAATTTAC
+GGAAAACATTATTAATGGCGTCGAGCGTCCGGTTAAAGCCGCTGAATTGTTCGCGTTTACCTTGCGTGTA
+CGCGCAGGAAACACTGACGTTCTTACTGACGCAGAAGAAAACGTGCGTCAAAAATTACGTGCAGAAGGAG
+TGATGTAATGTCTAAAGGTAAAAAACGTTCTGGCGCTCGCCCTGGTCGTCCGCAGCCGTTGCGAGGTACT
+AAAGGCAAGCGTAAAGGCGCTCGTCTTTGGTATGTAGGTGGTCAACAATTTTAATTGCAGGGGCTTCGGC
+CCCTTACTTGAGGATAAATTATGTCTAATATTCAAACTGGCGCCGAGCGTATGCCGCATGACCTTTCCCA
+TCTTGGCTTCCTTGCTGGTCAGATTGGTCGTCTTATTACCATTTCAACTACTCCGGTTATCGCTGGCGAC
+TCCTTCGAGATGGACGCCGTTGGCGCTCTCCGTCTTTCTCCATTGCGTCGTGGCCTTGCTATTGACTCTA
+CTGTAGACATTTTTACTTTTTATGTCCCTCATCGTCACGTTTATGGTGAACAGTGGATTAAGTTCATGAA
+GGATGGTGTTAATGCCACTCCTCTCCCGACTGTTAACACTACTGGTTATATTGACCATGCCGCTTTTCTT
+GGCACGATTAACCCTGATACCAATAAAATCCCTAAGCATTTGTTTCAGGGTTATTTGAATATCTATAACA
+ACTATTTTAAAGCGCCGTGGATGCCTGACCGTACCGAGGCTAACCCTAATGAGCTTAATCAAGATGATGC
+TCGTTATGGTTTCCGTTGCTGCCATCTCAAAAACATTTGGACTGCTCCGCTTCCTCCTGAGACTGAGCTT
+TCTCGCCAAATGACGACTTCTACCACATCTATTGACATTATGGGTCTGCAAGCTGCTTATGCTAATTTGC
+ATACTGACCAAGAACGTGATTACTTCATGCAGCGTTACCGTGATGTTATTTCTTCATTTGGAGGTAAAAC
+CTCTTATGACGCTGACAACCGTCCTTTACTTGTCATGCGCTCTAATCTCTGGGCATCTGGCTATGATGTT
+GATGGAACTGACCAAACGTCGTTAGGCCAGTTTTCTGGTCGTGTTCAACAGACCTATAAACATTCTGTGC
+CGCGTTTCTTTGTTCCTGAGCATGGCACTATGTTTACTCTTGCGCTTGTTCGTTTTCCGCCTACTGCGAC
+TAAAGAGATTCAGTACCTTAACGCTAAAGGTGCTTTGACTTATACCGATATTGCTGGCGACCCTGTTTTG
+TATGGCAACTTGCCGCCGCGTGAAATTTCTATGAAGGATGTTTTCCGTTCTGGTGATTCGTCTAAGAAGT
+TTAAGATTGCTGAGGGTCAGTGGTATCGTTATGCGCCTTCGTATGTTTCTCCTGCTTATCACCTTCTTGA
+AGGCTTCCCATTCATTCAGGAACCGCCTTCTGGTGATTTGCAAGAACGCGTACTTATTCGCCACCATGAT
+TATGACCAGTGTTTCCAGTCCGTTCAGTTGTTGCAGTGGAATAGTCAGGTTAAATTTAATGTGACCGTTT
+ATCGCAATCTGCCGACCACTCGCGATTCAATCATGACTTCGTGATAAAAGATTGAGTGTGAGGTTATAAC
+GCCGAAGCGGTAAAAATTTTAATTTTTGCCGCTGAGGGGTTGACCAAGCGAAGCGCGGTAGGTTTTCTGC
+TTAGGAGTTTAATCATGTTTCAGACTTTTATTTCTCGCCATAATTCAAACTTTTTTTCTGATAAGCTGGT
+TCTCACTTCTGTTACTCCAGCTTCTTCGGCACCTGTTTTACAGACACCTAAAGCTACATCGTCAACGTTA
+TATTTTGATAGTTTGACGGTTAATGCTGGTAATGGTGGTTTTCTTCATTGCATTCAGATGGATACATCTG
+TCAACGCCGCTAATCAGGTTGTTTCTGTTGGTGCTGATATTGCTTTTGATGCCGACCCTAAATTTTTTGC
+CTGTTTGGTTCGCTTTGAGTCTTCTTCGGTTCCGACTACCCTCCCGACTGCCTATGATGTTTATCCTTTG
+AATGGTCGCCATGATGGTGGTTATTATACCGTCAAGGACTGTGTGACTATTGACGTCCTTCCCCGTACGC
+CGGGCAATAATGTTTATGTTGGTTTCATGGTTTGGTCTAACTTTACCGCTACTAAATGCCGCGGATTGGT
+TTCGCTGAATCAGGTTATTAAAGAGATTATTTGTCTCCAGCCACTTAAGTGAGGTGATTTATGTTTGGTG
+CTATTGCTGGCGGTATTGCTTCTGCTCTTGCTGGTGGCGCCATGTCTAAATTGTTTGGAGGCGGTCAAAA
+AGCCGCCTCCGGTGGCATTCAAGGTGATGTGCTTGCTACCGATAACAATACTGTAGGCATGGGTGATGCT
+GGTATTAAATCTGCCATTCAAGGCTCTAATGTTCCTAACCCTGATGAGGCCGCCCCTAGTTTTGTTTCTG
+GTGCTATGGCTAAAGCTGGTAAAGGACTTCTTGAAGGTACGTTGCAGGCTGGCACTTCTGCCGTTTCTGA
+TAAGTTGCTTGATTTGGTTGGACTTGGTGGCAAGTCTGCCGCTGATAAAGGAAAGGATACTCGTGATTAT
+CTTGCTGCTGCATTTCCTGAGCTTAATGCTTGGGAGCGTGCTGGTGCTGATGCTTCCTCTGCTGGTATGG
+TTGACGCCGGATTTGAGAATCAAAAAGAGCTTACTAAAATGCAACTGGACAATCAGAAAGAGATTGCCGA
+GATGCAAAATGAGACTCAAAAAGAGATTGCTGGCATTCAGTCGGCGACTTCACGCCAGAATACGAAAGAC
+CAGGTATATGCACAAAATGAGATGCTTGCTTATCAACAGAAGGAGTCTACTGCTCGCGTTGCGTCTATTA
+TGGAAAACACCAATCTTTCCAAGCAACAGCAGGTTTCCGAGATTATGCGCCAAATGCTTACTCAAGCTCA
+AACGGCTGGTCAGTATTTTACCAATGACCAAATCAAAGAAATGACTCGCAAGGTTAGTGCTGAGGTTGAC
+TTAGTTCATCAGCAAACGCAGAATCAGCGGTATGGCTCTTCTCATATTGGCGCTACTGCAAAGGATATTT
+CTAATGTCGTCACTGATGCTGCTTCTGGTGTGGTTGATATTTTTCATGGTATTGATAAAGCTGTTGCCGA
+TACTTGGAACAATTTCTGGAAAGACGGTAAAGCTGATGGTATTGGCTCTAATTTGTCTAGGAAATAACCG
+TCAGGATTGACACCCTCCCAATTGTATGTTTTCATGCCTCCAAATCTTGGAGGCTTTTTTATGGTTCGTT
+CTTATTACCCTTCTGAATGTCACGCTGATTATTTTGACTTTGAGCGTATCGAGGCTCTTAAACCTGCTAT
+TGAGGCTTGTGGCATTTCTACTCTTTCTCAATCCCCAATGCTTGGCTTCCATAAGCAGATGGATAACCGC
+ATCAAGCTCTTGGAAGAGATTCTGTCTTTTCGTATGCAGGGCGTTGAGTTCGATAATGGTGATATGTATG
+TTGACGGCCATAAGGCTGCTTCTGACGTTCGTGATGAGTTTGTATCTGTTACTGAGAAGTTAATGGATGA
+ATTGGCACAATGCTACAATGTGCTCCCCCAACTTGATATTAATAACACTATAGACCACCGCCCCGAAGGG
+GACGAAAAATGGTTTTTAGAGAACGAGAAGACGGTTACGCAGTTTTGCCGCAAGCTGGCTGCTGAACGCC
+CTCTTAAGGATATTCGCGATGAGTATAATTACCCCAAAAAGAAAGGTATTAAGGATGAGTGTTCAAGATT
+GCTGGAGGCCTCCACTATGAAATCGCGTAGAGGCTTTACTATTCAGCGTTTGATGAATGCAATGCGACAG
+GCTCATGCTGATGGTTGGTTTATCGTTTTTGACACTCTCACGTTGGCTGACGACCGATTAGAGGCGTTTT
+ATGATAATCCCAATGCTTTGCGTGACTATTTTCGTGATATTGGTCGTATGGTTCTTGCTGCCGAGGGTCG
+CAAGGCTAATGATTCACACGCCGACTGCTATCAGTATTTTTGTGTGCCTGAGTATGGTACAGCTAATGGC
+CGTCTTCATTTCCATGCGGTGCATTTTATGCGGACACTTCCTACAGGTAGCGTTGACCCTAATTTTGGTC
+GTCGGGTACGCAATCGCCGCCAGTTAAATAGCTTGCAAAATACGTGGCCTTATGGTTACAGTATGCCCAT
+CGCAGTTCGCTACACGCAGGACGCTTTTTCACGTTCTGGTTGGTTGTGGCCTGTTGATGCTAAAGGTGAG
+CCGCTTAAAGCTACCAGTTATATGGCTGTTGGTTTCTATGTGGCTAAATACGTTAACAAAAAGTCAGATA
+TGGACCTTGCTGCTAAAGGTCTAGGAGCTAAAGAATGGAACAACTCACTAAAAACCAAGCTGTCGCTACT
+TCCCAAGAAGCTGTTCAGAATCAGAATGAGCCGCAACTTCGGGATGAAAATGCTCACAATGACAAATCTG
+TCCACGGAGTGCTTAATCCAACTTACCAAGCTGGGTTACGACGCGACGCCGTTCAACCAGATATTGAAGC
+AGAACGCAAAAAGAGAGATGAGATTGAGGCTGGGAAAAGTTACTGTAGCCGACGTTTTGGCGGCGCAACC
+TGTGACGACAAATCTGCTCAAATTTATGCGCGCTTCGATAAAAATGATTGGCGTATCCAACCTGCA
+
diff --git a/tool_collections/samtools/samtools_calmd/test-data/samtools_calmd_out_1.bam b/tool_collections/samtools/samtools_calmd/test-data/samtools_calmd_out_1.bam
new file mode 100644
index 00000000000..8a59449a5d1
Binary files /dev/null and b/tool_collections/samtools/samtools_calmd/test-data/samtools_calmd_out_1.bam differ
diff --git a/tool_collections/samtools/samtools_calmd/test-data/samtools_calmd_out_2.bam b/tool_collections/samtools/samtools_calmd/test-data/samtools_calmd_out_2.bam
new file mode 100644
index 00000000000..90808fd28ed
Binary files /dev/null and b/tool_collections/samtools/samtools_calmd/test-data/samtools_calmd_out_2.bam differ
diff --git a/tool_collections/samtools/samtools_calmd/tool-data/fasta_indexes.loc.sample b/tool_collections/samtools/samtools_calmd/tool-data/fasta_indexes.loc.sample
new file mode 100644
index 00000000000..fd8fb70ac28
--- /dev/null
+++ b/tool_collections/samtools/samtools_calmd/tool-data/fasta_indexes.loc.sample
@@ -0,0 +1,29 @@
+#This is a sample file distributed with Galaxy that enables tools
+#to use a directory of Samtools indexed sequences data files. You will need
+#to create these data files and then create a fasta_indexes.loc file
+#similar to this one (store it in this directory) that points to
+#the directories in which those files are stored. The fasta_indexes.loc
+#file has this format (white space characters are TAB characters):
+#
+#
+#
+#So, for example, if you had hg19 Canonical indexed stored in
+#
+# /depot/data2/galaxy/hg19/sam/,
+#
+#then the fasta_indexes.loc entry would look like this:
+#
+#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /depot/data2/galaxy/hg19/sam/hg19canon.fa
+#
+#and your /depot/data2/galaxy/hg19/sam/ directory
+#would contain hg19canon.fa and hg19canon.fa.fai files.
+#
+#Your fasta_indexes.loc file should include an entry per line for
+#each index set you have stored. The file in the path does actually
+#exist, but it should never be directly used. Instead, the name serves
+#as a prefix for the index file. For example:
+#
+#hg18canon hg18 Human (Homo sapiens): hg18 Canonical /depot/data2/galaxy/hg18/sam/hg18canon.fa
+#hg18full hg18 Human (Homo sapiens): hg18 Full /depot/data2/galaxy/hg18/sam/hg18full.fa
+#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /depot/data2/galaxy/hg19/sam/hg19canon.fa
+#hg19full hg19 Human (Homo sapiens): hg19 Full /depot/data2/galaxy/hg19/sam/hg19full.fa
diff --git a/tool_collections/samtools/samtools_calmd/tool_data_table_conf.xml.sample b/tool_collections/samtools/samtools_calmd/tool_data_table_conf.xml.sample
new file mode 100644
index 00000000000..a1c5e1ad88e
--- /dev/null
+++ b/tool_collections/samtools/samtools_calmd/tool_data_table_conf.xml.sample
@@ -0,0 +1,7 @@
+
+
+
+ value, dbkey, name, path
+
+
+
diff --git a/tool_collections/samtools/samtools_flagstat/.shed.yml b/tool_collections/samtools/samtools_flagstat/.shed.yml
new file mode 100644
index 00000000000..8b20fb87320
--- /dev/null
+++ b/tool_collections/samtools/samtools_flagstat/.shed.yml
@@ -0,0 +1,11 @@
+categories:
+- SAM
+- Statistics
+description: Provides simple stats on BAM files.
+long_description: |
+ This tool uses the SAMtools toolkit to produce simple stats on a BAM file.
+name: samtools_flagstat
+owner: devteam
+homepage_url: http://www.htslib.org/
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/samtools/samtools_flagstat
+type: unrestricted
diff --git a/tool_collections/samtools/samtools_flagstat/macros.xml b/tool_collections/samtools/samtools_flagstat/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/samtools/samtools_flagstat/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/samtools/samtools_flagstat/samtools_flagstat.xml b/tool_collections/samtools/samtools_flagstat/samtools_flagstat.xml
new file mode 100644
index 00000000000..b78d6feaa43
--- /dev/null
+++ b/tool_collections/samtools/samtools_flagstat/samtools_flagstat.xml
@@ -0,0 +1,51 @@
+
+ tabulate descriptive stats for BAM datset
+
+
+ macros.xml
+
+
+
+
+
+ '$output1'
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+**What it does**
+
+Uses ``samtools flagstat`` command to print descriptive information for a BAM dataset. Here is an example of such information::
+
+ 200 + 0 in total (QC-passed reads + QC-failed reads)
+ 0 + 0 secondary
+ 0 + 0 supplementary
+ 0 + 0 duplicates
+ 25 + 0 mapped (12.50%:nan%)
+ 200 + 0 paired in sequencing
+ 100 + 0 read1
+ 100 + 0 read2
+ 0 + 0 properly paired (0.00%:nan%)
+ 0 + 0 with itself and mate mapped
+ 25 + 0 singletons (12.50%:nan%)
+ 0 + 0 with mate mapped to a different chr
+ 0 + 0 with mate mapped to a different chr (mapQ>=5)
+
+
+
+
diff --git a/tool_collections/samtools/samtools_flagstat/test-data/samtools_flagstat_input1.bam b/tool_collections/samtools/samtools_flagstat/test-data/samtools_flagstat_input1.bam
new file mode 100644
index 00000000000..665d3e03028
Binary files /dev/null and b/tool_collections/samtools/samtools_flagstat/test-data/samtools_flagstat_input1.bam differ
diff --git a/tool_collections/samtools/samtools_flagstat/test-data/samtools_flagstat_out1.txt b/tool_collections/samtools/samtools_flagstat/test-data/samtools_flagstat_out1.txt
new file mode 100644
index 00000000000..f423a1614d9
--- /dev/null
+++ b/tool_collections/samtools/samtools_flagstat/test-data/samtools_flagstat_out1.txt
@@ -0,0 +1,13 @@
+200 + 0 in total (QC-passed reads + QC-failed reads)
+0 + 0 secondary
+0 + 0 supplementary
+0 + 0 duplicates
+25 + 0 mapped (12.50% : N/A)
+200 + 0 paired in sequencing
+100 + 0 read1
+100 + 0 read2
+0 + 0 properly paired (0.00% : N/A)
+0 + 0 with itself and mate mapped
+25 + 0 singletons (12.50% : N/A)
+0 + 0 with mate mapped to a different chr
+0 + 0 with mate mapped to a different chr (mapQ>=5)
diff --git a/tool_collections/samtools/samtools_idxstats/.shed.yml b/tool_collections/samtools/samtools_idxstats/.shed.yml
new file mode 100644
index 00000000000..445ef516ffc
--- /dev/null
+++ b/tool_collections/samtools/samtools_idxstats/.shed.yml
@@ -0,0 +1,12 @@
+categories:
+- SAM
+- Statistics
+description: BAM mapping statistics (using samtools idxstats)
+long_description: |
+ Runs "samtools idxstats" to get mapping statistics from a sorted
+ index BAM file as a tabular output file.
+name: samtools_idxstats
+owner: devteam
+homepage_url: http://www.htslib.org/
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/samtools/samtools_idxstats
+type: unrestricted
diff --git a/tool_collections/samtools/samtools_idxstats/macros.xml b/tool_collections/samtools/samtools_idxstats/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/samtools/samtools_idxstats/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/samtools/samtools_idxstats/samtools_idxstats.xml b/tool_collections/samtools/samtools_idxstats/samtools_idxstats.xml
new file mode 100644
index 00000000000..5297ad61b0f
--- /dev/null
+++ b/tool_collections/samtools/samtools_idxstats/samtools_idxstats.xml
@@ -0,0 +1,66 @@
+
+ reports stats of the BAM index file
+
+ macros.xml
+
+
+
+
+ '${output}'
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+ `_ for the samtools idxstats command
+]]>
+
+
+
diff --git a/tool_collections/samtools/samtools_idxstats/test-data/phiX.bam b/tool_collections/samtools/samtools_idxstats/test-data/phiX.bam
new file mode 100644
index 00000000000..e7c52270025
Binary files /dev/null and b/tool_collections/samtools/samtools_idxstats/test-data/phiX.bam differ
diff --git a/tool_collections/samtools/samtools_idxstats/test-data/samtools_idxstats_out.tabular b/tool_collections/samtools/samtools_idxstats/test-data/samtools_idxstats_out.tabular
new file mode 100644
index 00000000000..f51b037ac21
--- /dev/null
+++ b/tool_collections/samtools/samtools_idxstats/test-data/samtools_idxstats_out.tabular
@@ -0,0 +1,2 @@
+phiX174 5386 972 18
+* 0 0 10
diff --git a/tool_collections/samtools/samtools_mpileup/.shed.yml b/tool_collections/samtools/samtools_mpileup/.shed.yml
new file mode 100644
index 00000000000..513cf55e3d3
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/.shed.yml
@@ -0,0 +1,13 @@
+categories:
+- SAM
+- Variant Analysis
+description: MPileup SNP and indel caller
+long_description: |
+ Generate BCF or pileup for one or multiple BAM files. Alignment
+ records are grouped by sample identifiers in @RG header lines. If sample identifiers
+ are absent, each input file is regarded as one sample.
+name: samtools_mpileup
+owner: devteam
+homepage_url: http://www.htslib.org/
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/samtools/samtools_mpileup
+type: unrestricted
diff --git a/tool_collections/samtools/samtools_mpileup/macros.xml b/tool_collections/samtools/samtools_mpileup/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/samtools/samtools_mpileup/samtools_mpileup.xml b/tool_collections/samtools/samtools_mpileup/samtools_mpileup.xml
new file mode 100644
index 00000000000..9d703fd5a8f
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/samtools_mpileup.xml
@@ -0,0 +1,386 @@
+
+ multi-way pileup of variants
+
+ macros.xml
+
+
+
+
+ "$output_log"
+ ]]>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/phiX.bam b/tool_collections/samtools/samtools_mpileup/test-data/phiX.bam
new file mode 100644
index 00000000000..1375552cf36
Binary files /dev/null and b/tool_collections/samtools/samtools_mpileup/test-data/phiX.bam differ
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/phiX.fasta b/tool_collections/samtools/samtools_mpileup/test-data/phiX.fasta
new file mode 100644
index 00000000000..53df885dc48
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/test-data/phiX.fasta
@@ -0,0 +1,79 @@
+>phiX174
+GAGTTTTATCGCTTCCATGACGCAGAAGTTAACACTTTCGGATATTTCTGATGAGTCGAAAAATTATCTT
+GATAAAGCAGGAATTACTACTGCTTGTTTACGAATTAAATCGAAGTGGACTGCTGGCGGAAAATGAGAAA
+ATTCGACCTATCCTTGCGCAGCTCGAGAAGCTCTTACTTTGCGACCTTTCGCCATCAACTAACGATTCTG
+TCAAAAACTGACGCGTTGGATGAGGAGAAGTGGCTTAATATGCTTGGCACGTTCGTCAAGGACTGGTTTA
+GATATGAGTCACATTTTGTTCATGGTAGAGATTCTCTTGTTGACATTTTAAAAGAGCGTGGATTACTATC
+TGAGTCCGATGCTGTTCAACCACTAATAGGTAAGAAATCATGAGTCAAGTTACTGAACAATCCGTACGTT
+TCCAGACCGCTTTGGCCTCTATTAAGCTCATTCAGGCTTCTGCCGTTTTGGATTTAACCGAAGATGATTT
+CGATTTTCTGACGAGTAACAAAGTTTGGATTGCTACTGACCGCTCTCGTGCTCGTCGCTGCGTTGAGGCT
+TGCGTTTATGGTACGCTGGACTTTGTGGGATACCCTCGCTTTCCTGCTCCTGTTGAGTTTATTGCTGCCG
+TCATTGCTTATTATGTTCATCCCGTCAACATTCAAACGGCCTGTCTCATCATGGAAGGCGCTGAATTTAC
+GGAAAACATTATTAATGGCGTCGAGCGTCCGGTTAAAGCCGCTGAATTGTTCGCGTTTACCTTGCGTGTA
+CGCGCAGGAAACACTGACGTTCTTACTGACGCAGAAGAAAACGTGCGTCAAAAATTACGTGCAGAAGGAG
+TGATGTAATGTCTAAAGGTAAAAAACGTTCTGGCGCTCGCCCTGGTCGTCCGCAGCCGTTGCGAGGTACT
+AAAGGCAAGCGTAAAGGCGCTCGTCTTTGGTATGTAGGTGGTCAACAATTTTAATTGCAGGGGCTTCGGC
+CCCTTACTTGAGGATAAATTATGTCTAATATTCAAACTGGCGCCGAGCGTATGCCGCATGACCTTTCCCA
+TCTTGGCTTCCTTGCTGGTCAGATTGGTCGTCTTATTACCATTTCAACTACTCCGGTTATCGCTGGCGAC
+TCCTTCGAGATGGACGCCGTTGGCGCTCTCCGTCTTTCTCCATTGCGTCGTGGCCTTGCTATTGACTCTA
+CTGTAGACATTTTTACTTTTTATGTCCCTCATCGTCACGTTTATGGTGAACAGTGGATTAAGTTCATGAA
+GGATGGTGTTAATGCCACTCCTCTCCCGACTGTTAACACTACTGGTTATATTGACCATGCCGCTTTTCTT
+GGCACGATTAACCCTGATACCAATAAAATCCCTAAGCATTTGTTTCAGGGTTATTTGAATATCTATAACA
+ACTATTTTAAAGCGCCGTGGATGCCTGACCGTACCGAGGCTAACCCTAATGAGCTTAATCAAGATGATGC
+TCGTTATGGTTTCCGTTGCTGCCATCTCAAAAACATTTGGACTGCTCCGCTTCCTCCTGAGACTGAGCTT
+TCTCGCCAAATGACGACTTCTACCACATCTATTGACATTATGGGTCTGCAAGCTGCTTATGCTAATTTGC
+ATACTGACCAAGAACGTGATTACTTCATGCAGCGTTACCGTGATGTTATTTCTTCATTTGGAGGTAAAAC
+CTCTTATGACGCTGACAACCGTCCTTTACTTGTCATGCGCTCTAATCTCTGGGCATCTGGCTATGATGTT
+GATGGAACTGACCAAACGTCGTTAGGCCAGTTTTCTGGTCGTGTTCAACAGACCTATAAACATTCTGTGC
+CGCGTTTCTTTGTTCCTGAGCATGGCACTATGTTTACTCTTGCGCTTGTTCGTTTTCCGCCTACTGCGAC
+TAAAGAGATTCAGTACCTTAACGCTAAAGGTGCTTTGACTTATACCGATATTGCTGGCGACCCTGTTTTG
+TATGGCAACTTGCCGCCGCGTGAAATTTCTATGAAGGATGTTTTCCGTTCTGGTGATTCGTCTAAGAAGT
+TTAAGATTGCTGAGGGTCAGTGGTATCGTTATGCGCCTTCGTATGTTTCTCCTGCTTATCACCTTCTTGA
+AGGCTTCCCATTCATTCAGGAACCGCCTTCTGGTGATTTGCAAGAACGCGTACTTATTCGCCACCATGAT
+TATGACCAGTGTTTCCAGTCCGTTCAGTTGTTGCAGTGGAATAGTCAGGTTAAATTTAATGTGACCGTTT
+ATCGCAATCTGCCGACCACTCGCGATTCAATCATGACTTCGTGATAAAAGATTGAGTGTGAGGTTATAAC
+GCCGAAGCGGTAAAAATTTTAATTTTTGCCGCTGAGGGGTTGACCAAGCGAAGCGCGGTAGGTTTTCTGC
+TTAGGAGTTTAATCATGTTTCAGACTTTTATTTCTCGCCATAATTCAAACTTTTTTTCTGATAAGCTGGT
+TCTCACTTCTGTTACTCCAGCTTCTTCGGCACCTGTTTTACAGACACCTAAAGCTACATCGTCAACGTTA
+TATTTTGATAGTTTGACGGTTAATGCTGGTAATGGTGGTTTTCTTCATTGCATTCAGATGGATACATCTG
+TCAACGCCGCTAATCAGGTTGTTTCTGTTGGTGCTGATATTGCTTTTGATGCCGACCCTAAATTTTTTGC
+CTGTTTGGTTCGCTTTGAGTCTTCTTCGGTTCCGACTACCCTCCCGACTGCCTATGATGTTTATCCTTTG
+AATGGTCGCCATGATGGTGGTTATTATACCGTCAAGGACTGTGTGACTATTGACGTCCTTCCCCGTACGC
+CGGGCAATAATGTTTATGTTGGTTTCATGGTTTGGTCTAACTTTACCGCTACTAAATGCCGCGGATTGGT
+TTCGCTGAATCAGGTTATTAAAGAGATTATTTGTCTCCAGCCACTTAAGTGAGGTGATTTATGTTTGGTG
+CTATTGCTGGCGGTATTGCTTCTGCTCTTGCTGGTGGCGCCATGTCTAAATTGTTTGGAGGCGGTCAAAA
+AGCCGCCTCCGGTGGCATTCAAGGTGATGTGCTTGCTACCGATAACAATACTGTAGGCATGGGTGATGCT
+GGTATTAAATCTGCCATTCAAGGCTCTAATGTTCCTAACCCTGATGAGGCCGCCCCTAGTTTTGTTTCTG
+GTGCTATGGCTAAAGCTGGTAAAGGACTTCTTGAAGGTACGTTGCAGGCTGGCACTTCTGCCGTTTCTGA
+TAAGTTGCTTGATTTGGTTGGACTTGGTGGCAAGTCTGCCGCTGATAAAGGAAAGGATACTCGTGATTAT
+CTTGCTGCTGCATTTCCTGAGCTTAATGCTTGGGAGCGTGCTGGTGCTGATGCTTCCTCTGCTGGTATGG
+TTGACGCCGGATTTGAGAATCAAAAAGAGCTTACTAAAATGCAACTGGACAATCAGAAAGAGATTGCCGA
+GATGCAAAATGAGACTCAAAAAGAGATTGCTGGCATTCAGTCGGCGACTTCACGCCAGAATACGAAAGAC
+CAGGTATATGCACAAAATGAGATGCTTGCTTATCAACAGAAGGAGTCTACTGCTCGCGTTGCGTCTATTA
+TGGAAAACACCAATCTTTCCAAGCAACAGCAGGTTTCCGAGATTATGCGCCAAATGCTTACTCAAGCTCA
+AACGGCTGGTCAGTATTTTACCAATGACCAAATCAAAGAAATGACTCGCAAGGTTAGTGCTGAGGTTGAC
+TTAGTTCATCAGCAAACGCAGAATCAGCGGTATGGCTCTTCTCATATTGGCGCTACTGCAAAGGATATTT
+CTAATGTCGTCACTGATGCTGCTTCTGGTGTGGTTGATATTTTTCATGGTATTGATAAAGCTGTTGCCGA
+TACTTGGAACAATTTCTGGAAAGACGGTAAAGCTGATGGTATTGGCTCTAATTTGTCTAGGAAATAACCG
+TCAGGATTGACACCCTCCCAATTGTATGTTTTCATGCCTCCAAATCTTGGAGGCTTTTTTATGGTTCGTT
+CTTATTACCCTTCTGAATGTCACGCTGATTATTTTGACTTTGAGCGTATCGAGGCTCTTAAACCTGCTAT
+TGAGGCTTGTGGCATTTCTACTCTTTCTCAATCCCCAATGCTTGGCTTCCATAAGCAGATGGATAACCGC
+ATCAAGCTCTTGGAAGAGATTCTGTCTTTTCGTATGCAGGGCGTTGAGTTCGATAATGGTGATATGTATG
+TTGACGGCCATAAGGCTGCTTCTGACGTTCGTGATGAGTTTGTATCTGTTACTGAGAAGTTAATGGATGA
+ATTGGCACAATGCTACAATGTGCTCCCCCAACTTGATATTAATAACACTATAGACCACCGCCCCGAAGGG
+GACGAAAAATGGTTTTTAGAGAACGAGAAGACGGTTACGCAGTTTTGCCGCAAGCTGGCTGCTGAACGCC
+CTCTTAAGGATATTCGCGATGAGTATAATTACCCCAAAAAGAAAGGTATTAAGGATGAGTGTTCAAGATT
+GCTGGAGGCCTCCACTATGAAATCGCGTAGAGGCTTTACTATTCAGCGTTTGATGAATGCAATGCGACAG
+GCTCATGCTGATGGTTGGTTTATCGTTTTTGACACTCTCACGTTGGCTGACGACCGATTAGAGGCGTTTT
+ATGATAATCCCAATGCTTTGCGTGACTATTTTCGTGATATTGGTCGTATGGTTCTTGCTGCCGAGGGTCG
+CAAGGCTAATGATTCACACGCCGACTGCTATCAGTATTTTTGTGTGCCTGAGTATGGTACAGCTAATGGC
+CGTCTTCATTTCCATGCGGTGCATTTTATGCGGACACTTCCTACAGGTAGCGTTGACCCTAATTTTGGTC
+GTCGGGTACGCAATCGCCGCCAGTTAAATAGCTTGCAAAATACGTGGCCTTATGGTTACAGTATGCCCAT
+CGCAGTTCGCTACACGCAGGACGCTTTTTCACGTTCTGGTTGGTTGTGGCCTGTTGATGCTAAAGGTGAG
+CCGCTTAAAGCTACCAGTTATATGGCTGTTGGTTTCTATGTGGCTAAATACGTTAACAAAAAGTCAGATA
+TGGACCTTGCTGCTAAAGGTCTAGGAGCTAAAGAATGGAACAACTCACTAAAAACCAAGCTGTCGCTACT
+TCCCAAGAAGCTGTTCAGAATCAGAATGAGCCGCAACTTCGGGATGAAAATGCTCACAATGACAAATCTG
+TCCACGGAGTGCTTAATCCAACTTACCAAGCTGGGTTACGACGCGACGCCGTTCAACCAGATATTGAAGC
+AGAACGCAAAAAGAGAGATGAGATTGAGGCTGGGAAAAGTTACTGTAGCCGACGTTTTGGCGGCGCAACC
+TGTGACGACAAATCTGCTCAAATTTATGCGCGCTTCGATAAAAATGATTGGCGTATCCAACCTGCA
+
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/phiX_1.bam b/tool_collections/samtools/samtools_mpileup/test-data/phiX_1.bam
new file mode 100644
index 00000000000..a1aae19efa4
Binary files /dev/null and b/tool_collections/samtools/samtools_mpileup/test-data/phiX_1.bam differ
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_in_1.bam b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_in_1.bam
new file mode 100644
index 00000000000..07a05ba9d4e
Binary files /dev/null and b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_in_1.bam differ
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_1.log.txt b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_1.log.txt
new file mode 100644
index 00000000000..43ec7ed1b9d
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_1.log.txt
@@ -0,0 +1,3 @@
+[fai_load] build FASTA index.
+[mpileup] 1 samples in 1 input files
+ Set max per-file depth to 8000
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_1.pileup b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_1.pileup
new file mode 100644
index 00000000000..fad9cab45f6
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_1.pileup
@@ -0,0 +1,43 @@
+phiX174 1411 A 0
+phiX174 1412 G 0
+phiX174 1413 C 0
+phiX174 1414 G 0
+phiX174 1415 C 0
+phiX174 1416 C 0
+phiX174 1417 G 0
+phiX174 1418 T 0
+phiX174 1419 G 0
+phiX174 1420 G 0
+phiX174 1421 A 0
+phiX174 1422 T 0
+phiX174 1423 G 0
+phiX174 1424 C 0
+phiX174 1425 C 0
+phiX174 1426 T 0
+phiX174 1427 G 0
+phiX174 1428 A 0
+phiX174 1429 C 0
+phiX174 1430 C 0
+phiX174 1431 G 0
+phiX174 1432 T 0
+phiX174 1433 A 0
+phiX174 1434 C 0
+phiX174 1435 C 0
+phiX174 1436 G 0
+phiX174 1437 A 0
+phiX174 1438 G 0
+phiX174 1439 G 0
+phiX174 1440 C 0
+phiX174 1441 T 0
+phiX174 1442 A 0
+phiX174 1443 A 0
+phiX174 1444 C 0
+phiX174 1445 C 0
+phiX174 1446 C 0
+phiX174 1447 T 0
+phiX174 1448 A 0
+phiX174 1449 A 0
+phiX174 1450 T 0
+phiX174 1451 G 0
+phiX174 1452 A 0
+phiX174 1453 G 0
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_2.log.txt b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_2.log.txt
new file mode 100644
index 00000000000..43ec7ed1b9d
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_2.log.txt
@@ -0,0 +1,3 @@
+[fai_load] build FASTA index.
+[mpileup] 1 samples in 1 input files
+ Set max per-file depth to 8000
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_2.vcf b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_2.vcf
new file mode 100644
index 00000000000..5a1dd7641ba
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_2.vcf
@@ -0,0 +1,22 @@
+##fileformat=VCFv4.2
+##FILTER=
+##samtoolsVersion=1.1+htslib-1.1
+##samtoolsCommand=samtools mpileup --VCF --uncompressed -f /tmp/tmpId8vOP/tmp3bubIE/database/files/000/dataset_735.dat -g -e 20 -h 100 -L 250 -m 1 --open-prob 40 -F 0.002 -e 40 --output /tmp/tmpId8vOP/tmp3bubIE/database/files/000/dataset_736.dat /tmp/tmpId8vOP/tmp3bubIE/database/files/000/dataset_734.dat
+##reference=file:///tmp/tmpId8vOP/tmp3bubIE/database/files/000/dataset_735.dat
+##contig=
+##ALT=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FORMAT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT localbam_0.bam
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_3.pileup b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_3.pileup
new file mode 100644
index 00000000000..f494a27538d
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_3.pileup
@@ -0,0 +1,43 @@
+phiX174 1411 A 1 ^P. $ P 1
+phiX174 1412 G 3 .^D.^F. "$$ PDF 2,1,1
+phiX174 1413 C 5 ...^D.^F. """$$ PDFDF 3,2,2,1,1
+phiX174 1414 G 6 .....^F. #####$ PDFDFF 4,3,3,2,2,1
+phiX174 1415 C 7 ......^F. %%%%%%& PDFDFFF 5,4,4,3,3,2,1
+phiX174 1416 C 8 .......^F. $$$$$$$$ PDFDFFFF 6,5,5,4,4,3,2,1
+phiX174 1417 G 9 ........^F. "#######$ PDFDFFFFF 7,6,6,5,5,4,3,2,1
+phiX174 1418 T 10 .........^F. """""""""$ PDFDFFFFFF 8,7,7,6,6,5,4,3,2,1
+phiX174 1419 G 10 .......... """""'&'%$ PDFDFFFFFF 9,8,8,7,7,6,5,4,3,2
+phiX174 1420 G 10 .......... """""""""" PDFDFFFFFF 10,9,9,8,8,7,6,5,4,3
+phiX174 1421 A 10 .......... """""""""" PDFDFFFFFF 11,10,10,9,9,8,7,6,5,4
+phiX174 1422 T 10 .......... """""""""" PDFDFFFFFF 12,11,11,10,10,9,8,7,6,5
+phiX174 1423 G 10 .......... """""""""# PDFDFFFFFF 13,12,12,11,11,10,9,8,7,6
+phiX174 1424 C 10 ..A.AAAAAA %""""""""" PDFDFFFFFF 14,13,13,12,12,11,10,9,8,7
+phiX174 1425 C 10 .......... $$$""""""" PDFDFFFFFF 15,14,14,13,13,12,11,10,9,8
+phiX174 1426 T 10 .......... #####""""" PDFDFFFFFF 16,15,15,14,14,13,12,11,10,9
+phiX174 1427 G 10 .......... ######"""" PDFDFFFFFF 17,16,16,15,15,14,13,12,11,10
+phiX174 1428 A 10 .......... """""""""" PDFDFFFFFF 18,17,17,16,16,15,14,13,12,11
+phiX174 1429 C 10 .......... ((((((&("" PDFDFFFFFF 19,18,18,17,17,16,15,14,13,12
+phiX174 1430 C 10 .......... $$$$$$$$$" PDFDFFFFFF 20,19,19,18,18,17,16,15,14,13
+phiX174 1431 G 10 .......... ########## PDFDFFFFFF 21,20,20,19,19,18,17,16,15,14
+phiX174 1432 T 10 .......... """""""""" PDFDFFFFFF 22,21,21,20,20,19,18,17,16,15
+phiX174 1433 A 10 .......... ########## PDFDFFFFFF 23,22,22,21,21,20,19,18,17,16
+phiX174 1434 C 10 .......... ((((((&(%$ PDFDFFFFFF 24,23,23,22,22,21,20,19,18,17
+phiX174 1435 C 10 .......... $$$$$$$$$$ PDFDFFFFFF 25,24,24,23,23,22,21,20,19,18
+phiX174 1436 G 10 .......... ########## PDFDFFFFFF 26,25,25,24,24,23,22,21,20,19
+phiX174 1437 A 10 .......... """""""""! PDFDFFFFFF 27,26,26,25,25,24,23,22,21,20
+phiX174 1438 G 10 .......... """""####! PDFDFFFFFF 28,27,27,26,26,25,24,23,22,21
+phiX174 1439 G 10 .......... """""""""! PDFDFFFFFF 29,28,28,27,27,26,25,24,23,22
+phiX174 1440 C 10 .......... """""""""! PDFDFFFFFF 30,29,29,28,28,27,26,25,24,23
+phiX174 1441 T 10 .......... """"""""#! PDFDFFFFFF 31,30,30,29,29,28,27,26,25,24
+phiX174 1442 A 10 .......... $$$%%%&&%! PDFDFFFFFF 32,31,31,30,30,29,28,27,26,25
+phiX174 1443 A 10 .-1C.-1C..-1C...... """""""""! PDFDFFFFFF 33,32,32,31,31,30,29,28,27,26
+phiX174 1444 C 10 **.*...... !!"!"""""! PDFDFFFFFF 34,33,33,32,32,31,30,29,28,27
+phiX174 1445 C 10 .......... !!&!%%%&%! PDFDFFFFFF 34,33,34,32,33,32,31,30,29,28
+phiX174 1446 C 10 .......... !!"!"""""! PDFDFFFFFF 35,34,35,33,34,33,32,31,30,29
+phiX174 1447 T 10 .$..$....... !!#!"""""! PDFDFFFFFF 36,35,36,34,35,34,33,32,31,30
+phiX174 1448 A 8 .$..$..... !!#%%$$! DDFFFFFF 36,35,36,35,34,33,32,31
+phiX174 1449 A 6 .$.$.... !""""! DFFFFF 36,36,35,34,33,32
+phiX174 1450 T 4 .$... """! FFFF 36,35,34,33
+phiX174 1451 G 3 .$.. #"! FFF 36,35,34
+phiX174 1452 A 2 .$. "! FF 36,35
+phiX174 1453 G 1 .$ ! F 36
diff --git a/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_4.vcf b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_4.vcf
new file mode 100644
index 00000000000..ddc2d30ad2e
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/test-data/samtools_mpileup_out_4.vcf
@@ -0,0 +1,22 @@
+##fileformat=VCFv4.2
+##FILTER=
+##samtoolsVersion=1.1+htslib-1.1
+##samtoolsCommand=samtools mpileup --VCF --uncompressed -f /var/galaxy/workspace/CleanGalaxy/tool-data/phiX/sam_indexes/phiX/phiX.fa -C 0 -d 200 -E -q 0 -Q 43 -g -e 20 -h 100 -L 250 -m 1 --open-prob 40 -F 0.002 -e 40 --output /tmp/tmp5TzrZC/tmpDtGVov/database/files/000/dataset_756.dat /tmp/tmp5TzrZC/tmpDtGVov/database/files/000/dataset_755.dat
+##reference=file:///var/galaxy/workspace/CleanGalaxy/tool-data/phiX/sam_indexes/phiX/phiX.fa
+##contig=
+##ALT=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FORMAT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT /tmp/tmp5TzrZC/tmpDtGVov/database/files/000/dataset_755.dat
diff --git a/tool_collections/samtools/samtools_mpileup/tool-data/fasta_indexes.loc.sample b/tool_collections/samtools/samtools_mpileup/tool-data/fasta_indexes.loc.sample
new file mode 100644
index 00000000000..fd8fb70ac28
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/tool-data/fasta_indexes.loc.sample
@@ -0,0 +1,29 @@
+#This is a sample file distributed with Galaxy that enables tools
+#to use a directory of Samtools indexed sequences data files. You will need
+#to create these data files and then create a fasta_indexes.loc file
+#similar to this one (store it in this directory) that points to
+#the directories in which those files are stored. The fasta_indexes.loc
+#file has this format (white space characters are TAB characters):
+#
+#
+#
+#So, for example, if you had hg19 Canonical indexed stored in
+#
+# /depot/data2/galaxy/hg19/sam/,
+#
+#then the fasta_indexes.loc entry would look like this:
+#
+#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /depot/data2/galaxy/hg19/sam/hg19canon.fa
+#
+#and your /depot/data2/galaxy/hg19/sam/ directory
+#would contain hg19canon.fa and hg19canon.fa.fai files.
+#
+#Your fasta_indexes.loc file should include an entry per line for
+#each index set you have stored. The file in the path does actually
+#exist, but it should never be directly used. Instead, the name serves
+#as a prefix for the index file. For example:
+#
+#hg18canon hg18 Human (Homo sapiens): hg18 Canonical /depot/data2/galaxy/hg18/sam/hg18canon.fa
+#hg18full hg18 Human (Homo sapiens): hg18 Full /depot/data2/galaxy/hg18/sam/hg18full.fa
+#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /depot/data2/galaxy/hg19/sam/hg19canon.fa
+#hg19full hg19 Human (Homo sapiens): hg19 Full /depot/data2/galaxy/hg19/sam/hg19full.fa
diff --git a/tool_collections/samtools/samtools_mpileup/tool_data_table_conf.xml.sample b/tool_collections/samtools/samtools_mpileup/tool_data_table_conf.xml.sample
new file mode 100644
index 00000000000..c34ade01890
--- /dev/null
+++ b/tool_collections/samtools/samtools_mpileup/tool_data_table_conf.xml.sample
@@ -0,0 +1,7 @@
+
+
+
+
diff --git a/tool_collections/vcflib/LICENSE b/tool_collections/vcflib/LICENSE
new file mode 100644
index 00000000000..d176cde6c50
--- /dev/null
+++ b/tool_collections/vcflib/LICENSE
@@ -0,0 +1,21 @@
+The MIT License (MIT)
+
+Copyright (c) 2014 Anton Nekrutenko
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
\ No newline at end of file
diff --git a/tool_collections/vcflib/README.rst b/tool_collections/vcflib/README.rst
new file mode 100644
index 00000000000..d80c24d0e94
--- /dev/null
+++ b/tool_collections/vcflib/README.rst
@@ -0,0 +1,9 @@
+vcflib-tools
+============
+
+A collection of Galaxy wrappers for vcflib utilities. This is a
+development copy of the tools that exist in the Galaxy Tool Shed
+(http://usegalaxy.org/toolshed).
+
+Because vcflib can only be reliably compiled on Linux platforms at this
+time these wrappers make this assumption.
diff --git a/tool_collections/vcflib/macros.xml b/tool_collections/vcflib/macros.xml
new file mode 100644
index 00000000000..f5de643d57c
--- /dev/null
+++ b/tool_collections/vcflib/macros.xml
@@ -0,0 +1,29 @@
+
+
+
+ vcflib
+
+
+
+
+
+
+
+
+ 1.0.0_rc1
+
+
+
+@misc{Garrison2015,
+ author = {Garrison, Erik},
+ year = {2015},
+ title = {vcflib},
+ publisher = {GitHub},
+ journal = {GitHub repository},
+ url = {https://github.com/ekg/vcflib},
+}
+
+
+
+ is a part of VCFlib toolkit developed by Erik Garrison (https://github.com/ekg/vcflib).
+
diff --git a/tool_collections/vcflib/tool-data/fasta_indexes.loc.sample b/tool_collections/vcflib/tool-data/fasta_indexes.loc.sample
new file mode 100644
index 00000000000..fd8fb70ac28
--- /dev/null
+++ b/tool_collections/vcflib/tool-data/fasta_indexes.loc.sample
@@ -0,0 +1,29 @@
+#This is a sample file distributed with Galaxy that enables tools
+#to use a directory of Samtools indexed sequences data files. You will need
+#to create these data files and then create a fasta_indexes.loc file
+#similar to this one (store it in this directory) that points to
+#the directories in which those files are stored. The fasta_indexes.loc
+#file has this format (white space characters are TAB characters):
+#
+#
+#
+#So, for example, if you had hg19 Canonical indexed stored in
+#
+# /depot/data2/galaxy/hg19/sam/,
+#
+#then the fasta_indexes.loc entry would look like this:
+#
+#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /depot/data2/galaxy/hg19/sam/hg19canon.fa
+#
+#and your /depot/data2/galaxy/hg19/sam/ directory
+#would contain hg19canon.fa and hg19canon.fa.fai files.
+#
+#Your fasta_indexes.loc file should include an entry per line for
+#each index set you have stored. The file in the path does actually
+#exist, but it should never be directly used. Instead, the name serves
+#as a prefix for the index file. For example:
+#
+#hg18canon hg18 Human (Homo sapiens): hg18 Canonical /depot/data2/galaxy/hg18/sam/hg18canon.fa
+#hg18full hg18 Human (Homo sapiens): hg18 Full /depot/data2/galaxy/hg18/sam/hg18full.fa
+#hg19canon hg19 Human (Homo sapiens): hg19 Canonical /depot/data2/galaxy/hg19/sam/hg19canon.fa
+#hg19full hg19 Human (Homo sapiens): hg19 Full /depot/data2/galaxy/hg19/sam/hg19full.fa
diff --git a/tool_collections/vcflib/tool-data/tool_data_table_conf.xml.sample b/tool_collections/vcflib/tool-data/tool_data_table_conf.xml.sample
new file mode 100644
index 00000000000..9111f18a24b
--- /dev/null
+++ b/tool_collections/vcflib/tool-data/tool_data_table_conf.xml.sample
@@ -0,0 +1,8 @@
+
+
+
+
+ value, dbkey, name, path
+
+
+
diff --git a/tool_collections/vcflib/vcf2tsv/.shed.yml b/tool_collections/vcflib/vcf2tsv/.shed.yml
new file mode 100644
index 00000000000..46766cec405
--- /dev/null
+++ b/tool_collections/vcflib/vcf2tsv/.shed.yml
@@ -0,0 +1,14 @@
+categories:
+- Variant Analysis
+- Convert Formats
+description: Converts VCF files into tab-delimited format
+homepage_url: https://github.com/ekg/vcflib
+long_description: |
+ Converts stdin or given VCF file to tab-delimited format, using
+ null string to replace empty values in the table. Specifying -g will output one
+ line per sample with genotype information. A part of the vcflib utilities developed
+ by Erik Garrison (https://github.com/ekg/vcflib).
+name: vcf2tsv
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/vcflib/vcf2tsv
+type: unrestricted
diff --git a/tool_collections/vcflib/vcf2tsv/macros.xml b/tool_collections/vcflib/vcf2tsv/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/vcflib/vcf2tsv/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/vcflib/vcf2tsv/test-data/vcf2tsv-test1.tab b/tool_collections/vcflib/vcf2tsv/test-data/vcf2tsv-test1.tab
new file mode 100644
index 00000000000..30a213d7fe2
--- /dev/null
+++ b/tool_collections/vcflib/vcf2tsv/test-data/vcf2tsv-test1.tab
@@ -0,0 +1,28 @@
+CHROM POS ID REF ALT QUAL FILTER AA AC AF AN DP NS DB H2 SAMPLE DP GQ GT HQ
+19 111 . A C 9.6 . . . . . . . 0 0 NA00001 . . 0|0 10,10
+19 111 . A C 9.6 . . . . . . . 0 0 NA00002 . . 0|0 10,10
+19 111 . A C 9.6 . . . . . . . 0 0 NA00003 . . 0/1 3,3
+19 112 . A G 10 . . . . . . . 0 0 NA00001 . . 0|0 10,10
+19 112 . A G 10 . . . . . . . 0 0 NA00002 . . 0|0 10,10
+19 112 . A G 10 . . . . . . . 0 0 NA00003 . . 0/1 3,3
+20 14370 rs6054257 G A 29 PASS . . 0.5 . 14 3 1 1 NA00001 1 48 0|0 51,51
+20 14370 rs6054257 G A 29 PASS . . 0.5 . 14 3 1 1 NA00002 8 48 1|0 51,51
+20 14370 rs6054257 G A 29 PASS . . 0.5 . 14 3 1 1 NA00003 5 43 1/1 .,.
+20 17330 . T A 3 q10 . . 0.017 . 11 3 0 0 NA00001 3 49 0|0 58,50
+20 17330 . T A 3 q10 . . 0.017 . 11 3 0 0 NA00002 5 3 0|1 65,3
+20 17330 . T A 3 q10 . . 0.017 . 11 3 0 0 NA00003 3 41 0/0 .,.
+20 1110696 rs6040355 A G,T 67 PASS T . 0.333,0.667 . 10 2 1 0 NA00001 6 21 1|2 23,27
+20 1110696 rs6040355 A G,T 67 PASS T . 0.333,0.667 . 10 2 1 0 NA00002 0 2 2|1 18,2
+20 1110696 rs6040355 A G,T 67 PASS T . 0.333,0.667 . 10 2 1 0 NA00003 4 35 2/2 .,.
+20 1230237 . T . 47 PASS T . . . 13 3 0 0 NA00001 . 54 0|0 56,60
+20 1230237 . T . 47 PASS T . . . 13 3 0 0 NA00002 4 48 0|0 51,51
+20 1230237 . T . 47 PASS T . . . 13 3 0 0 NA00003 2 61 0/0 .,.
+20 1234567 microsat1 G GA,GAC 50 PASS G 3,1 . 6 9 3 0 0 NA00001 4 . 0/1 .
+20 1234567 microsat1 G GA,GAC 50 PASS G 3,1 . 6 9 3 0 0 NA00002 2 17 0/2 .
+20 1234567 microsat1 G GA,GAC 50 PASS G 3,1 . 6 9 3 0 0 NA00003 3 40 1/1 .
+20 1235237 . T . 0 . . . . . . . 0 0 NA00001 . . 0/0 .
+20 1235237 . T . 0 . . . . . . . 0 0 NA00002 . . 0|0 .
+20 1235237 . T . 0 . . . . . . . 0 0 NA00003 . . ./. .
+X 10 rsTest AC A,ATG 10 PASS . . . . . . 0 0 NA00001 . . 0 .
+X 10 rsTest AC A,ATG 10 PASS . . . . . . 0 0 NA00002 . . 0/1 .
+X 10 rsTest AC A,ATG 10 PASS . . . . . . 0 0 NA00003 . . 0|2 .
diff --git a/tool_collections/vcflib/vcf2tsv/test-data/vcf2tsv-test2.tab b/tool_collections/vcflib/vcf2tsv/test-data/vcf2tsv-test2.tab
new file mode 100644
index 00000000000..2169204f24d
--- /dev/null
+++ b/tool_collections/vcflib/vcf2tsv/test-data/vcf2tsv-test2.tab
@@ -0,0 +1,13 @@
+CHROM POS ID REF ALT QUAL FILTER AA AC AF AN DP NS DB H2
+19 111 . A C 9.6 . . . . . . . 0 0
+19 112 . A G 10 . . . . . . . 0 0
+20 14370 rs6054257 G A 29 PASS . . 0.5 . 14 3 1 1
+20 17330 . T A 3 q10 . . 0.017 . 11 3 0 0
+20 1110696 rs6040355 A G 67 PASS T . 0.333 . 10 2 1 0
+20 1110696 rs6040355 A T 67 PASS T . 0.667 . 10 2 1 0
+20 1230237 . T . 47 PASS T . . . 13 3 0 0
+20 1234567 microsat1 G GA 50 PASS G 3 . 6 9 3 0 0
+20 1234567 microsat1 G GAC 50 PASS G 1 . 6 9 3 0 0
+20 1235237 . T . 0 . . . . . . . 0 0
+X 10 rsTest AC A 10 PASS . . . . . . 0 0
+X 10 rsTest AC ATG 10 PASS . . . . . . 0 0
diff --git a/tool_collections/vcflib/vcf2tsv/test-data/vcflib.vcf b/tool_collections/vcflib/vcf2tsv/test-data/vcflib.vcf
new file mode 100644
index 00000000000..e8dd794d9bd
--- /dev/null
+++ b/tool_collections/vcflib/vcf2tsv/test-data/vcflib.vcf
@@ -0,0 +1,31 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3:.,.
+20 1110696 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+20 1230237 . T . 47 PASS NS=3;DP=13;AA=T GT:GQ:DP:HQ 0|0:54:.:56,60 0|0:48:4:51,51 0/0:61:2:.,.
+20 1234567 microsat1 G GA,GAC 50 PASS NS=3;DP=9;AA=G;AN=6;AC=3,1 GT:GQ:DP 0/1:.:4 0/2:17:2 1/1:40:3
+20 1235237 . T . . . . GT 0/0 0|0 ./.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcf2tsv/vcf2tsv.xml b/tool_collections/vcflib/vcf2tsv/vcf2tsv.xml
new file mode 100644
index 00000000000..90aa4c5206e
--- /dev/null
+++ b/tool_collections/vcflib/vcf2tsv/vcf2tsv.xml
@@ -0,0 +1,45 @@
+
+ Convert VCF data into TAB-delimited format
+
+ macros.xml
+
+
+
+ vcf2tsv ${g_option} -n '${null_filler}' '${input}' > '${out_file1}'
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+Converts VCF dataset to tab-delimited format, using null string to replace empty values in the table.
+Specifying "**Report data per sample**" (-g) will output one line per sample with genotype information.
+
+----
+
+Vcf2Tsv @IS_PART_OF_VCFLIB@
+
+
+
diff --git a/tool_collections/vcflib/vcfaddinfo/.shed.yml b/tool_collections/vcflib/vcfaddinfo/.shed.yml
new file mode 100644
index 00000000000..1e25fd2a5c8
--- /dev/null
+++ b/tool_collections/vcflib/vcfaddinfo/.shed.yml
@@ -0,0 +1,11 @@
+categories:
+- Variant Analysis
+description: Adds info fields from the second dataset which are not present in the
+ first dataset.
+homepage_url: https://github.com/ekg/vcflib
+long_description: |
+ Adds info fields from the second dataset which are not present in the first dataset.
+name: vcfaddinfo
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/vcflib/vcfaddinfo
+type: unrestricted
diff --git a/tool_collections/vcflib/vcfaddinfo/macros.xml b/tool_collections/vcflib/vcfaddinfo/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/vcflib/vcfaddinfo/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/vcflib/vcfaddinfo/test-data/vcfaddinfo-input1.vcf b/tool_collections/vcflib/vcfaddinfo/test-data/vcfaddinfo-input1.vcf
new file mode 100644
index 00000000000..f0aa4c60979
--- /dev/null
+++ b/tool_collections/vcflib/vcfaddinfo/test-data/vcfaddinfo-input1.vcf
@@ -0,0 +1,24 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+20 14370 rs6054257 G A 29 PASS NS=3;AF=0.5;DB;H2 GT:GQ:HQ 0|0:48:51,51 1|0:48:51,51 1/1:43:.,.
+20 17330 . T A 3 q10 NS=3;AF=0.017 GT:GQ:HQ 0|0:49:58,50 0|1:3:65,3 0/0:41:.,.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcfaddinfo/test-data/vcfaddinfo-test1.vcf b/tool_collections/vcflib/vcfaddinfo/test-data/vcfaddinfo-test1.vcf
new file mode 100644
index 00000000000..c782c8750e7
--- /dev/null
+++ b/tool_collections/vcflib/vcfaddinfo/test-data/vcfaddinfo-test1.vcf
@@ -0,0 +1,25 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+##INFO=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+20 14370 rs6054257 G A 29 PASS AF=0.5;DP=14;NS=3;DB;H2 GT:GQ:HQ 0|0:48:51,51 1|0:48:51,51 1/1:43:.,.
+20 17330 . T A 3 q10 AF=0.017;DP=11;NS=3 GT:GQ:HQ 0|0:49:58,50 0|1:3:65,3 0/0:41:.,.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcfaddinfo/test-data/vcflib.vcf b/tool_collections/vcflib/vcfaddinfo/test-data/vcflib.vcf
new file mode 100644
index 00000000000..e8dd794d9bd
--- /dev/null
+++ b/tool_collections/vcflib/vcfaddinfo/test-data/vcflib.vcf
@@ -0,0 +1,31 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3:.,.
+20 1110696 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+20 1230237 . T . 47 PASS NS=3;DP=13;AA=T GT:GQ:DP:HQ 0|0:54:.:56,60 0|0:48:4:51,51 0/0:61:2:.,.
+20 1234567 microsat1 G GA,GAC 50 PASS NS=3;DP=9;AA=G;AN=6;AC=3,1 GT:GQ:DP 0/1:.:4 0/2:17:2 1/1:40:3
+20 1235237 . T . . . . GT 0/0 0|0 ./.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcfaddinfo/vcfaddinfo.xml b/tool_collections/vcflib/vcfaddinfo/vcfaddinfo.xml
new file mode 100644
index 00000000000..8d4c52912a4
--- /dev/null
+++ b/tool_collections/vcflib/vcfaddinfo/vcfaddinfo.xml
@@ -0,0 +1,32 @@
+
+ Adds info fields from the second dataset which are not present in the first dataset
+
+ macros.xml
+
+
+
+ vcfaddinfo "${input1}" "${input2}" > "${out_file1}"
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+Adds info fields from the second dataset which are not present in the first dataset.
+
+-----
+
+Vcfaddinfo @IS_PART_OF_VCFLIB@
+
+
+
diff --git a/tool_collections/vcflib/vcfallelicprimitives/.shed.yml b/tool_collections/vcflib/vcfallelicprimitives/.shed.yml
new file mode 100644
index 00000000000..0d6ae31de22
--- /dev/null
+++ b/tool_collections/vcflib/vcfallelicprimitives/.shed.yml
@@ -0,0 +1,14 @@
+categories:
+- Variant Analysis
+description: Splits alleleic primitives (gaps or mismatches) into multiple VCF lines
+homepage_url: https://github.com/ekg/vcflib
+long_description: |
+ If multiple alleleic primitives (gaps or mismatches) are specified
+ in a single VCF record, this tools splits the record into multiple lines, but drops
+ all INFO fields. "Pure" MNPs are split into multiple SNPs unless the -m flag is
+ provided. Genotypes are phased where complex alleles have been decomposed, provided
+ genotypes in the input.
+name: vcfallelicprimitives
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/vcflib/vcfallelicprimitives
+type: unrestricted
diff --git a/tool_collections/vcflib/vcfallelicprimitives/macros.xml b/tool_collections/vcflib/vcfallelicprimitives/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/vcflib/vcfallelicprimitives/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/vcflib/vcfallelicprimitives/test-data/vcfallelicprimitives-test1.vcf b/tool_collections/vcflib/vcfallelicprimitives/test-data/vcfallelicprimitives-test1.vcf
new file mode 100644
index 00000000000..21a10473514
--- /dev/null
+++ b/tool_collections/vcflib/vcfallelicprimitives/test-data/vcfallelicprimitives-test1.vcf
@@ -0,0 +1,35 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+##INFO=
+##INFO=
+##INFO=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS AF=0.5;DP=14;NS=3;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 AF=0.017;DP=11;NS=3 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3:.,.
+20 1110696 . A G,T 67 PASS AF=0.333,0.667;LEN=1,1;TYPE=snp,snp;Split primitives GT 1|2 2|1 2|2
+20 1230237 . T . 47 PASS AA=T;DP=13;NS=3 GT:GQ:DP:HQ 0|0:54:.:56,60 0|0:48:4:51,51 0/0:61:2:.,.
+20 1234567 . G GA,GAC 50 PASS AC=3,1;LEN=1,2;TYPE=ins,ins;Split primitives GT 0|1 0|2 1|1
+20 1235237 . T . 0 . . GT 0/0 0|0 ./.
+X 10 . AC ATC,A 10 PASS LEN=1,1;TYPE=ins,del;Split primitives GT 0 0|2 0|1
+X 11 . C G 10 PASS LEN=1;TYPE=snp;Split primitives GT 0 0|0 0|1
diff --git a/tool_collections/vcflib/vcfallelicprimitives/test-data/vcflib.vcf b/tool_collections/vcflib/vcfallelicprimitives/test-data/vcflib.vcf
new file mode 100644
index 00000000000..e8dd794d9bd
--- /dev/null
+++ b/tool_collections/vcflib/vcfallelicprimitives/test-data/vcflib.vcf
@@ -0,0 +1,31 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3:.,.
+20 1110696 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+20 1230237 . T . 47 PASS NS=3;DP=13;AA=T GT:GQ:DP:HQ 0|0:54:.:56,60 0|0:48:4:51,51 0/0:61:2:.,.
+20 1234567 microsat1 G GA,GAC 50 PASS NS=3;DP=9;AA=G;AN=6;AC=3,1 GT:GQ:DP 0/1:.:4 0/2:17:2 1/1:40:3
+20 1235237 . T . . . . GT 0/0 0|0 ./.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcfallelicprimitives/vcfallelicprimitives.xml b/tool_collections/vcflib/vcfallelicprimitives/vcfallelicprimitives.xml
new file mode 100644
index 00000000000..c5498fbbc1a
--- /dev/null
+++ b/tool_collections/vcflib/vcfallelicprimitives/vcfallelicprimitives.xml
@@ -0,0 +1,62 @@
+
+
+ Split alleleic primitives (gaps or mismatches) into multiple VCF lines
+
+ macros.xml
+
+
+
+
+ cat "${input}" | vcfallelicprimitives
+ ${m_option}
+ -t "${t_option}"
+ -L "${max_length}"
+ $keep_info
+ $keep_geno > "${out_file1}"
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+If multiple alleleic primitives (gaps or mismatches) are specified in a single VCF record, this tools splits the record into multiple lines, but drops all INFO fields.
+"Pure" MNPs are split into multiple SNPs unless the -m flag is provided.
+Genotypes are phased where complex alleles have been decomposed, provided genotypes in the input.
+
+The options are::
+
+ -m, --use-mnps Retain MNPs as separate events (default: false).
+ -t, --tag-parsed FLAG Tag records which are split apart of a complex allele with this flag.
+ -L, --max-length LEN Do not manipulate records in which either the ALT or
+ REF is longer than LEN (default: 200).
+ -k, --keep-info Maintain site and allele-level annotations when decomposing.
+ Note that in many cases, such as multisample VCFs, these won't
+ be valid post-decomposition. For biallelic loci in single-sample
+ VCFs, they should be usable with caution.
+ -g, --keep-geno Maintain genotype-level annotations when decomposing. Similar
+ caution should be used for this as for --keep-info.
+
+----
+
+Vcfallelicprimitives @IS_PART_OF_VCFLIB@
+
+
+
diff --git a/tool_collections/vcflib/vcfannotate/.shed.yml b/tool_collections/vcflib/vcfannotate/.shed.yml
new file mode 100644
index 00000000000..40c3537e71b
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotate/.shed.yml
@@ -0,0 +1,11 @@
+categories:
+- Variant Analysis
+description: Intersect VCF records with BED annotations
+homepage_url: https://github.com/ekg/vcflib
+long_description: |
+ Intersects the records in the VCF file with targets provided in
+ a BED file. Intersections are done on the reference sequences in the VCF file.
+name: vcfannotate
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/vcflib/vcfannotate
+type: unrestricted
diff --git a/tool_collections/vcflib/vcfannotate/macros.xml b/tool_collections/vcflib/vcfannotate/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotate/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/vcflib/vcfannotate/test-data/vcfannotate-test1.vcf b/tool_collections/vcflib/vcfannotate/test-data/vcfannotate-test1.vcf
new file mode 100644
index 00000000000..91655f5a2c1
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotate/test-data/vcfannotate-test1.vcf
@@ -0,0 +1,32 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+##INFO=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS AF=0.5;DP=14;NS=3;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 AF=0.017;DP=11;NS=3 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3:.,.
+20 1110696 rs6040355 A G,T 67 PASS AA=T;AF=0.333,0.667;BED-features=uc010zpo.2:uc002wel.4:uc010zpp.2:uc002wen.4;DP=10;NS=2;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+20 1230237 . T . 47 PASS AA=T;DP=13;NS=3 GT:GQ:DP:HQ 0|0:54:.:56,60 0|0:48:4:51,51 0/0:61:2:.,.
+20 1234567 microsat1 G GA,GAC 50 PASS AA=G;AC=3,1;AN=6;DP=9;NS=3 GT:GQ:DP 0/1:.:4 0/2:17:2 1/1:40:3
+20 1235237 . T . 0 . . GT 0/0 0|0 ./.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcfannotate/test-data/vcfannotate.bed b/tool_collections/vcflib/vcfannotate/test-data/vcfannotate.bed
new file mode 100644
index 00000000000..aa9c41604bf
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotate/test-data/vcfannotate.bed
@@ -0,0 +1,46 @@
+20 123251 126392 uc002wcx.3 0 + 123269 126333 0 2 76,337, 0,2804,
+20 138110 139804 uc002wcy.2 0 + 138185 139665 0 2 124,390, 0,1304,
+20 168526 170264 uc002wcz.1 0 - 168526 170264 0 2 233,49, 0,1689,
+20 207898 210527 uc002wda.3 0 + 207929 210412 0 2 89,609, 0,2020,
+20 238376 241736 uc002wdb.3 0 + 238419 239947 0 2 101,2019, 0,1341,
+20 251503 271419 uc021vzl.1 0 - 251847 271244 0 11 405,119,87,102,158,100,159,119,118,49,192, 0,5105,5930,6181,6363,7467,8309,13100,18696,19396,19724,
+20 251503 271419 uc002wde.2 0 - 251847 271251 0 11 405,119,87,102,158,100,159,119,118,49,188, 0,5105,5930,6181,6363,7467,8309,13100,18696,19396,19728,
+20 251503 271419 uc010zpi.2 0 - 251847 270227 0 10 405,119,87,102,158,100,159,119,118,188, 0,5105,5930,6181,6363,7467,8309,13100,18696,19728,
+20 256608 271079 uc010zpj.1 0 - 257400 270981 0 7 912,102,158,100,159,119,180, 0,1076,1258,2362,3204,7995,14291,
+20 256608 271419 uc010zpk.2 0 - 257400 271232 0 7 912,102,158,100,159,119,188, 0,1076,1258,2362,3204,7995,14623,
+20 278203 280963 uc002wdf.3 0 + 278227 279442 0 1 2760, 0,
+20 306214 310872 uc002wdh.4 0 + 306568 307516 0 1 4658, 0,
+20 327369 335512 uc002wdi.4 0 + 330287 334279 0 4 417,115,195,1659, 0,2523,2912,6484,
+20 361307 378203 uc002wdm.3 0 + 368654 377334 0 4 506,291,293,1362, 0,7347,10623,15534,
+20 361940 378203 uc002wdn.3 0 + 363189 377334 0 5 204,194,291,293,1362, 0,1136,6714,9990,14901,
+20 388708 391408 uc002wdo.3 0 + 388708 388708 0 3 715,145,353, 0,1816,2347,
+20 388708 400504 uc010zpl.1 0 + 389401 400429 0 6 715,145,94,199,122,303, 0,1816,9461,9667,11282,11493,
+20 388708 409233 uc010zpm.1 0 + 388708 388708 0 9 715,145,94,199,122,174,112,180,99, 0,1816,9461,9667,11282,11493,14062,19248,20426,
+20 388708 411610 uc002wdp.4 0 + 389401 411074 0 12 715,145,94,199,122,174,161,112,180,99,144,617, 0,1816,9461,9667,11282,11493,12806,14062,19248,20426,20886,22285,
+20 388708 411610 uc002wdq.4 0 + 389382 411074 0 11 715,94,199,122,174,161,112,180,99,144,617, 0,9461,9667,11282,11493,12806,14062,19248,20426,20886,22285,
+20 388708 411610 uc010fzy.3 0 + 388708 388708 0 12 715,145,94,199,122,174,136,112,180,99,144,617, 0,1816,9461,9667,11282,11493,12806,14062,19248,20426,20886,22285,
+20 388708 411610 uc002wdr.4 0 + 398463 411074 0 10 715,94,199,122,174,112,180,99,144,617, 0,9461,9667,11282,11493,14062,19248,20426,20886,22285,
+20 416123 443187 uc002wds.3 0 - 419229 443049 0 8 3362,188,142,102,187,81,186,208, 0,3628,4768,6108,6377,9570,12409,26856,
+20 416123 443187 uc002wdt.3 0 - 416123 416123 0 10 491,159,292,188,142,102,187,81,186,208, 0,1960,3070,3628,4768,6108,6377,9570,12409,26856,
+20 416123 443187 uc002wdv.3 0 - 419229 422326 0 7 3362,188,142,456,81,186,208, 0,3628,4768,6108,9570,12409,26856,
+20 416125 417600 uc021vzm.1 0 - 416125 416125 0 1 1475, 0,
+20 463337 524482 uc002wdw.1 0 - 464604 489195 0 14 1383,87,149,101,102,111,84,60,51,102,112,210,117,167, 0,3682,4733,5984,7086,9560,13025,15027,16561,17139,22424,25757,45239,60978,
+20 463337 524482 uc002wdx.1 0 - 464604 489195 0 13 1383,87,149,101,102,111,84,60,51,102,112,210,167, 0,3682,4733,5984,7086,9560,13025,15027,16561,17139,22424,25757,60978,
+20 463337 524482 uc002wdy.1 0 - 464604 478382 0 12 1383,87,149,101,102,111,84,60,51,102,112,167, 0,3682,4733,5984,7086,9560,13025,15027,16561,17139,22424,60978,
+20 584636 590910 uc002wdz.3 0 - 585234 590881 0 2 673,554, 0,5720,
+20 627267 634014 uc002wea.4 0 - 629357 633829 0 2 2294,395, 0,6352,
+20 627267 656823 uc002web.4 0 - 629499 656245 0 3 2294,90,711, 0,7607,28845,
+20 642239 656823 uc002wec.3 0 - 644314 656245 0 2 2866,711, 0,13873,
+20 740723 749228 uc002wed.4 0 - 741669 746418 0 5 1159,124,506,618,288, 0,1621,3418,5128,8217,
+20 740724 749228 uc002wee.2 0 - 742293 746418 0 4 1744,506,618,288, 0,3417,5127,8216,
+20 814355 826922 uc002wef.1 0 + 825447 826335 0 2 239,1572, 0,10995,
+20 816710 826922 uc002weg.1 0 + 825447 826335 0 2 47,1572, 0,8640,
+20 825284 826922 uc002weh.1 0 + 825447 826335 0 1 1638, 0,
+20 853296 896960 uc002wei.3 0 - 853602 896857 0 9 467,131,167,102,116,248,122,156,412, 0,1630,5507,7093,8517,12424,15664,17559,43252,
+20 853296 896960 uc010zpn.2 0 - 853750 896839 0 8 467,167,102,116,248,122,156,412, 0,5507,7093,8517,12424,15664,17559,43252,
+20 939095 982907 uc002wej.3 0 - 940999 982807 0 5 2014,186,141,189,179, 0,5482,8721,9497,43633,
+20 939095 982907 uc002wek.3 0 - 940999 982807 0 4 2014,141,189,179, 0,8721,9497,43633,
+20 1093905 1147970 uc010zpo.2 0 + 1106275 1146898 0 7 147,153,83,186,54,159,1100, 0,12235,14163,21858,49868,51056,52965,
+20 1093905 1148426 uc002wel.4 0 + 1099416 1145724 0 8 147,150,153,83,186,54,159,2754, 0,5490,12235,14163,21858,49868,51056,51767,
+20 1099239 1147970 uc010zpp.2 0 + 1099416 1146898 0 6 306,153,83,54,159,1100, 0,6901,8829,44534,45722,47631,
+20 1099239 1148426 uc002wen.4 0 + 1099416 1145724 0 7 306,153,83,186,54,159,2754, 0,6901,8829,16524,44534,45722,46433,
diff --git a/tool_collections/vcflib/vcfannotate/test-data/vcflib.vcf b/tool_collections/vcflib/vcfannotate/test-data/vcflib.vcf
new file mode 100644
index 00000000000..e8dd794d9bd
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotate/test-data/vcflib.vcf
@@ -0,0 +1,31 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3:.,.
+20 1110696 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+20 1230237 . T . 47 PASS NS=3;DP=13;AA=T GT:GQ:DP:HQ 0|0:54:.:56,60 0|0:48:4:51,51 0/0:61:2:.,.
+20 1234567 microsat1 G GA,GAC 50 PASS NS=3;DP=9;AA=G;AN=6;AC=3,1 GT:GQ:DP 0/1:.:4 0/2:17:2 1/1:40:3
+20 1235237 . T . . . . GT 0/0 0|0 ./.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcfannotate/vcfannotate.xml b/tool_collections/vcflib/vcfannotate/vcfannotate.xml
new file mode 100644
index 00000000000..f1876f42e1a
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotate/vcfannotate.xml
@@ -0,0 +1,51 @@
+
+ Intersect VCF records with BED annotations
+
+ macros.xml
+
+
+
+ vcfannotate --key "${key_option}" -b "${bed_data}" "${input}" > "${out_file1}"
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+Intersect the records in the VCF file with intervals (features) provided in a BED file. Intersections are done on the reference sequences in the VCF file.
+
+-----
+
+.. class:: infomark
+
+
+**Example**:
+
+The following VCF line::
+
+ #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+ 20 1110696 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+
+will appear as the follwing after intersectuion with BED records uc010zpo.2, uc002wel.4, uc010zpp.2, and uc002wen::
+
+ #CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+ 20 1110696 rs6040355 A G,T 67 PASS AA=T;AF=0.333,0.667;BED-features=uc010zpo.2:uc002wel.4:uc010zpp.2:uc002wen.4;DP=10;NS=2;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+
+----
+
+Vcfannotate @IS_PART_OF_VCFLIB@
+
+
+
diff --git a/tool_collections/vcflib/vcfannotategenotypes/.shed.yml b/tool_collections/vcflib/vcfannotategenotypes/.shed.yml
new file mode 100644
index 00000000000..77ef90bd9d0
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotategenotypes/.shed.yml
@@ -0,0 +1,13 @@
+categories:
+- Variant Analysis
+description: 'Annotate genotypes in a VCF dataset using genotypes from another VCF dataset.'
+homepage_url: https://github.com/ekg/vcflib
+long_description: |
+ Annotates genotypes in the first file with genotypes in the second
+ adding the genotype as another flag to each sample filed in the first file. Annotation-tag
+ is the name of the sample flag which is added to store the annotation. also adds
+ a ''has\_variant'' flag for sites where the second file has a variant.
+name: vcfannotategenotypes
+owner: devteam
+remote_repository_url: https://github.com/galaxyproject/tools-iuc/tree/master/tool_collections/vcflib/vcfannotategenotypes
+type: unrestricted
diff --git a/tool_collections/vcflib/vcfannotategenotypes/macros.xml b/tool_collections/vcflib/vcfannotategenotypes/macros.xml
new file mode 120000
index 00000000000..0c6ff6c3756
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotategenotypes/macros.xml
@@ -0,0 +1 @@
+../macros.xml
\ No newline at end of file
diff --git a/tool_collections/vcflib/vcfannotategenotypes/test-data/vcfannotategenotypes-input2.vcf b/tool_collections/vcflib/vcfannotategenotypes/test-data/vcfannotategenotypes-input2.vcf
new file mode 100644
index 00000000000..ebfcec99b4b
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotategenotypes/test-data/vcfannotategenotypes-input2.vcf
@@ -0,0 +1,31 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS AF=0.5;DP=14;NS=3;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 AF=0.017;DP=11;NS=3 GT:GQ:DP:HQ 0|0:49:3:58,50 0/0:41:3:.,.
+20 1110696 rs6040355 A G,T 67 PASS AA=T;AF=0.333,0.667;DP=10;NS=2;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2/2:35:4:.,.
+20 1230237 . T . 47 PASS AA=T;DP=13;NS=3 GT:GQ:DP:HQ 0|0:54:.:56,60 0/0:61:2:.,.
+20 1234567 microsat1 G GA,GAC 50 PASS AA=G;AC=3,1;AN=6;DP=9;NS=3 GT:GQ:DP 0/1:.:4 1/1:40:3
+20 1235237 . T . 0 . . GT 0/0 .
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0|2
diff --git a/tool_collections/vcflib/vcfannotategenotypes/test-data/vcfannotategenotypes-test1.vcf b/tool_collections/vcflib/vcfannotategenotypes/test-data/vcfannotategenotypes-test1.vcf
new file mode 100644
index 00000000000..f867da394e7
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotategenotypes/test-data/vcfannotategenotypes-test1.vcf
@@ -0,0 +1,33 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+##INFO=
+##FORMAT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . added-genotypes.has_variant GT:HQ:added-genotypes 0|0:10,10:0/0 0|0:10,10:./. 0/1:3,3:0/1
+19 112 . A G 10 . added-genotypes.has_variant GT:HQ:added-genotypes 0|0:10,10:0/0 0|0:10,10:./. 0/1:3,3:0/1
+20 14370 rs6054257 G A 29 PASS AF=0.5;DP=14;NS=3;DB;H2;added-genotypes.has_variant GT:GQ:DP:HQ:added-genotypes 0|0:48:1:51,51:0/0 1|0:48:8:51,51:./. 1/1:43:5:.,.:1/1
+20 17330 . T A 3 q10 AF=0.017;DP=11;NS=3;added-genotypes.has_variant GT:GQ:DP:HQ:added-genotypes 0|0:49:3:58,50:0/0 0|1:3:5:65,3:./. 0/0:41:3:.,.:0/0
+20 1110696 rs6040355 A G,T 67 PASS AA=T;AF=0.333,0.667;DP=10;NS=2;DB;added-genotypes.has_variant GT:GQ:DP:HQ:added-genotypes 1|2:21:6:23,27:1/2 2|1:2:0:18,2:./. 2/2:35:4:.,.:2/2
+20 1230237 . T . 47 PASS AA=T;DP=13;NS=3;added-genotypes.has_variant GT:GQ:DP:HQ:added-genotypes 0|0:54:.:56,60:0/0 0|0:48:4:51,51:./. 0/0:61:2:.,.:0/0
+20 1234567 microsat1 G GA,GAC 50 PASS AA=G;AC=3,1;AN=6;DP=9;NS=3;added-genotypes.has_variant GT:GQ:DP:added-genotypes 0/1:.:4:0/1 0/2:17:2:./. 1/1:40:3:1/1
+20 1235237 . T . 0 . added-genotypes.has_variant GT:added-genotypes 0/0:0/0 0|0:./. ./.:.
+X 10 rsTest AC A,ATG 10 PASS added-genotypes.has_variant GT:added-genotypes 0:0 0/1:./. 0|2:0/2
diff --git a/tool_collections/vcflib/vcfannotategenotypes/test-data/vcflib.vcf b/tool_collections/vcflib/vcfannotategenotypes/test-data/vcflib.vcf
new file mode 100644
index 00000000000..e8dd794d9bd
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotategenotypes/test-data/vcflib.vcf
@@ -0,0 +1,31 @@
+##fileformat=VCFv4.0
+##fileDate=20090805
+##source=myImputationProgramV3.1
+##reference=1000GenomesPilot-NCBI36
+##phasing=partial
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##INFO=
+##FILTER=
+##FILTER=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##FORMAT=
+##ALT=
+##ALT=
+#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT NA00001 NA00002 NA00003
+19 111 . A C 9.6 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+19 112 . A G 10 . . GT:HQ 0|0:10,10 0|0:10,10 0/1:3,3
+20 14370 rs6054257 G A 29 PASS NS=3;DP=14;AF=0.5;DB;H2 GT:GQ:DP:HQ 0|0:48:1:51,51 1|0:48:8:51,51 1/1:43:5:.,.
+20 17330 . T A 3 q10 NS=3;DP=11;AF=0.017 GT:GQ:DP:HQ 0|0:49:3:58,50 0|1:3:5:65,3 0/0:41:3:.,.
+20 1110696 rs6040355 A G,T 67 PASS NS=2;DP=10;AF=0.333,0.667;AA=T;DB GT:GQ:DP:HQ 1|2:21:6:23,27 2|1:2:0:18,2 2/2:35:4:.,.
+20 1230237 . T . 47 PASS NS=3;DP=13;AA=T GT:GQ:DP:HQ 0|0:54:.:56,60 0|0:48:4:51,51 0/0:61:2:.,.
+20 1234567 microsat1 G GA,GAC 50 PASS NS=3;DP=9;AA=G;AN=6;AC=3,1 GT:GQ:DP 0/1:.:4 0/2:17:2 1/1:40:3
+20 1235237 . T . . . . GT 0/0 0|0 ./.
+X 10 rsTest AC A,ATG 10 PASS . GT 0 0/1 0|2
diff --git a/tool_collections/vcflib/vcfannotategenotypes/vcfannotategenotypes.xml b/tool_collections/vcflib/vcfannotategenotypes/vcfannotategenotypes.xml
new file mode 100644
index 00000000000..b9a363d5b89
--- /dev/null
+++ b/tool_collections/vcflib/vcfannotategenotypes/vcfannotategenotypes.xml
@@ -0,0 +1,34 @@
+
+ Annotate genotypes in a VCF dataset using genotypes from another VCF dataset
+
+