hgdownload.soe.ucsc.eduIndex of /goldenPath/petMar2/multiz7way
This file is from:
http://hgdownload.cse.ucsc.edu/goldenPath/petMar2/multiz7way/README.txt
This directory contains compressed multiple alignments of the
following assemblies to the lamprey genome (petMar2/WUGSC 7.0, Sep. 2012)
(alignment
type)
Lamprey - Petromyzon marinus Sep. 2012 (WUGSC 7.0/petMar2) reference
Chicken - Gallus gallus Nov. 2011 (ICGSC 4.0/galGal4) net
Medaka - Oryzias latipes Oct. 2005 (NIG/UT Medaka1/oryLat2) net
Opossum - Monodelphis domestica Oct. 2006 (Broad/monDom5) net
Human - Homo sapiens Feb. 2009 (GRCh37/hg19) net
Mouse - Mus musculus Dec. 2011 (GRCm38/mm10) net
Lancelet - Branchiostoma floridae Apr. 2008 (JGI v2.0/braFlo2) net
---------------------------------------------------------------
These alignments were prepared using the methods described in the
track description file:
http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=petMar2&g=cons7way
based on the phylogenetic tree: petMar2.7way.nh.
Files in this directory:
- petMar2.7way.nh - phylogenetic tree used during the multiz multiple alignment
- petMar2.7way.commonNames.nh - same as petMar2.7way.nh with the UCSC database
name replaced by the common name for the species
- upstream1000.xenoRefGene.maf.gz - alignments in regions upstream, see below
- upstream2000.xenoRefGene.maf.gz - alignments in regions upstream, see below
- upstream5000.xenoRefGene.maf.gz - alignments in regions upstream, see below
See also:
http://genomewiki.ucsc.edu/index.php/PetMar2_conservation_alignment
The two files: xenoRefGene.exonAA.fa.gz xenoRefGene.exonNuc.fa.gz
contain compressed FASTA alignments for the Xeno RefSeq CDS regions
of the lamprey genome (petMar2, Apr. 2012) aligned to the assemblies.
The petMar2.7way.maf.gz file contains the alignments to
the lamprey assembly, with additional annotations to
indicate gap context, and genomic breaks for the
sequence in the underlying genome assemblies. Beware, the compressed
data size of these this is 140 Mb, uncompressed is approximately 574 Mb.
The upstream*.maf.gz files contain alignments in regions upstream of
annotated transcription starts for Xeno RefSeq genes with annotated 5' UTRs.
These files differ from the standard MAF format: they display
alignments that extend from start to end of the upstream region in
mouse, whether or not alignments actually exist. In situations where no
alignments exist or the alignments of one or more species are missing,
dot (".") is used as a placeholder. Multiple regions of an assembly's
sequence may align to a single region in the lamprey; therefore,
only the species name is displayed in the alignment data and no position
information is recorded. The alignment score is always zero in these files.
These files are updated weekly.
For a description of multiple alignment format (MAF), see
http://genome.ucsc.edu/goldenPath/help/maf.html.
PhastCons conservation scores for these alignments are available at:
http://hgdownload.cse.ucsc.edu/goldenPath/petMar2/phastCons7way
PhyloP conservation scores for these alignments are available at:
http://hgdownload.cse.ucsc.edu/goldenPath/petMar2/phyloP7way
---------------------------------------------------------------
To download a large file or multiple files from this directory, we recommend
that you use rsync or ftp rather than downloading the files via our website.
There is approximately 31 Gb of compressed data in this directory.
Via rsync:
rsync -avz --progress \
rsync://hgdownload.cse.ucsc.edu/goldenPath/petMar2/multiz7way/ ./
Via FTP:
ftp hgdownload.cse.ucsc.edu
user name: anonymous
password: <your email address>
go to the directory goldenPath/petMar2/multiz7way
To download multiple files from the UNIX command line, use the "mget" command.
mget <filename1> <filename2> ...
- or -
mget -a (to download all the files in the directory)
Use the "prompt" command to toggle the interactive mode if you do not want
to be prompted for each file that you download.
---------------------------------------------------------------
All the files in this directory are freely usable for any
purpose. For data use restrictions regarding the individual
genome assemblies, see http://genome.ucsc.edu/goldenPath/credits.html.
---------------------------------------------------------------
Name Last modified Size Description
Parent Directory -
maf/ 2019-11-06 11:12 -
md5sum.txt 2012-11-07 15:08 505
petMar2.7way.commonNames.nh 2012-10-30 15:56 173
petMar2.7way.maf.gz 2012-10-31 11:38 140M
petMar2.7way.nh 2012-10-30 15:54 161
upstream1000.xenoRefGene.maf.gz 2012-11-07 11:45 158K
upstream2000.xenoRefGene.maf.gz 2012-11-07 11:45 276K
upstream5000.xenoRefGene.maf.gz 2012-11-07 11:45 1.0M
xenoRefGene.exonAA.fa.gz 2012-11-07 15:03 43M
xenoRefGene.exonNuc.fa.gz 2012-11-07 15:03 67M