<?xml version="1.0" encoding="UTF-8" ?>
<!-- RSS generated by PHPBoost on Mon, 01 Jun 2026 20:03:22 +0200 -->

<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title><![CDATA[Easy Guides]]></title>
		<atom:link href="https://www.sthda.com/english/syndication/rss/wiki/17" rel="self" type="application/rss+xml"/>
		<link>https://www.sthda.com</link>
		<description><![CDATA[Last articles of the category: Genomic Toolkits]]></description>
		<copyright>(C) 2005-2026 PHPBoost</copyright>
		<language>en</language>
		<generator>PHPBoost</generator>
		
		
		<item>
			<title><![CDATA[From SRA to FASTQ file]]></title>
			<link>https://www.sthda.com/english/wiki/from-sra-to-fastq-file</link>
			<guid>https://www.sthda.com/english/wiki/from-sra-to-fastq-file</guid>
			<description><![CDATA[<!-- START HTML -->
  <!--====================== start from here when you copy to sthda================-->  
  <div id="rdoc">



<div id="TOC">
<ul>
<li><a href="#objectives">Objectives</a></li>
<li><a href="#download-sra-file">Download SRA file</a></li>
<li><a href="#convert-sra-to-fastq-format">Convert SRA to FASTQ format</a></li>
</ul>
</div>

<hr />
<p><span class="warning"> This analysis was performed using R (ver. 3.1.0). </span></p>
<div id="objectives" class="section level2">
<h2>Objectives</h2>
<ul>
<li>Download automatically sequencing data from <strong>Short Read Archive (SRA)</strong></li>
<li>Convert SRA to FASTQ file</li>
</ul>
</div>
<div id="download-sra-file" class="section level2">
<h2>Download SRA file</h2>
<p>The data from <a href="http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE57478">Popovic et al., (GEO accession number: GSE57478)</a> where used in the following example. The SRA files are available here : <a href="http://www.ncbi.nlm.nih.gov/sra?term=SRP032510"><a href="http://www.ncbi.nlm.nih.gov/sra?term=SRP032510">http://www.ncbi.nlm.nih.gov/sra?term=SRP032510</a></a>.</p>
<p><span class="success">The downloaded process can be automated using R.</span></p>
<p><strong>STEP 1. Download a table of the metadata into a CSV file “SraRunInfo.csv”:</strong></p>
<p>From <a href="http://www.ncbi.nlm.nih.gov/sra?term=SRP032510">SRA</a> web page :</p>
<p>click on “Send to (top right corner)” <i class="fa fa-long-arrow-right"></i> Select “File” <i class="fa fa-long-arrow-right"></i> Select format “RunInfo” <i class="fa fa-long-arrow-right"></i> Click on “Create File”</p>
<p><strong>STEP 2. Read this CSV file “SraRunInfo.csv” into R:</strong></p>
<p>The SRA files are automatically download in the current working directory using the following R script:</p>
<pre class="r"><code>#Read SRA file infos
sri<-read.csv("SraRunInfo.csv", stringsAsFactors=FALSE)
files<-basename(sri$download_path)
for(i in 1:length(files)) download.file(sri$download_path[i], files[i])</code></pre>
<p><span class="warning">This article describes just one way to automate the download of SRA files from R. Users can also use <strong>wget</strong> (Unix/Linux) or <strong>curl</strong> (MAC OS X) or download from web browser </span></p>
<pre class="bash"><code>wget ftp://ftp-trace.ncbi.nlm.nih.gov/sra/sra-instant/reads/ByRun/sra/SRR/SRR128/SRR1282056/SRR1282056.sra</code></pre>
</div>
<div id="convert-sra-to-fastq-format" class="section level2">
<h2>Convert SRA to FASTQ format</h2>
<p>To convert the example data to <strong>FASTQ</strong>, use the <strong>fastq-dump</strong> command from the <strong>SRA Toolkit</strong> on each SRA file. To install SRA Toolkit click <a href="https://www.sthda.com/english/english/wiki/install-sra-toolkit">here</a>.</p>
<p>R can be used to construct the required shell commands and to automate the process, starting from the SraRunInfo.csv" metadata table, as follows:</p>
<pre class="r"><code># Assure that all the files has been downloaded successfully
# Remember, the R object files has been created in the previous code chunk
stopifnot( all(file.exists(files)) ) 
for(f in files) {
  cmd = paste("fastq-dump --split-3", f)
  cat(cmd,"\n")#print the current command
  system(cmd) # invoke command
}</code></pre>
<p><span class="warning"><strong>fastq-dumb</strong> can be also used manually into the Unix Shell.</span></p>
<pre class="bash"><code>fastq-dump --split-3 SRR1282056.sra </code></pre>
<p><span class="error"> Be sure to use the <strong>–split-3</strong> option, which splits mate-pair reads into separate files. After this command, single and paired-end data will produce one or two FASTQ files, respectively. For paired-end data, the file names will be suffixed 1.FASTQ and 2.FASTQ; otherwise, a single file with extension .FASTQ will be produced </span></p>
</div>

<script>jQuery(document).ready(function () {jQuery('h1,h2,h3,h4').addClass('formatter-title');});//add phpboost class to header</script>
<style>.content{padding:0px;}</style>
</div><!--end rdoc-->
<!--====================== stop here when you copy to sthda================-->

<!-- END HTML --><br />
<br />
<strong>See the video:</strong><br />
<p style="text-align: center;"><iframe class="youtube-player" type="text/html" width="560" height="315" src="https://www.youtube.com/embed/y5ZBj4j6KHo" frameborder="0" allowfullscreen></iframe></p>]]></description>
			<pubDate>Sun, 05 Oct 2014 11:17:38 +0200</pubDate>
			
		</item>
		
		<item>
			<title><![CDATA[Install SRA toolkit]]></title>
			<link>https://www.sthda.com/english/wiki/install-sra-toolkit</link>
			<guid>https://www.sthda.com/english/wiki/install-sra-toolkit</guid>
			<description><![CDATA[SRA toolkit contains important tools to manipulate SRA (Short Read Archive) file.<br />
<br />
The objective of this article is to show you, how to install SRA toolkit on Ubuntu/Linux system.<br />
<br />
<br />
<h2 class="formatter-title">Installation</h2><br />
<br />
1. Download the last version for your computer operating system from <a href="http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=software">here</a><br />
<br />
Use the following command on Linux to download the file sratoolkit.2.4.1-ubuntu64.tar.gz:<br />
<br />
<div class="formatter-container formatter-code code-BASH"><span class="formatter-title">Code BASH :</span><div class="formatter-content"><pre style="display:inline;"><pre class="bash" style="font-family:monospace;">&amp;nbsp;
<span style="color: #666666; font-style: italic;">#Download the file for ubuntu system</span>
<span style="color: #c20cb9; font-weight: bold;">wget</span> http:<span style="color: #000000; font-weight: bold;">//</span>ftp-trace.ncbi.nlm.nih.gov<span style="color: #000000; font-weight: bold;">/</span>sra<span style="color: #000000; font-weight: bold;">/</span>sdk<span style="color: #000000; font-weight: bold;">/</span>2.4.1<span style="color: #000000; font-weight: bold;">/</span>sratoolkit.2.4.1-ubuntu64.tar.gz
<span style="color: #666666; font-style: italic;"># Unzip the archive</span>
<span style="color: #c20cb9; font-weight: bold;">tar</span> xzvf sratoolkit.2.4.1-ubuntu64.tar.gz
&amp;nbsp;</pre></pre></div></div><br />
<br />
<br />
2. Modify your .bashrc file so that when you type "fastq-dump" , for example, it calls the program:<br />
<div class="formatter-container formatter-code code-BASH"><span class="formatter-title">Code BASH :</span><div class="formatter-content"><pre style="display:inline;"><pre class="bash" style="font-family:monospace;"><span style="color: #7a0874; font-weight: bold;">export</span> <span style="color: #007800;">PATH</span>=<span style="color: #007800;">$PATH</span>:<span style="color: #000000; font-weight: bold;">/</span>directory<span style="color: #000000; font-weight: bold;">/</span>sratoolkit.2.4.1-ubuntu64<span style="color: #000000; font-weight: bold;">/</span>bin </pre></pre></div></div><br />
<br />
<span class="warning"><br />
Directory is equal to the directory that you have installed sratoolkit  in. Example:<br />
<div class="formatter-container formatter-code code-BASH"><span class="formatter-title">Code BASH :</span><div class="formatter-content"><pre style="display:inline;"><pre class="bash" style="font-family:monospace;"><span style="color: #7a0874; font-weight: bold;">export</span> <span style="color: #007800;">PATH</span>=<span style="color: #007800;">$PATH</span>:<span style="color: #000000; font-weight: bold;">/</span>home<span style="color: #000000; font-weight: bold;">/</span>kassambara<span style="color: #000000; font-weight: bold;">/</span>Documents<span style="color: #000000; font-weight: bold;">/</span>sequencing<span style="color: #000000; font-weight: bold;">/</span>tools<span style="color: #000000; font-weight: bold;">/</span>sratoolkit.2.4.1-ubuntu64<span style="color: #000000; font-weight: bold;">/</span>bin </pre></pre></div></div><br />
</span>]]></description>
			<pubDate>Sun, 05 Oct 2014 02:54:12 +0200</pubDate>
			
		</item>
		
		<item>
			<title><![CDATA[Install samtools on UNIX system]]></title>
			<link>https://www.sthda.com/english/wiki/install-samtools-on-unix-system</link>
			<guid>https://www.sthda.com/english/wiki/install-samtools-on-unix-system</guid>
			<description><![CDATA[Samtools is a useful tool for manipulating and visualizing .bam files.<br />
<br />
<h2 class="formatter-title">Installation</h2><br />
<br />
<br />
1. Download the most current version from the <a href="http://sourceforge.net/projects/samtools/files/samtools/">Samtools</a> website.<br />
<br />
2. Unzip the file:<br />
<div class="formatter-container formatter-code code-BASH"><span class="formatter-title">Code BASH :</span><div class="formatter-content"><pre style="display:inline;"><pre class="bash" style="font-family:monospace;"><span style="color: #c20cb9; font-weight: bold;">tar</span> xvjf samtools-<span style="color: #000000;">1.1</span>.tar.bz2 </pre></pre></div></div><br />
<br />
3. Go into the newly created directory and compile the code by typing <strong>make</strong>:<br />
<div class="formatter-container formatter-code code-BASH"><span class="formatter-title">Code BASH :</span><div class="formatter-content"><pre style="display:inline;"><pre class="bash" style="font-family:monospace;">&amp;nbsp;
<span style="color: #7a0874; font-weight: bold;">cd</span> samtools-<span style="color: #000000;">1.1</span>
<span style="color: #c20cb9; font-weight: bold;">make</span>
&amp;nbsp;</pre></pre></div></div><br />
<br />
4. Modify your .bashrc file so that when you type "samtools" it calls the program:<br />
<div class="formatter-container formatter-code code-BASH"><span class="formatter-title">Code BASH :</span><div class="formatter-content"><pre style="display:inline;"><pre class="bash" style="font-family:monospace;"><span style="color: #7a0874; font-weight: bold;">export</span> <span style="color: #007800;">PATH</span>=<span style="color: #007800;">$PATH</span>:<span style="color: #000000; font-weight: bold;">/</span>directory<span style="color: #000000; font-weight: bold;">/</span>samtools-0.1.19 </pre></pre></div></div><br />
<br />
<span class="warning">Directory is equal to the directory that you have installed Samtools in.</span>]]></description>
			<pubDate>Sun, 05 Oct 2014 02:48:49 +0200</pubDate>
			
		</item>
		
		<item>
			<title><![CDATA[Genomic Toolkits]]></title>
			<link>https://www.sthda.com/english/wiki/genomic-toolkits</link>
			<guid>https://www.sthda.com/english/wiki/genomic-toolkits</guid>
			<description><![CDATA[Tools and scripts for genomic data analysis]]></description>
			<pubDate>Fri, 03 Oct 2014 23:39:46 +0200</pubDate>
			
		</item>
		
	</channel>
</rss>
