GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM1831776

Query DataSets for GSM1831776

Status

Public on Aug 12, 2016

Title

ScaleUpDesign2_SV40P_Plasmid

Sample type

SRA

Source name

Plasmid pool

Organism

Escherichia coli

Characteristics

source: plasmid
promoter: SV40

Extracted molecule

genomic DNA

Extraction protocol

Total RNA was isolated from cell lysates using RNeasy kits (Qiagen). mRNA was extracted from total RNA using MicroPoly(A)Purist™ kits (Ambion) and treated with DNase I using the Turbo DNA-free™ kit (Ambion).
First-strand cDNA was synthesized from 400-700 ng mRNA using High Capacity RNA-to-cDNA kits (Applied Biosystems). Tag-Seq sequencing libraries were generated directly from 12% of a cDNA reaction or 50 ng plasmid DNA by 26 cycle PCR using Pfu Ultra HS DNA polymerase 2x master mix (Agilent) and primers AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT and CAAGCAGAAGACGGCATACGAGATXXXXXXXXGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTCGAGGTGCCTAAAGG (where XXXXXXXX is a library-specific index sequence). The resultant PCR products were size-selected using 2% agarose E-Gel EX (Invitrogen).

Library strategy

OTHER

Library source

genomic

Library selection

other

Instrument model

Illumina HiSeq 2500

Description

Scale-up Design 2 (5-bp steps), MPRA plasmid pool, replicate 1 and 2

Data processing

To infer the tag copy numbers in each Tag-Seq library, all sequence reads were examined, regardless of their quality scores. If the first ten nucleotides of a read perfectly matched one of the 54K (pilot) or ~ 244K (scale-up) designed tags and the remaining nucleotides matched the expected upstream MPRA construct sequence, this was counted as one occurrence of that tag. All reads that did not meet this criterion were discarded.
Genome_build: hg19 for coordinates in *ScaleUpDesign*.counts.txt file
Supplementary_files_format_and_content: The first tab delimited column contains the construct ID, the second column the DNA sequence to which it corresponds, the third column all the barcode tag IDs in comma delimited form if more than one, the fourth column the corresponding counts for the tags also in comma delimited form if more than one. In the ScaleUpDesign construct IDs are of form CELLTYPE_STATE_REGIONINSTATE_TILEPOS_chr_center where CELLTYPE and STATE are the cell type and chromatin state based on which the regulatory region was selected, REGIONINSTATE is an ID among selections from the CELLTYPE and STATE combination, TILEPOS is the tile position and ranges from 0 to 30 where TILEPOS=0 is the first tile, and chr and center and the chromosome and coordinate of the center base (hg19) of the regulatory region being tiled. In the PilotDesign construct IDs are of the form GROUP_REGIONIDINGROUP_TILEPOS. GROUP is from the set {high,range,controlhigh} where high corresponds to regions selected based on being in candidate enhancer chromatin states in HepG2 with the strongest H3K27ac dip scores, range represents enhancers with a range of dip scores, and controlhigh are locations in candidate enhancers in K562 but not HepG2. REGIONIDINGROUP is an ID among selections in the GROUP. TILEPOS is the tile position and ranges from 0 to 8 where TILEPOS=0 is the first tile.

Submission date

Jul 23, 2015

Last update date

May 15, 2019

Contact name

Jason Ernst

E-mail(s)

jason.ernst@ucla.edu

Organization name

UCLA

Department

Biological Chemistry

Street address

615 Charles E Young Dr South

City

Los Angeles

State/province

ZIP/Postal code

90095

Country

USA

Platform ID

GPL18133

Series (1)

GSE71279

Genome-scale high-resolution mapping of activating and repressive nucleotides in regulatory regions

Relations

BioSample

SAMN03897404

SRA

SRX1116943

Supplementary file	Size	Download	File type/resource
GSM1831776_ScaleUpDesign2_SV40P_Plasmid.counts.txt.gz	4.1 Mb	(ftp)(http)	TXT
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file