×

COL1A1 cDNA ORF clone, Homo sapiens (human)

Gene Symbol COL1A1
Entrez Gene ID 1277
Full Name collagen, type I, alpha 1
Synonyms EDSC, OI1, OI2, OI3, OI4
General protein information
Preferred Names
collagen alpha-1(I) chain
Names
collagen alpha-1(I) chain
alpha-1 type I collagen
pro-alpha-1 collagen type 1
collagen alpha 1 chain type I
collagen alpha-1(I) chain preproprotein
collagen of skin, tendon and bone, alpha-1 chain
Gene Type protein-coding
Organism Homo sapiens (human)
Genome

17

17q21.33

Summary This gene encodes the pro-alpha1 chains of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIA, Ehlers-Danlos syndrome Classical type, Caffey Disease and idiopathic osteoporosis. Reciprocal translocations between chromosomes 17 and 22, where this gene and the gene for platelet-derived growth factor beta are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans, resulting from unregulated expression of the growth factor. Two transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene. [provided by R. Dalgleish, Feb 2008]. lac of sum
Disorder MIM:

120150

Disorder Html: Osteogenesis imperfecta, type I, 166200 (3); OI type II, 166210 (3);

mRNA and Protein(s)

mRNA Protein Name
XM_011524341 XP_011522643 collagen alpha-1(I) chain isoform X1
XM_005257058 XP_005257115 collagen alpha-1(I) chain isoform X2
XM_005257059 XP_005257116 collagen alpha-1(I) chain isoform X3
NM_000088 NP_000079 collagen alpha-1(I) chain preproprotein



Homo sapiens (human) COL1A1 NP_000079.2
Pan troglodytes (chimpanzee) COL1A1 XP_001169409.1
Macaca mulatta (Rhesus monkey) COL1A1 XP_001096194.2
Canis lupus familiaris (dog) COL1A1 NP_001003090.1
Bos taurus (cattle) COL1A1 NP_001029211.1
Mus musculus (house mouse) Col1a1 NP_031768.2
Rattus norvegicus (Norway rat) Col1a1 NP_445756.1
Danio rerio (zebrafish) col1a1a NP_954684.1
Xenopus (Silurana) tropicalis (western clawed frog) LOC496414 NP_001011005.1


GeneRIFs: Gene References Into Functions What's a GeneRIF?

The following COL1A1 gene cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). These sequences represent the protein coding region of the COL1A1 cDNA ORF which is encoded by the open reading frame (ORF) sequence. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK or the vector of your choice as an expression/transfection-ready ORF clone. Not the clone you want? Click here to find your clone.

***CloneID RefSeq Accession Definition **Vector *Turnaround time Price Select
OHu56962 XM_011524341 PREDICTED: Homo sapiens collagen, type I, alpha 1 (COL1A1), transcript variant X1, mRNA. pcDNA3.1+/C-(K)DYK or customized vector TBD $727.30
$1039.00
OHu50114 XM_005257058 PREDICTED: Homo sapiens collagen, type I, alpha 1 (COL1A1), transcript variant X2, mRNA. pcDNA3.1+/C-(K)DYK or customized vector 25 $727.30
$1039.00
OHu50115 XM_005257059 PREDICTED: Homo sapiens collagen, type I, alpha 1 (COL1A1), transcript variant X3, mRNA. pcDNA3.1+/C-(K)DYK or customized vector 20 $559.30
$799.00
OHu25462 NM_000088 Homo sapiens collagen, type I, alpha 1 (COL1A1), mRNA. pcDNA3.1+/C-(K)DYK or customized vector 25 $69.30
$99.00
Next-day Shipping ORF Clones ( in default vector with tag)
1 Clone 30% OFF
2-4 Clone 40% OFF
5 or more Clone 50% OFF
All Other ORF Clones
30% OFF

*Business Day

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after clone is added to cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to the signal peptide, propeptide and transit peptide in target ORF, which may affect the choice of vector (N/C terminal tag vector).

***One clone ID might be correlated to multiple accession numbers, which share the same CDS sequence.


CloneID OHu56962
Clone ID Related Accession (Same CDS sequence) XM_011524341
Accession Version XM_011524341.1 Documents for ORF clone product in dufault vector
Sequence Information ORF Nucleotide Sequence (Length: 4197bp)
Protein sequence
SNP
Vector pcDNA3.1+/C-(K)DYK or customized vector User Manual
Clone information Clone Map MSDS
Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
ORF Insert Method CloneEZ® Seamless cloning technology
Structure linear
Update Date 12-MAR-2015
Organism Homo sapiens (human)
Product collagen alpha-1(I) chain isoform X1
Comment MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NT_010783.16) annotated using gene prediction method: Gnomon, supported by mRNA and EST evidence. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Version :: Homo sapiens Annotation Release 107 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 6.2 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END##

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGA
AACCCTGGTG CTGATGGACA GCCTGGTGCT AAAGGTGCCA ATGGTGCTCC TGGTATTGCT
GGTGCTCCTG GCTTCCCTGG TGCCCGAGGC CCCTCTGGAC CCCAGGGCCC CGGCGGCCCT
CCTGGTCCCA AGGGTAACAG CGGTGAACCT GGTGCTCCTG GCAGCAAAGG AGACACTGGT
GCTAAGGGAG AGCCTGGCCC TGTTGGTGTT CAAGGACCCC CTGGCCCTGC TGGAGAGGAA
GGAAAGCGAG GAGCTCGAGG TGAACCCGGA CCCACTGGCC TGCCCGGACC CCCTGGCGAG
CGTGGTGGAC CTGGTAGCCG TGGTTTCCCT GGCGCAGATG GTGTTGCTGG TCCCAAGGGT
CCCGCTGGTG AACGTGGTTC TCCTGGCCCT GCTGGCCCCA AAGGATCTCC TGGTGAAGCT
GGTCGTCCCG GTGAAGCTGG TCTGCCTGGT GCCAAGGGTC TGACTGGAAG CCCTGGCAGC
CCTGGTCCTG ATGGCAAAAC TGGCCCCCCT GGTCCCGCCG GTCAAGATGG TCGCCCCGGA
CCCCCAGGCC CACCTGGTGC CCGTGGTCAG GCTGGTGTGA TGGGATTCCC TGGACCTAAA
GGTGCTGCTG GAGAGCCCGG CAAGGCTGGA GAGCGAGGTG TTCCCGGACC CCCTGGCGCT
GTCGGTCCTG CTGGCAAAGA TGGAGAGGCT GGAGCTCAGG GACCCCCTGG CCCTGCTGGT
CCCGCTGGCG AGAGAGGTGA ACAAGGCCCT GCTGGCTCCC CCGGATTCCA GGGTCTCCCT
GGTCCTGCTG GTCCTCCAGG TGAAGCAGGC AAACCTGGTG AACAGGGTGT TCCTGGAGAC
CTTGGCGCCC CTGGCCCCTC TGGAGCAAGA GGCGAGAGAG GTTTCCCTGG CGAGCGTGGT
GTGCAAGGTC CCCCTGGTCC TGCTGGTCCC CGAGGGGCCA ACGGTGCTCC CGGCAACGAT
GGTGCTAAGG GTGATGCTGG TGCCCCTGGA GCTCCCGGTA GCCAGGGCGC CCCTGGCCTT
CAGGGAATGC CTGGTGAACG TGGTGCAGCT GGTCTTCCAG GGCCTAAGGG TGACAGAGGT
GATGCTGGTC CCAAAGGTGC TGATGGCTCT CCTGGCAAAG ATGGCGTCCG TGGTCTGACT
GGCCCCATTG GTCCTCCTGG CCCTGCTGGT GCCCCTGGTG ACAAGGGTGA AAGTGGTCCC
AGCGGCCCTG CTGGTCCCAC TGGAGCTCGT GGTGCCCCCG GAGACCGTGG TGAGCCTGGT
CCCCCCGGCC CTGCTGGCTT TGCTGGCCCC CCTGGTGCTG ACGGCCAACC TGGTGCTAAA
GGCGAACCTG GTGATGCTGG TGCTAAAGGC GATGCTGGTC CCCCTGGCCC TGCCGGACCC
GCTGGACCCC CTGGCCCCAT TGGTAATGTT GGTGCTCCTG GAGCCAAAGG TGCTCGCGGC
AGCGCTGGTC CCCCTGGTGC TACTGGTTTC CCTGGTGCTG CTGGCCGAGT CGGTCCTCCT
GGCCCCTCTG GAAATGCTGG ACCCCCTGGC CCTCCTGGTC CTGCTGGCAA AGAAGGCGGC
AAAGGTCCCC GTGGTGAGAC TGGCCCTGCT GGACGTCCTG GTGAAGTTGG TCCCCCTGGT
CCCCCTGGCC CTGCTGGCGA GAAAGGATCC CCTGGTGCTG ATGGTCCTGC TGGTGCTCCT
GGTACTCCCG GGCCTCAAGG TATTGCTGGA CAGCGTGGTG TGGTCGGCCT GCCTGGTCAG
AGAGGAGAGA GAGGCTTCCC TGGTCTTCCT GGCCCCTCTG GTGAACCTGG CAAACAAGGT
CCCTCTGGAG CAAGTGGTGA ACGTGGTCCC CCTGGTCCCA TGGGCCCCCC TGGATTGGCT
GGACCCCCTG GTGAATCTGG ACGTGAGGGG GCTCCTGGTG CCGAAGGTTC CCCTGGACGA
GACGGTTCTC CTGGCGCCAA GGGTGACCGT GGTGAGACCG GCCCCGCTGG ACCCCCTGGT
GCTCCTGGTG CTCCTGGTGC CCCTGGCCCC GTTGGCCCTG CTGGCAAGAG TGGTGATCGT
GGTGAGACTG GTCCTGCTGG TCCCGCCGGT CCTGTCGGCC CTGTTGGCGC CCGTGGCCCC
GCCGGACCCC AAGGCCCCCG TGGTGACAAG GGTGAGACAG GCGAACAGGG CGACAGAGGC
ATAAAGGGTC ACCGTGGCTT CTCTGGCCTC CAGGGTCCCC CTGGCCCTCC TGGCTCTCCT
GGTGAACAAG GTCCCTCTGG AGCCTCTGGT CCTGCTGGTC CCCGAGGTCC CCCTGGCTCT
GCTGGTGCTC CTGGCAAAGA TGGACTCAAC GGTCTCCCTG GCCCCATTGG GCCCCCTGGT
CCTCGCGGTC GCACTGGTGA TGCTGGTCCT GTTGGTCCCC CCGGCCCTCC TGGACCTCCT
GGTCCCCCTG GTCCTCCCAG CGCTGGTTTC GACTTCAGCT TCCTGCCCCA GCCACCTCAA
GAGAAGGCTC ACGATGGTGG CCGCTACTAC CGGGCTGATG ATGCCAATGT GGTTCGTGAC
CGTGACCTCG AGGTGGACAC CACCCTCAAG AGCCTGAGCC AGCAGATCGA GAACATCCGG
AGCCCAGAGG GCAGCCGCAA GAACCCCGCC CGCACCTGCC GTGACCTCAA GATGTGCCAC
TCTGACTGGA AGAGTGGAGA GTACTGGATT GACCCCAACC AAGGCTGCAA CCTGGATGCC
ATCAAAGTCT TCTGCAACAT GGAGACTGGT GAGACCTGCG TGTACCCCAC TCAGCCCAGT
GTGGCCCAGA AGAACTGGTA CATCAGCAAG AACCCCAAGG ACAAGAGGCA TGTCTGGTTC
GGCGAGAGCA TGACCGATGG ATTCCAGTTC GAGTATGGCG GCCAGGGCTC CGACCCTGCC
GATGTGGCCA TCCAGCTGAC CTTCCTGCGC CTGATGTCCA CCGAGGCCTC CCAGAACATC
ACCTACCACT GCAAGAACAG CGTGGCCTAC ATGGACCAGC AGACTGGCAA CCTCAAGAAG
GCCCTGCTCC TCCAGGGCTC CAACGAGATC GAGATCCGCG CCGAGGGCAA CAGCCGCTTC
ACCTACAGCG TCACTGTCGA TGGCTGCACG AGTCACACCG GAGCCTGGGG CAAGACAGTG
ATTGAATACA AAACCACCAA GACCTCCCGC CTGCCCATCA TCGATGTGGC CCCCTTGGAC
GTTGGTGCCC CAGACCAGGA ATTCGGCTTC GACGTTGGCC CTGTCTGCTT CCTGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

RefSeq XP_011522643.1
CDS136..4332
Misc Feature(1)253..420(+)
Misc Feature(2)841..1020(+)
Misc Feature(3)967..1146(+)
Misc Feature(4)1057..1299(+)
Misc Feature(5)1291..1542(+)
Misc Feature(6)1462..1623(+)
Misc Feature(7)1543..1758(+)
Misc Feature(8)1939..2112(+)
Misc Feature(9)2038..2214(+)
Misc Feature(10)3619..4329(+)
Translation

Target ORF information:

RefSeq Version XM_011524341
Organism Homo sapiens (human)
Definition PREDICTED: Homo sapiens collagen, type I, alpha 1 (COL1A1), transcript variant X1, mRNA.

Target ORF information:

Epitope DYKDDDDK
Bacterial selection AMPR
Mammalian selection NeoR
Vector pcDNA3.1+/C-(K)DYK
XM_011524341

ORF Insert Sequence:

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGA
AACCCTGGTG CTGATGGACA GCCTGGTGCT AAAGGTGCCA ATGGTGCTCC TGGTATTGCT
GGTGCTCCTG GCTTCCCTGG TGCCCGAGGC CCCTCTGGAC CCCAGGGCCC CGGCGGCCCT
CCTGGTCCCA AGGGTAACAG CGGTGAACCT GGTGCTCCTG GCAGCAAAGG AGACACTGGT
GCTAAGGGAG AGCCTGGCCC TGTTGGTGTT CAAGGACCCC CTGGCCCTGC TGGAGAGGAA
GGAAAGCGAG GAGCTCGAGG TGAACCCGGA CCCACTGGCC TGCCCGGACC CCCTGGCGAG
CGTGGTGGAC CTGGTAGCCG TGGTTTCCCT GGCGCAGATG GTGTTGCTGG TCCCAAGGGT
CCCGCTGGTG AACGTGGTTC TCCTGGCCCT GCTGGCCCCA AAGGATCTCC TGGTGAAGCT
GGTCGTCCCG GTGAAGCTGG TCTGCCTGGT GCCAAGGGTC TGACTGGAAG CCCTGGCAGC
CCTGGTCCTG ATGGCAAAAC TGGCCCCCCT GGTCCCGCCG GTCAAGATGG TCGCCCCGGA
CCCCCAGGCC CACCTGGTGC CCGTGGTCAG GCTGGTGTGA TGGGATTCCC TGGACCTAAA
GGTGCTGCTG GAGAGCCCGG CAAGGCTGGA GAGCGAGGTG TTCCCGGACC CCCTGGCGCT
GTCGGTCCTG CTGGCAAAGA TGGAGAGGCT GGAGCTCAGG GACCCCCTGG CCCTGCTGGT
CCCGCTGGCG AGAGAGGTGA ACAAGGCCCT GCTGGCTCCC CCGGATTCCA GGGTCTCCCT
GGTCCTGCTG GTCCTCCAGG TGAAGCAGGC AAACCTGGTG AACAGGGTGT TCCTGGAGAC
CTTGGCGCCC CTGGCCCCTC TGGAGCAAGA GGCGAGAGAG GTTTCCCTGG CGAGCGTGGT
GTGCAAGGTC CCCCTGGTCC TGCTGGTCCC CGAGGGGCCA ACGGTGCTCC CGGCAACGAT
GGTGCTAAGG GTGATGCTGG TGCCCCTGGA GCTCCCGGTA GCCAGGGCGC CCCTGGCCTT
CAGGGAATGC CTGGTGAACG TGGTGCAGCT GGTCTTCCAG GGCCTAAGGG TGACAGAGGT
GATGCTGGTC CCAAAGGTGC TGATGGCTCT CCTGGCAAAG ATGGCGTCCG TGGTCTGACT
GGCCCCATTG GTCCTCCTGG CCCTGCTGGT GCCCCTGGTG ACAAGGGTGA AAGTGGTCCC
AGCGGCCCTG CTGGTCCCAC TGGAGCTCGT GGTGCCCCCG GAGACCGTGG TGAGCCTGGT
CCCCCCGGCC CTGCTGGCTT TGCTGGCCCC CCTGGTGCTG ACGGCCAACC TGGTGCTAAA
GGCGAACCTG GTGATGCTGG TGCTAAAGGC GATGCTGGTC CCCCTGGCCC TGCCGGACCC
GCTGGACCCC CTGGCCCCAT TGGTAATGTT GGTGCTCCTG GAGCCAAAGG TGCTCGCGGC
AGCGCTGGTC CCCCTGGTGC TACTGGTTTC CCTGGTGCTG CTGGCCGAGT CGGTCCTCCT
GGCCCCTCTG GAAATGCTGG ACCCCCTGGC CCTCCTGGTC CTGCTGGCAA AGAAGGCGGC
AAAGGTCCCC GTGGTGAGAC TGGCCCTGCT GGACGTCCTG GTGAAGTTGG TCCCCCTGGT
CCCCCTGGCC CTGCTGGCGA GAAAGGATCC CCTGGTGCTG ATGGTCCTGC TGGTGCTCCT
GGTACTCCCG GGCCTCAAGG TATTGCTGGA CAGCGTGGTG TGGTCGGCCT GCCTGGTCAG
AGAGGAGAGA GAGGCTTCCC TGGTCTTCCT GGCCCCTCTG GTGAACCTGG CAAACAAGGT
CCCTCTGGAG CAAGTGGTGA ACGTGGTCCC CCTGGTCCCA TGGGCCCCCC TGGATTGGCT
GGACCCCCTG GTGAATCTGG ACGTGAGGGG GCTCCTGGTG CCGAAGGTTC CCCTGGACGA
GACGGTTCTC CTGGCGCCAA GGGTGACCGT GGTGAGACCG GCCCCGCTGG ACCCCCTGGT
GCTCCTGGTG CTCCTGGTGC CCCTGGCCCC GTTGGCCCTG CTGGCAAGAG TGGTGATCGT
GGTGAGACTG GTCCTGCTGG TCCCGCCGGT CCTGTCGGCC CTGTTGGCGC CCGTGGCCCC
GCCGGACCCC AAGGCCCCCG TGGTGACAAG GGTGAGACAG GCGAACAGGG CGACAGAGGC
ATAAAGGGTC ACCGTGGCTT CTCTGGCCTC CAGGGTCCCC CTGGCCCTCC TGGCTCTCCT
GGTGAACAAG GTCCCTCTGG AGCCTCTGGT CCTGCTGGTC CCCGAGGTCC CCCTGGCTCT
GCTGGTGCTC CTGGCAAAGA TGGACTCAAC GGTCTCCCTG GCCCCATTGG GCCCCCTGGT
CCTCGCGGTC GCACTGGTGA TGCTGGTCCT GTTGGTCCCC CCGGCCCTCC TGGACCTCCT
GGTCCCCCTG GTCCTCCCAG CGCTGGTTTC GACTTCAGCT TCCTGCCCCA GCCACCTCAA
GAGAAGGCTC ACGATGGTGG CCGCTACTAC CGGGCTGATG ATGCCAATGT GGTTCGTGAC
CGTGACCTCG AGGTGGACAC CACCCTCAAG AGCCTGAGCC AGCAGATCGA GAACATCCGG
AGCCCAGAGG GCAGCCGCAA GAACCCCGCC CGCACCTGCC GTGACCTCAA GATGTGCCAC
TCTGACTGGA AGAGTGGAGA GTACTGGATT GACCCCAACC AAGGCTGCAA CCTGGATGCC
ATCAAAGTCT TCTGCAACAT GGAGACTGGT GAGACCTGCG TGTACCCCAC TCAGCCCAGT
GTGGCCCAGA AGAACTGGTA CATCAGCAAG AACCCCAAGG ACAAGAGGCA TGTCTGGTTC
GGCGAGAGCA TGACCGATGG ATTCCAGTTC GAGTATGGCG GCCAGGGCTC CGACCCTGCC
GATGTGGCCA TCCAGCTGAC CTTCCTGCGC CTGATGTCCA CCGAGGCCTC CCAGAACATC
ACCTACCACT GCAAGAACAG CGTGGCCTAC ATGGACCAGC AGACTGGCAA CCTCAAGAAG
GCCCTGCTCC TCCAGGGCTC CAACGAGATC GAGATCCGCG CCGAGGGCAA CAGCCGCTTC
ACCTACAGCG TCACTGTCGA TGGCTGCACG AGTCACACCG GAGCCTGGGG CAAGACAGTG
ATTGAATACA AAACCACCAA GACCTCCCGC CTGCCCATCA TCGATGTGGC CCCCTTGGAC
GTTGGTGCCC CAGACCAGGA ATTCGGCTTC GACGTTGGCC CTGTCTGCTT CCTGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

CloneID OHu50114
Clone ID Related Accession (Same CDS sequence) XM_005257058
Accession Version XM_005257058.3 Documents for ORF clone product in dufault vector
Sequence Information ORF Nucleotide Sequence (Length: 4125bp)
Protein sequence
SNP
Vector pcDNA3.1+/C-(K)DYK or customized vector User Manual
Clone information Clone Map MSDS
Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
ORF Insert Method CloneEZ® Seamless cloning technology
Structure linear
Update Date 12-MAR-2015
Organism Homo sapiens (human)
Product collagen alpha-1(I) chain isoform X2
Comment MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NT_010783.16) annotated using gene prediction method: Gnomon, supported by mRNA and EST evidence. Also see: Documentation of NCBI's Annotation Process On Mar 12, 2015 this sequence version replaced gi:578830421. ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Version :: Homo sapiens Annotation Release 107 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 6.2 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END##

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGT
GCTCGTGGAA ATGATGGTGC TACTGGTGCT GCCGGGCCCC CTGGTCCCAC CGGCCCCGCT
GGTCCTCCTG GCTTCCCTGG TGCTGTTGGT GCTAAGGGTG AAGCTGGTCC CCAAGGGCCC
CGAGGCTCTG AAGGTCCCCA GGGTGTGCGT GGTGAGCCTG GCCCCCCTGG CCCTGCTGGT
GCTGCTGGCC CTGCTGGAAA CCCTGGTGCT GATGGACAGC CTGGTGCTAA AGGTGCCAAT
GGTGCTCCTG GTATTGCTGG TGCTCCTGGC TTCCCTGGTG CCCGAGGCCC CTCTGGACCC
CAGGGCCCCG GCGGCCCTCC TGGTCCCAAG GGTAACAGCG GTGAACCTGG TGCTCCTGGC
AGCAAAGGAG ACACTGGTGC TAAGGGAGAG CCTGGCCCTG TTGGTGTTCA AGGACCCCCT
GGCCCTGCTG GAGAGGAAGG AAAGCGAGGA GCTCGAGGTG AACCCGGACC CACTGGCCTG
CCCGGACCCC CTGGCGAGCG TGGTGGACCT GGTAGCCGTG GTTTCCCTGG CGCAGATGGT
GTTGCTGGTC CCAAGGGTCC CGCTGGTGAA CGTGGTTCTC CTGGCCCTGC TGGCCCCAAA
GGATCTCCTG GTGAAGCTGG TCGTCCCGGT GAAGCTGGTC TGCCTGGTGC CAAGGGTCTG
ACTGGAAGCC CTGGCAGCCC TGGTCCTGAT GGCAAAACTG GCCCCCCTGG TCCCGCCGGT
CAAGATGGTC GCCCCGGACC CCCAGGCCCA CCTGGTGCCC GTGGTCAGGC TGGTGTGATG
GGATTCCCTG GACCTAAAGG TGCTGCTGGA GAGCCCGGCA AGGCTGGAGA GCGAGGTGTT
CCCGGACCCC CTGGCGCTGT CGGTCCTGCT GGCAAAGATG GAGAGGCTGG AGCTCAGGGA
CCCCCTGGCC CTGCTGGTCC CGCTGGCGAG AGAGGTGAAC AAGGCCCTGC TGGCTCCCCC
GGATTCCAGG GTCTCCCTGG TCCTGCTGGT CCTCCAGGTG AAGCAGGCAA ACCTGGTGAA
CAGGGTGTTC CTGGAGACCT TGGCGCCCCT GGCCCCTCTG GAGCAAGAGG CGAGAGAGGT
TTCCCTGGCG AGCGTGGTGT GCAAGGTCCC CCTGGTCCTG CTGGTCCCCG AGGGGCCAAC
GGTGCTCCCG GCAACGATGG TGCTAAGGGT GATGCTGGTG CCCCTGGAGC TCCCGGTAGC
CAGGGCGCCC CTGGCCTTCA GGGAATGCCT GGTGAACGTG GTGCAGCTGG TCTTCCAGGG
CCTAAGGGTG ACAGAGGTGA TGCTGGTCCC AAAGGTGCTG ATGGCTCTCC TGGCAAAGAT
GGCGTCCGTG GTCTGACTGG CCCCATTGGT CCTCCTGGCC CTGCTGGTGC CCCTGGTGAC
AAGGGTGAAA GTGGTCCCAG CGGCCCTGCT GGTCCCACTG GAGCTCGTGG TGCCCCCGGA
GACCGTGGTG AGCCTGGTCC CCCCGGCCCT GCTGGCTTTG CTGGCCCCCC TGGTGCTGAC
GGCCAACCTG GTGCTAAAGG CGAACCTGGT GATGCTGGTG CTAAAGGCGA TGCTGGTCCC
CCTGGCCCTG CCGGACCCGC TGGACCCCCT GGCCCCATTG GTAATGTTGG TGCTCCTGGA
GCCAAAGGTG CTCGCGGCAG CGCTGGTCCC CCTGGTGCTA CTGGTTTCCC TGGTGCTGCT
GGCCGAGTCG GTCCTCCTGG CCCCTCTGGT GAACCTGGCA AACAAGGTCC CTCTGGAGCA
AGTGGTGAAC GTGGTCCCCC TGGTCCCATG GGCCCCCCTG GATTGGCTGG ACCCCCTGGT
GAATCTGGAC GTGAGGGGGC TCCTGGTGCC GAAGGTTCCC CTGGACGAGA CGGTTCTCCT
GGCGCCAAGG GTGACCGTGG TGAGACCGGC CCCGCTGGAC CCCCTGGTGC TCCTGGTGCT
CCTGGTGCCC CTGGCCCCGT TGGCCCTGCT GGCAAGAGTG GTGATCGTGG TGAGACTGGT
CCTGCTGGTC CCGCCGGTCC TGTCGGCCCT GTTGGCGCCC GTGGCCCCGC CGGACCCCAA
GGCCCCCGTG GTGACAAGGG TGAGACAGGC GAACAGGGCG ACAGAGGCAT AAAGGGTCAC
CGTGGCTTCT CTGGCCTCCA GGGTCCCCCT GGCCCTCCTG GCTCTCCTGG TGAACAAGGT
CCCTCTGGAG CCTCTGGTCC TGCTGGTCCC CGAGGTCCCC CTGGCTCTGC TGGTGCTCCT
GGCAAAGATG GACTCAACGG TCTCCCTGGC CCCATTGGGC CCCCTGGTCC TCGCGGTCGC
ACTGGTGATG CTGGTCCTGT TGGTCCCCCC GGCCCTCCTG GACCTCCTGG TCCCCCTGGT
CCTCCCAGCG CTGGTTTCGA CTTCAGCTTC CTGCCCCAGC CACCTCAAGA GAAGGCTCAC
GATGGTGGCC GCTACTACCG GGCTGATGAT GCCAATGTGG TTCGTGACCG TGACCTCGAG
GTGGACACCA CCCTCAAGAG CCTGAGCCAG CAGATCGAGA ACATCCGGAG CCCAGAGGGC
AGCCGCAAGA ACCCCGCCCG CACCTGCCGT GACCTCAAGA TGTGCCACTC TGACTGGAAG
AGTGGAGAGT ACTGGATTGA CCCCAACCAA GGCTGCAACC TGGATGCCAT CAAAGTCTTC
TGCAACATGG AGACTGGTGA GACCTGCGTG TACCCCACTC AGCCCAGTGT GGCCCAGAAG
AACTGGTACA TCAGCAAGAA CCCCAAGGAC AAGAGGCATG TCTGGTTCGG CGAGAGCATG
ACCGATGGAT TCCAGTTCGA GTATGGCGGC CAGGGCTCCG ACCCTGCCGA TGTGGCCATC
CAGCTGACCT TCCTGCGCCT GATGTCCACC GAGGCCTCCC AGAACATCAC CTACCACTGC
AAGAACAGCG TGGCCTACAT GGACCAGCAG ACTGGCAACC TCAAGAAGGC CCTGCTCCTC
CAGGGCTCCA ACGAGATCGA GATCCGCGCC GAGGGCAACA GCCGCTTCAC CTACAGCGTC
ACTGTCGATG GCTGCACGAG TCACACCGGA GCCTGGGGCA AGACAGTGAT TGAATACAAA
ACCACCAAGA CCTCCCGCCT GCCCATCATC GATGTGGCCC CCTTGGACGT TGGTGCCCCA
GACCAGGAAT TCGGCTTCGA CGTTGGCCCT GTCTGCTTCC TGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

RefSeq XP_005257115.2
CDS136..4260
Misc Feature(1)253..420(+)
Misc Feature(2)841..1020(+)
Misc Feature(3)949..1107(+)
Misc Feature(4)1489..1740(+)
Misc Feature(5)1660..1821(+)
Misc Feature(6)1741..1956(+)
Misc Feature(7)2137..2310(+)
Misc Feature(8)2236..2412(+)
Misc Feature(9)3547..4257(+)
Translation

Target ORF information:

RefSeq Version XM_005257058
Organism Homo sapiens (human)
Definition PREDICTED: Homo sapiens collagen, type I, alpha 1 (COL1A1), transcript variant X2, mRNA.

Target ORF information:

Epitope DYKDDDDK
Bacterial selection AMPR
Mammalian selection NeoR
Vector pcDNA3.1+/C-(K)DYK
XM_005257058

ORF Insert Sequence:

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGT
GCTCGTGGAA ATGATGGTGC TACTGGTGCT GCCGGGCCCC CTGGTCCCAC CGGCCCCGCT
GGTCCTCCTG GCTTCCCTGG TGCTGTTGGT GCTAAGGGTG AAGCTGGTCC CCAAGGGCCC
CGAGGCTCTG AAGGTCCCCA GGGTGTGCGT GGTGAGCCTG GCCCCCCTGG CCCTGCTGGT
GCTGCTGGCC CTGCTGGAAA CCCTGGTGCT GATGGACAGC CTGGTGCTAA AGGTGCCAAT
GGTGCTCCTG GTATTGCTGG TGCTCCTGGC TTCCCTGGTG CCCGAGGCCC CTCTGGACCC
CAGGGCCCCG GCGGCCCTCC TGGTCCCAAG GGTAACAGCG GTGAACCTGG TGCTCCTGGC
AGCAAAGGAG ACACTGGTGC TAAGGGAGAG CCTGGCCCTG TTGGTGTTCA AGGACCCCCT
GGCCCTGCTG GAGAGGAAGG AAAGCGAGGA GCTCGAGGTG AACCCGGACC CACTGGCCTG
CCCGGACCCC CTGGCGAGCG TGGTGGACCT GGTAGCCGTG GTTTCCCTGG CGCAGATGGT
GTTGCTGGTC CCAAGGGTCC CGCTGGTGAA CGTGGTTCTC CTGGCCCTGC TGGCCCCAAA
GGATCTCCTG GTGAAGCTGG TCGTCCCGGT GAAGCTGGTC TGCCTGGTGC CAAGGGTCTG
ACTGGAAGCC CTGGCAGCCC TGGTCCTGAT GGCAAAACTG GCCCCCCTGG TCCCGCCGGT
CAAGATGGTC GCCCCGGACC CCCAGGCCCA CCTGGTGCCC GTGGTCAGGC TGGTGTGATG
GGATTCCCTG GACCTAAAGG TGCTGCTGGA GAGCCCGGCA AGGCTGGAGA GCGAGGTGTT
CCCGGACCCC CTGGCGCTGT CGGTCCTGCT GGCAAAGATG GAGAGGCTGG AGCTCAGGGA
CCCCCTGGCC CTGCTGGTCC CGCTGGCGAG AGAGGTGAAC AAGGCCCTGC TGGCTCCCCC
GGATTCCAGG GTCTCCCTGG TCCTGCTGGT CCTCCAGGTG AAGCAGGCAA ACCTGGTGAA
CAGGGTGTTC CTGGAGACCT TGGCGCCCCT GGCCCCTCTG GAGCAAGAGG CGAGAGAGGT
TTCCCTGGCG AGCGTGGTGT GCAAGGTCCC CCTGGTCCTG CTGGTCCCCG AGGGGCCAAC
GGTGCTCCCG GCAACGATGG TGCTAAGGGT GATGCTGGTG CCCCTGGAGC TCCCGGTAGC
CAGGGCGCCC CTGGCCTTCA GGGAATGCCT GGTGAACGTG GTGCAGCTGG TCTTCCAGGG
CCTAAGGGTG ACAGAGGTGA TGCTGGTCCC AAAGGTGCTG ATGGCTCTCC TGGCAAAGAT
GGCGTCCGTG GTCTGACTGG CCCCATTGGT CCTCCTGGCC CTGCTGGTGC CCCTGGTGAC
AAGGGTGAAA GTGGTCCCAG CGGCCCTGCT GGTCCCACTG GAGCTCGTGG TGCCCCCGGA
GACCGTGGTG AGCCTGGTCC CCCCGGCCCT GCTGGCTTTG CTGGCCCCCC TGGTGCTGAC
GGCCAACCTG GTGCTAAAGG CGAACCTGGT GATGCTGGTG CTAAAGGCGA TGCTGGTCCC
CCTGGCCCTG CCGGACCCGC TGGACCCCCT GGCCCCATTG GTAATGTTGG TGCTCCTGGA
GCCAAAGGTG CTCGCGGCAG CGCTGGTCCC CCTGGTGCTA CTGGTTTCCC TGGTGCTGCT
GGCCGAGTCG GTCCTCCTGG CCCCTCTGGT GAACCTGGCA AACAAGGTCC CTCTGGAGCA
AGTGGTGAAC GTGGTCCCCC TGGTCCCATG GGCCCCCCTG GATTGGCTGG ACCCCCTGGT
GAATCTGGAC GTGAGGGGGC TCCTGGTGCC GAAGGTTCCC CTGGACGAGA CGGTTCTCCT
GGCGCCAAGG GTGACCGTGG TGAGACCGGC CCCGCTGGAC CCCCTGGTGC TCCTGGTGCT
CCTGGTGCCC CTGGCCCCGT TGGCCCTGCT GGCAAGAGTG GTGATCGTGG TGAGACTGGT
CCTGCTGGTC CCGCCGGTCC TGTCGGCCCT GTTGGCGCCC GTGGCCCCGC CGGACCCCAA
GGCCCCCGTG GTGACAAGGG TGAGACAGGC GAACAGGGCG ACAGAGGCAT AAAGGGTCAC
CGTGGCTTCT CTGGCCTCCA GGGTCCCCCT GGCCCTCCTG GCTCTCCTGG TGAACAAGGT
CCCTCTGGAG CCTCTGGTCC TGCTGGTCCC CGAGGTCCCC CTGGCTCTGC TGGTGCTCCT
GGCAAAGATG GACTCAACGG TCTCCCTGGC CCCATTGGGC CCCCTGGTCC TCGCGGTCGC
ACTGGTGATG CTGGTCCTGT TGGTCCCCCC GGCCCTCCTG GACCTCCTGG TCCCCCTGGT
CCTCCCAGCG CTGGTTTCGA CTTCAGCTTC CTGCCCCAGC CACCTCAAGA GAAGGCTCAC
GATGGTGGCC GCTACTACCG GGCTGATGAT GCCAATGTGG TTCGTGACCG TGACCTCGAG
GTGGACACCA CCCTCAAGAG CCTGAGCCAG CAGATCGAGA ACATCCGGAG CCCAGAGGGC
AGCCGCAAGA ACCCCGCCCG CACCTGCCGT GACCTCAAGA TGTGCCACTC TGACTGGAAG
AGTGGAGAGT ACTGGATTGA CCCCAACCAA GGCTGCAACC TGGATGCCAT CAAAGTCTTC
TGCAACATGG AGACTGGTGA GACCTGCGTG TACCCCACTC AGCCCAGTGT GGCCCAGAAG
AACTGGTACA TCAGCAAGAA CCCCAAGGAC AAGAGGCATG TCTGGTTCGG CGAGAGCATG
ACCGATGGAT TCCAGTTCGA GTATGGCGGC CAGGGCTCCG ACCCTGCCGA TGTGGCCATC
CAGCTGACCT TCCTGCGCCT GATGTCCACC GAGGCCTCCC AGAACATCAC CTACCACTGC
AAGAACAGCG TGGCCTACAT GGACCAGCAG ACTGGCAACC TCAAGAAGGC CCTGCTCCTC
CAGGGCTCCA ACGAGATCGA GATCCGCGCC GAGGGCAACA GCCGCTTCAC CTACAGCGTC
ACTGTCGATG GCTGCACGAG TCACACCGGA GCCTGGGGCA AGACAGTGAT TGAATACAAA
ACCACCAAGA CCTCCCGCCT GCCCATCATC GATGTGGCCC CCTTGGACGT TGGTGCCCCA
GACCAGGAAT TCGGCTTCGA CGTTGGCCCT GTCTGCTTCC TGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

CloneID OHu50115
Clone ID Related Accession (Same CDS sequence) XM_005257059
Accession Version XM_005257059.3 Documents for ORF clone product in dufault vector
Sequence Information ORF Nucleotide Sequence (Length: 3477bp)
Protein sequence
SNP
Vector pcDNA3.1+/C-(K)DYK or customized vector User Manual
Clone information Clone Map MSDS
Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
ORF Insert Method CloneEZ® Seamless cloning technology
Structure linear
Update Date 12-MAR-2015
Organism Homo sapiens (human)
Product collagen alpha-1(I) chain isoform X3
Comment MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NT_010783.16) annotated using gene prediction method: Gnomon, supported by mRNA and EST evidence. Also see: Documentation of NCBI's Annotation Process On Mar 12, 2015 this sequence version replaced gi:578830423. ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Version :: Homo sapiens Annotation Release 107 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 6.2 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END##

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGT
CCCGCTGGCG AGAGAGGTGA ACAAGGCCCT GCTGGCTCCC CCGGATTCCA GGGTCTCCCT
GGTCCTGCTG GTCCTCCAGG TGAAGCAGGC AAACCTGGTG AACAGGGTGT TCCTGGAGAC
CTTGGCGCCC CTGGCCCCTC TGGAGCAAGA GGCGAGAGAG GTTTCCCTGG CGAGCGTGGT
GTGCAAGGTC CCCCTGGTCC TGCTGGTCCC CGAGGGGCCA ACGGTGCTCC CGGCAACGAT
GGTGCTAAGG GTGATGCTGG TGCCCCTGGA GCTCCCGGTA GCCAGGGCGC CCCTGGCCTT
CAGGGAATGC CTGGTGAACG TGGTGCAGCT GGTCTTCCAG GGCCTAAGGG TGACAGAGGT
GATGCTGGTC CCAAAGGTGC TGATGGCTCT CCTGGCAAAG ATGGCGTCCG TGGTCTGACT
GGCCCCATTG GTCCTCCTGG CCCTGCTGGT GCCCCTGGTG ACAAGGGTGA AAGTGGTCCC
AGCGGCCCTG CTGGTCCCAC TGGAGCTCGT GGTGCCCCCG GAGACCGTGG TGAGCCTGGT
CCCCCCGGCC CTGCTGGCTT TGCTGGCCCC CCTGGTGCTG ACGGCCAACC TGGTGCTAAA
GGCGAACCTG GTGATGCTGG TGCTAAAGGC GATGCTGGTC CCCCTGGCCC TGCCGGACCC
GCTGGACCCC CTGGCCCCAT TGGTAATGTT GGTGCTCCTG GAGCCAAAGG TGCTCGCGGC
AGCGCTGGTC CCCCTGGTGC TACTGGTTTC CCTGGTGCTG CTGGCCGAGT CGGTCCTCCT
GGCCCCTCTG GAAATGCTGG ACCCCCTGGC CCTCCTGGTC CTGCTGGCAA AGAAGGCGGC
AAAGGTCCCC GTGGTGAGAC TGGCCCTGCT GGACGTCCTG GTGAAGTTGG TCCCCCTGGT
CCCCCTGGCC CTGCTGGCGA GAAAGGATCC CCTGGTGCTG ATGGTCCTGC TGGTGCTCCT
GGTACTCCCG GGCCTCAAGG TATTGCTGGA CAGCGTGGTG TGGTCGGCCT GCCTGGTCAG
AGAGGAGAGA GAGGCTTCCC TGGTCTTCCT GGCCCCTCTG GTGAACCTGG CAAACAAGGT
CCCTCTGGAG CAAGTGGTGA ACGTGGTCCC CCTGGTCCCA TGGGCCCCCC TGGATTGGCT
GGACCCCCTG GTGAATCTGG ACGTGAGGGG GCTCCTGGTG CCGAAGGTTC CCCTGGACGA
GACGGTTCTC CTGGCGCCAA GGGTGACCGT GGTGAGACCG GCCCCGCTGG ACCCCCTGGT
GCTCCTGGTG CTCCTGGTGC CCCTGGCCCC GTTGGCCCTG CTGGCAAGAG TGGTGATCGT
GGTGAGACTG GTCCTGCTGG TCCCGCCGGT CCTGTCGGCC CTGTTGGCGC CCGTGGCCCC
GCCGGACCCC AAGGCCCCCG TGGTGACAAG GGTGAGACAG GCGAACAGGG CGACAGAGGC
ATAAAGGGTC ACCGTGGCTT CTCTGGCCTC CAGGGTCCCC CTGGCCCTCC TGGCTCTCCT
GGTGAACAAG GTCCCTCTGG AGCCTCTGGT CCTGCTGGTC CCCGAGGTCC CCCTGGCTCT
GCTGGTGCTC CTGGCAAAGA TGGACTCAAC GGTCTCCCTG GCCCCATTGG GCCCCCTGGT
CCTCGCGGTC GCACTGGTGA TGCTGGTCCT GTTGGTCCCC CCGGCCCTCC TGGACCTCCT
GGTCCCCCTG GTCCTCCCAG CGCTGGTTTC GACTTCAGCT TCCTGCCCCA GCCACCTCAA
GAGAAGGCTC ACGATGGTGG CCGCTACTAC CGGGCTGATG ATGCCAATGT GGTTCGTGAC
CGTGACCTCG AGGTGGACAC CACCCTCAAG AGCCTGAGCC AGCAGATCGA GAACATCCGG
AGCCCAGAGG GCAGCCGCAA GAACCCCGCC CGCACCTGCC GTGACCTCAA GATGTGCCAC
TCTGACTGGA AGAGTGGAGA GTACTGGATT GACCCCAACC AAGGCTGCAA CCTGGATGCC
ATCAAAGTCT TCTGCAACAT GGAGACTGGT GAGACCTGCG TGTACCCCAC TCAGCCCAGT
GTGGCCCAGA AGAACTGGTA CATCAGCAAG AACCCCAAGG ACAAGAGGCA TGTCTGGTTC
GGCGAGAGCA TGACCGATGG ATTCCAGTTC GAGTATGGCG GCCAGGGCTC CGACCCTGCC
GATGTGGCCA TCCAGCTGAC CTTCCTGCGC CTGATGTCCA CCGAGGCCTC CCAGAACATC
ACCTACCACT GCAAGAACAG CGTGGCCTAC ATGGACCAGC AGACTGGCAA CCTCAAGAAG
GCCCTGCTCC TCCAGGGCTC CAACGAGATC GAGATCCGCG CCGAGGGCAA CAGCCGCTTC
ACCTACAGCG TCACTGTCGA TGGCTGCACG AGTCACACCG GAGCCTGGGG CAAGACAGTG
ATTGAATACA AAACCACCAA GACCTCCCGC CTGCCCATCA TCGATGTGGC CCCCTTGGAC
GTTGGTGCCC CAGACCAGGA ATTCGGCTTC GACGTTGGCC CTGTCTGCTT CCTGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

RefSeq XP_005257116.2
CDS135..3611
Misc Feature(1)252..419(+)
Misc Feature(2)840..1019(+)
Misc Feature(3)1218..1391(+)
Misc Feature(4)1317..1493(+)
Misc Feature(5)2898..3608(+)
Translation

Target ORF information:

RefSeq Version XM_005257059
Organism Homo sapiens (human)
Definition PREDICTED: Homo sapiens collagen, type I, alpha 1 (COL1A1), transcript variant X3, mRNA.

Target ORF information:

Epitope DYKDDDDK
Bacterial selection AMPR
Mammalian selection NeoR
Vector pcDNA3.1+/C-(K)DYK
XM_005257059

ORF Insert Sequence:

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGT
CCCGCTGGCG AGAGAGGTGA ACAAGGCCCT GCTGGCTCCC CCGGATTCCA GGGTCTCCCT
GGTCCTGCTG GTCCTCCAGG TGAAGCAGGC AAACCTGGTG AACAGGGTGT TCCTGGAGAC
CTTGGCGCCC CTGGCCCCTC TGGAGCAAGA GGCGAGAGAG GTTTCCCTGG CGAGCGTGGT
GTGCAAGGTC CCCCTGGTCC TGCTGGTCCC CGAGGGGCCA ACGGTGCTCC CGGCAACGAT
GGTGCTAAGG GTGATGCTGG TGCCCCTGGA GCTCCCGGTA GCCAGGGCGC CCCTGGCCTT
CAGGGAATGC CTGGTGAACG TGGTGCAGCT GGTCTTCCAG GGCCTAAGGG TGACAGAGGT
GATGCTGGTC CCAAAGGTGC TGATGGCTCT CCTGGCAAAG ATGGCGTCCG TGGTCTGACT
GGCCCCATTG GTCCTCCTGG CCCTGCTGGT GCCCCTGGTG ACAAGGGTGA AAGTGGTCCC
AGCGGCCCTG CTGGTCCCAC TGGAGCTCGT GGTGCCCCCG GAGACCGTGG TGAGCCTGGT
CCCCCCGGCC CTGCTGGCTT TGCTGGCCCC CCTGGTGCTG ACGGCCAACC TGGTGCTAAA
GGCGAACCTG GTGATGCTGG TGCTAAAGGC GATGCTGGTC CCCCTGGCCC TGCCGGACCC
GCTGGACCCC CTGGCCCCAT TGGTAATGTT GGTGCTCCTG GAGCCAAAGG TGCTCGCGGC
AGCGCTGGTC CCCCTGGTGC TACTGGTTTC CCTGGTGCTG CTGGCCGAGT CGGTCCTCCT
GGCCCCTCTG GAAATGCTGG ACCCCCTGGC CCTCCTGGTC CTGCTGGCAA AGAAGGCGGC
AAAGGTCCCC GTGGTGAGAC TGGCCCTGCT GGACGTCCTG GTGAAGTTGG TCCCCCTGGT
CCCCCTGGCC CTGCTGGCGA GAAAGGATCC CCTGGTGCTG ATGGTCCTGC TGGTGCTCCT
GGTACTCCCG GGCCTCAAGG TATTGCTGGA CAGCGTGGTG TGGTCGGCCT GCCTGGTCAG
AGAGGAGAGA GAGGCTTCCC TGGTCTTCCT GGCCCCTCTG GTGAACCTGG CAAACAAGGT
CCCTCTGGAG CAAGTGGTGA ACGTGGTCCC CCTGGTCCCA TGGGCCCCCC TGGATTGGCT
GGACCCCCTG GTGAATCTGG ACGTGAGGGG GCTCCTGGTG CCGAAGGTTC CCCTGGACGA
GACGGTTCTC CTGGCGCCAA GGGTGACCGT GGTGAGACCG GCCCCGCTGG ACCCCCTGGT
GCTCCTGGTG CTCCTGGTGC CCCTGGCCCC GTTGGCCCTG CTGGCAAGAG TGGTGATCGT
GGTGAGACTG GTCCTGCTGG TCCCGCCGGT CCTGTCGGCC CTGTTGGCGC CCGTGGCCCC
GCCGGACCCC AAGGCCCCCG TGGTGACAAG GGTGAGACAG GCGAACAGGG CGACAGAGGC
ATAAAGGGTC ACCGTGGCTT CTCTGGCCTC CAGGGTCCCC CTGGCCCTCC TGGCTCTCCT
GGTGAACAAG GTCCCTCTGG AGCCTCTGGT CCTGCTGGTC CCCGAGGTCC CCCTGGCTCT
GCTGGTGCTC CTGGCAAAGA TGGACTCAAC GGTCTCCCTG GCCCCATTGG GCCCCCTGGT
CCTCGCGGTC GCACTGGTGA TGCTGGTCCT GTTGGTCCCC CCGGCCCTCC TGGACCTCCT
GGTCCCCCTG GTCCTCCCAG CGCTGGTTTC GACTTCAGCT TCCTGCCCCA GCCACCTCAA
GAGAAGGCTC ACGATGGTGG CCGCTACTAC CGGGCTGATG ATGCCAATGT GGTTCGTGAC
CGTGACCTCG AGGTGGACAC CACCCTCAAG AGCCTGAGCC AGCAGATCGA GAACATCCGG
AGCCCAGAGG GCAGCCGCAA GAACCCCGCC CGCACCTGCC GTGACCTCAA GATGTGCCAC
TCTGACTGGA AGAGTGGAGA GTACTGGATT GACCCCAACC AAGGCTGCAA CCTGGATGCC
ATCAAAGTCT TCTGCAACAT GGAGACTGGT GAGACCTGCG TGTACCCCAC TCAGCCCAGT
GTGGCCCAGA AGAACTGGTA CATCAGCAAG AACCCCAAGG ACAAGAGGCA TGTCTGGTTC
GGCGAGAGCA TGACCGATGG ATTCCAGTTC GAGTATGGCG GCCAGGGCTC CGACCCTGCC
GATGTGGCCA TCCAGCTGAC CTTCCTGCGC CTGATGTCCA CCGAGGCCTC CCAGAACATC
ACCTACCACT GCAAGAACAG CGTGGCCTAC ATGGACCAGC AGACTGGCAA CCTCAAGAAG
GCCCTGCTCC TCCAGGGCTC CAACGAGATC GAGATCCGCG CCGAGGGCAA CAGCCGCTTC
ACCTACAGCG TCACTGTCGA TGGCTGCACG AGTCACACCG GAGCCTGGGG CAAGACAGTG
ATTGAATACA AAACCACCAA GACCTCCCGC CTGCCCATCA TCGATGTGGC CCCCTTGGAC
GTTGGTGCCC CAGACCAGGA ATTCGGCTTC GACGTTGGCC CTGTCTGCTT CCTGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

CloneID OHu25462
Clone ID Related Accession (Same CDS sequence) NM_000088
Accession Version NM_000088.3 Documents for ORF clone product in dufault vector
Sequence Information ORF Nucleotide Sequence (Length: 4395bp)
Protein sequence
SNP
Vector pcDNA3.1+/C-(K)DYK or customized vector User Manual
Clone information Clone Map MSDS
Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
ORF Insert Method CloneEZ® Seamless cloning technology
Structure linear
Update Date 21-MAY-2015
Organism Homo sapiens (human)
Product collagen alpha-1(I) chain preproprotein
Comment REVIEWED REFSEQ: This record has been curated by NCBI staff. The reference sequence was derived from AC015909.14 and Z74615.1. This sequence is a reference standard in the RefSeqGene project. On Jul 14, 2006 this sequence version replaced gi:14719826. Publication Note: This RefSeq record includes a subset of the publications that are available for this gene. Please see the Gene record to access additional publications. ##Evidence-Data-START## Transcript exon combination :: Z74615.1, BC036531.2 [ECO:0000332] RNAseq introns :: single sample supports all introns SAMEA1968189, SAMEA1968540 [ECO:0000348] ##Evidence-Data-END## COMPLETENESS: full length.

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
4201
4261
4321
4381
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGT
GCTCGTGGAA ATGATGGTGC TACTGGTGCT GCCGGGCCCC CTGGTCCCAC CGGCCCCGCT
GGTCCTCCTG GCTTCCCTGG TGCTGTTGGT GCTAAGGGTG AAGCTGGTCC CCAAGGGCCC
CGAGGCTCTG AAGGTCCCCA GGGTGTGCGT GGTGAGCCTG GCCCCCCTGG CCCTGCTGGT
GCTGCTGGCC CTGCTGGAAA CCCTGGTGCT GATGGACAGC CTGGTGCTAA AGGTGCCAAT
GGTGCTCCTG GTATTGCTGG TGCTCCTGGC TTCCCTGGTG CCCGAGGCCC CTCTGGACCC
CAGGGCCCCG GCGGCCCTCC TGGTCCCAAG GGTAACAGCG GTGAACCTGG TGCTCCTGGC
AGCAAAGGAG ACACTGGTGC TAAGGGAGAG CCTGGCCCTG TTGGTGTTCA AGGACCCCCT
GGCCCTGCTG GAGAGGAAGG AAAGCGAGGA GCTCGAGGTG AACCCGGACC CACTGGCCTG
CCCGGACCCC CTGGCGAGCG TGGTGGACCT GGTAGCCGTG GTTTCCCTGG CGCAGATGGT
GTTGCTGGTC CCAAGGGTCC CGCTGGTGAA CGTGGTTCTC CTGGCCCTGC TGGCCCCAAA
GGATCTCCTG GTGAAGCTGG TCGTCCCGGT GAAGCTGGTC TGCCTGGTGC CAAGGGTCTG
ACTGGAAGCC CTGGCAGCCC TGGTCCTGAT GGCAAAACTG GCCCCCCTGG TCCCGCCGGT
CAAGATGGTC GCCCCGGACC CCCAGGCCCA CCTGGTGCCC GTGGTCAGGC TGGTGTGATG
GGATTCCCTG GACCTAAAGG TGCTGCTGGA GAGCCCGGCA AGGCTGGAGA GCGAGGTGTT
CCCGGACCCC CTGGCGCTGT CGGTCCTGCT GGCAAAGATG GAGAGGCTGG AGCTCAGGGA
CCCCCTGGCC CTGCTGGTCC CGCTGGCGAG AGAGGTGAAC AAGGCCCTGC TGGCTCCCCC
GGATTCCAGG GTCTCCCTGG TCCTGCTGGT CCTCCAGGTG AAGCAGGCAA ACCTGGTGAA
CAGGGTGTTC CTGGAGACCT TGGCGCCCCT GGCCCCTCTG GAGCAAGAGG CGAGAGAGGT
TTCCCTGGCG AGCGTGGTGT GCAAGGTCCC CCTGGTCCTG CTGGTCCCCG AGGGGCCAAC
GGTGCTCCCG GCAACGATGG TGCTAAGGGT GATGCTGGTG CCCCTGGAGC TCCCGGTAGC
CAGGGCGCCC CTGGCCTTCA GGGAATGCCT GGTGAACGTG GTGCAGCTGG TCTTCCAGGG
CCTAAGGGTG ACAGAGGTGA TGCTGGTCCC AAAGGTGCTG ATGGCTCTCC TGGCAAAGAT
GGCGTCCGTG GTCTGACTGG CCCCATTGGT CCTCCTGGCC CTGCTGGTGC CCCTGGTGAC
AAGGGTGAAA GTGGTCCCAG CGGCCCTGCT GGTCCCACTG GAGCTCGTGG TGCCCCCGGA
GACCGTGGTG AGCCTGGTCC CCCCGGCCCT GCTGGCTTTG CTGGCCCCCC TGGTGCTGAC
GGCCAACCTG GTGCTAAAGG CGAACCTGGT GATGCTGGTG CTAAAGGCGA TGCTGGTCCC
CCTGGCCCTG CCGGACCCGC TGGACCCCCT GGCCCCATTG GTAATGTTGG TGCTCCTGGA
GCCAAAGGTG CTCGCGGCAG CGCTGGTCCC CCTGGTGCTA CTGGTTTCCC TGGTGCTGCT
GGCCGAGTCG GTCCTCCTGG CCCCTCTGGA AATGCTGGAC CCCCTGGCCC TCCTGGTCCT
GCTGGCAAAG AAGGCGGCAA AGGTCCCCGT GGTGAGACTG GCCCTGCTGG ACGTCCTGGT
GAAGTTGGTC CCCCTGGTCC CCCTGGCCCT GCTGGCGAGA AAGGATCCCC TGGTGCTGAT
GGTCCTGCTG GTGCTCCTGG TACTCCCGGG CCTCAAGGTA TTGCTGGACA GCGTGGTGTG
GTCGGCCTGC CTGGTCAGAG AGGAGAGAGA GGCTTCCCTG GTCTTCCTGG CCCCTCTGGT
GAACCTGGCA AACAAGGTCC CTCTGGAGCA AGTGGTGAAC GTGGTCCCCC TGGTCCCATG
GGCCCCCCTG GATTGGCTGG ACCCCCTGGT GAATCTGGAC GTGAGGGGGC TCCTGGTGCC
GAAGGTTCCC CTGGACGAGA CGGTTCTCCT GGCGCCAAGG GTGACCGTGG TGAGACCGGC
CCCGCTGGAC CCCCTGGTGC TCCTGGTGCT CCTGGTGCCC CTGGCCCCGT TGGCCCTGCT
GGCAAGAGTG GTGATCGTGG TGAGACTGGT CCTGCTGGTC CCGCCGGTCC TGTCGGCCCT
GTTGGCGCCC GTGGCCCCGC CGGACCCCAA GGCCCCCGTG GTGACAAGGG TGAGACAGGC
GAACAGGGCG ACAGAGGCAT AAAGGGTCAC CGTGGCTTCT CTGGCCTCCA GGGTCCCCCT
GGCCCTCCTG GCTCTCCTGG TGAACAAGGT CCCTCTGGAG CCTCTGGTCC TGCTGGTCCC
CGAGGTCCCC CTGGCTCTGC TGGTGCTCCT GGCAAAGATG GACTCAACGG TCTCCCTGGC
CCCATTGGGC CCCCTGGTCC TCGCGGTCGC ACTGGTGATG CTGGTCCTGT TGGTCCCCCC
GGCCCTCCTG GACCTCCTGG TCCCCCTGGT CCTCCCAGCG CTGGTTTCGA CTTCAGCTTC
CTGCCCCAGC CACCTCAAGA GAAGGCTCAC GATGGTGGCC GCTACTACCG GGCTGATGAT
GCCAATGTGG TTCGTGACCG TGACCTCGAG GTGGACACCA CCCTCAAGAG CCTGAGCCAG
CAGATCGAGA ACATCCGGAG CCCAGAGGGC AGCCGCAAGA ACCCCGCCCG CACCTGCCGT
GACCTCAAGA TGTGCCACTC TGACTGGAAG AGTGGAGAGT ACTGGATTGA CCCCAACCAA
GGCTGCAACC TGGATGCCAT CAAAGTCTTC TGCAACATGG AGACTGGTGA GACCTGCGTG
TACCCCACTC AGCCCAGTGT GGCCCAGAAG AACTGGTACA TCAGCAAGAA CCCCAAGGAC
AAGAGGCATG TCTGGTTCGG CGAGAGCATG ACCGATGGAT TCCAGTTCGA GTATGGCGGC
CAGGGCTCCG ACCCTGCCGA TGTGGCCATC CAGCTGACCT TCCTGCGCCT GATGTCCACC
GAGGCCTCCC AGAACATCAC CTACCACTGC AAGAACAGCG TGGCCTACAT GGACCAGCAG
ACTGGCAACC TCAAGAAGGC CCTGCTCCTC CAGGGCTCCA ACGAGATCGA GATCCGCGCC
GAGGGCAACA GCCGCTTCAC CTACAGCGTC ACTGTCGATG GCTGCACGAG TCACACCGGA
GCCTGGGGCA AGACAGTGAT TGAATACAAA ACCACCAAGA CCTCCCGCCT GCCCATCATC
GATGTGGCCC CCTTGGACGT TGGTGCCCCA GACCAGGAAT TCGGCTTCGA CGTTGGCCCT
GTCTGCTTCC TGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

RefSeq NP_000079.2
CDS127..4521
Misc Feature(1)115..117(+)
Misc Feature(2)244..411(+)
Misc Feature(3)397..399(+)
Misc Feature(4)478..480(+)
Misc Feature(5)607..612(+)
Misc Feature(6)610..660(+)
Misc Feature(7)610..612(+)
Misc Feature(8)634..636(+)
Misc Feature(9)661..3702(+)
Misc Feature(10)832..1011(+)
Misc Feature(11)919..921(+)
Misc Feature(12)940..1098(+)
Misc Feature(13)1480..1731(+)
Misc Feature(14)1651..1812(+)
Misc Feature(15)1732..1911(+)
Misc Feature(16)2128..2301(+)
Misc Feature(17)2227..2403(+)
Misc Feature(18)2359..2367(+)
Misc Feature(19)2983..2988(+)
Misc Feature(20)3403..3411(+)
Misc Feature(21)3448..3450(+)
Misc Feature(22)3616..3618(+)
Misc Feature(23)3703..3780(+)
Misc Feature(24)3748..3750(+)
Misc Feature(25)3778..3783(+)
Misc Feature(26)3808..4518(+)
Exon (1)1..229
Gene:COL1A1
Gene Synonym:
Exon (2)230..424
Gene:COL1A1
Gene Synonym:
Exon (3)425..459
Gene:COL1A1
Gene Synonym:
Exon (4)460..495
Gene:COL1A1
Gene Synonym:
Exon (5)496..597
Gene:COL1A1
Gene Synonym:
Exon (6)598..669
Gene:COL1A1
Gene Synonym:
Exon (7)670..714
Gene:COL1A1
Gene Synonym:
Exon (8)715..768
Gene:COL1A1
Gene Synonym:
Exon (9)769..822
Gene:COL1A1
Gene Synonym:
Exon (10)823..876
Gene:COL1A1
Gene Synonym:
Exon (11)877..930
Gene:COL1A1
Gene Synonym:
Exon (12)931..984
Gene:COL1A1
Gene Synonym:
Exon (13)985..1029
Gene:COL1A1
Gene Synonym:
Exon (14)1030..1083
Gene:COL1A1
Gene Synonym:
Exon (15)1084..1128
Gene:COL1A1
Gene Synonym:
Exon (16)1129..1182
Gene:COL1A1
Gene Synonym:
Exon (17)1183..1281
Gene:COL1A1
Gene Synonym:
Exon (18)1282..1326
Gene:COL1A1
Gene Synonym:
Exon (19)1327..1425
Gene:COL1A1
Gene Synonym:
Exon (20)1426..1479
Gene:COL1A1
Gene Synonym:
Exon (21)1480..1587
Gene:COL1A1
Gene Synonym:
Exon (22)1588..1641
Gene:COL1A1
Gene Synonym:
Exon (23)1642..1740
Gene:COL1A1
Gene Synonym:
Exon (24)1741..1794
Gene:COL1A1
Gene Synonym:
Exon (25)1795..1893
Gene:COL1A1
Gene Synonym:
Exon (26)1894..1947
Gene:COL1A1
Gene Synonym:
Exon (27)1948..2001
Gene:COL1A1
Gene Synonym:
Exon (28)2002..2055
Gene:COL1A1
Gene Synonym:
Exon (29)2056..2109
Gene:COL1A1
Gene Synonym:
Exon (30)2110..2154
Gene:COL1A1
Gene Synonym:
Exon (31)2155..2253
Gene:COL1A1
Gene Synonym:
Exon (32)2254..2361
Gene:COL1A1
Gene Synonym:
Exon (33)2362..2469
Gene:COL1A1
Gene Synonym:
Exon (34)2470..2523
Gene:COL1A1
Gene Synonym:
Exon (35)2524..2577
Gene:COL1A1
Gene Synonym:
Exon (36)2578..2685
Gene:COL1A1
Gene Synonym:
Exon (37)2686..2739
Gene:COL1A1
Gene Synonym:
Exon (38)2740..2793
Gene:COL1A1
Gene Synonym:
Exon (39)2794..2955
Gene:COL1A1
Gene Synonym:
Exon (40)2956..3063
Gene:COL1A1
Gene Synonym:
Exon (41)3064..3171
Gene:COL1A1
Gene Synonym:
Exon (42)3172..3225
Gene:COL1A1
Gene Synonym:
Exon (43)3226..3333
Gene:COL1A1
Gene Synonym:
Exon (44)3334..3387
Gene:COL1A1
Gene Synonym:
Exon (45)3388..3495
Gene:COL1A1
Gene Synonym:
Exon (46)3496..3549
Gene:COL1A1
Gene Synonym:
Exon (47)3550..3657
Gene:COL1A1
Gene Synonym:
Exon (48)3658..3940
Gene:COL1A1
Gene Synonym:
Exon (49)3941..4131
Gene:COL1A1
Gene Synonym:
Exon (50)4132..4374
Gene:COL1A1
Gene Synonym:
Exon (51)4375..5927
Gene:COL1A1
Gene Synonym:
Translation

Target ORF information:

RefSeq Version NM_000088
Organism Homo sapiens (human)
Definition Homo sapiens collagen, type I, alpha 1 (COL1A1), mRNA.

Target ORF information:

Epitope DYKDDDDK
Bacterial selection AMPR
Mammalian selection NeoR
Vector pcDNA3.1+/C-(K)DYK
NM_000088

ORF Insert Sequence:

1
61
121
181
241
301
361
421
481
541
601
661
721
781
841
901
961
1021
1081
1141
1201
1261
1321
1381
1441
1501
1561
1621
1681
1741
1801
1861
1921
1981
2041
2101
2161
2221
2281
2341
2401
2461
2521
2581
2641
2701
2761
2821
2881
2941
3001
3061
3121
3181
3241
3301
3361
3421
3481
3541
3601
3661
3721
3781
3841
3901
3961
4021
4081
4141
4201
4261
4321
4381
ATGTTCAGCT TTGTGGACCT CCGGCTCCTG CTCCTCTTAG CGGCCACCGC CCTCCTGACG 
CACGGCCAAG AGGAAGGCCA AGTCGAGGGC CAAGACGAAG ACATCCCACC AATCACCTGC
GTACAGAACG GCCTCAGGTA CCATGACCGA GACGTGTGGA AACCCGAGCC CTGCCGGATC
TGCGTCTGCG ACAACGGCAA GGTGTTGTGC GATGACGTGA TCTGTGACGA GACCAAGAAC
TGCCCCGGCG CCGAAGTCCC CGAGGGCGAG TGCTGTCCCG TCTGCCCCGA CGGCTCAGAG
TCACCCACCG ACCAAGAAAC CACCGGCGTC GAGGGACCCA AGGGAGACAC TGGCCCCCGA
GGCCCAAGGG GACCCGCAGG CCCCCCTGGC CGAGATGGCA TCCCTGGACA GCCTGGACTT
CCCGGACCCC CCGGACCCCC CGGACCTCCC GGACCCCCTG GCCTCGGAGG AAACTTTGCT
CCCCAGCTGT CTTATGGCTA TGATGAGAAA TCAACCGGAG GAATTTCCGT GCCTGGCCCC
ATGGGTCCCT CTGGTCCTCG TGGTCTCCCT GGCCCCCCTG GTGCACCTGG TCCCCAAGGC
TTCCAAGGTC CCCCTGGTGA GCCTGGCGAG CCTGGAGCTT CAGGTCCCAT GGGTCCCCGA
GGTCCCCCAG GTCCCCCTGG AAAGAATGGA GATGATGGGG AAGCTGGAAA ACCTGGTCGT
CCTGGTGAGC GTGGGCCTCC TGGGCCTCAG GGTGCTCGAG GATTGCCCGG AACAGCTGGC
CTCCCTGGAA TGAAGGGACA CAGAGGTTTC AGTGGTTTGG ATGGTGCCAA GGGAGATGCT
GGTCCTGCTG GTCCTAAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG
ATGGGCCCCC GTGGCCTGCC TGGTGAGAGA GGTCGCCCTG GAGCCCCTGG CCCTGCTGGT
GCTCGTGGAA ATGATGGTGC TACTGGTGCT GCCGGGCCCC CTGGTCCCAC CGGCCCCGCT
GGTCCTCCTG GCTTCCCTGG TGCTGTTGGT GCTAAGGGTG AAGCTGGTCC CCAAGGGCCC
CGAGGCTCTG AAGGTCCCCA GGGTGTGCGT GGTGAGCCTG GCCCCCCTGG CCCTGCTGGT
GCTGCTGGCC CTGCTGGAAA CCCTGGTGCT GATGGACAGC CTGGTGCTAA AGGTGCCAAT
GGTGCTCCTG GTATTGCTGG TGCTCCTGGC TTCCCTGGTG CCCGAGGCCC CTCTGGACCC
CAGGGCCCCG GCGGCCCTCC TGGTCCCAAG GGTAACAGCG GTGAACCTGG TGCTCCTGGC
AGCAAAGGAG ACACTGGTGC TAAGGGAGAG CCTGGCCCTG TTGGTGTTCA AGGACCCCCT
GGCCCTGCTG GAGAGGAAGG AAAGCGAGGA GCTCGAGGTG AACCCGGACC CACTGGCCTG
CCCGGACCCC CTGGCGAGCG TGGTGGACCT GGTAGCCGTG GTTTCCCTGG CGCAGATGGT
GTTGCTGGTC CCAAGGGTCC CGCTGGTGAA CGTGGTTCTC CTGGCCCTGC TGGCCCCAAA
GGATCTCCTG GTGAAGCTGG TCGTCCCGGT GAAGCTGGTC TGCCTGGTGC CAAGGGTCTG
ACTGGAAGCC CTGGCAGCCC TGGTCCTGAT GGCAAAACTG GCCCCCCTGG TCCCGCCGGT
CAAGATGGTC GCCCCGGACC CCCAGGCCCA CCTGGTGCCC GTGGTCAGGC TGGTGTGATG
GGATTCCCTG GACCTAAAGG TGCTGCTGGA GAGCCCGGCA AGGCTGGAGA GCGAGGTGTT
CCCGGACCCC CTGGCGCTGT CGGTCCTGCT GGCAAAGATG GAGAGGCTGG AGCTCAGGGA
CCCCCTGGCC CTGCTGGTCC CGCTGGCGAG AGAGGTGAAC AAGGCCCTGC TGGCTCCCCC
GGATTCCAGG GTCTCCCTGG TCCTGCTGGT CCTCCAGGTG AAGCAGGCAA ACCTGGTGAA
CAGGGTGTTC CTGGAGACCT TGGCGCCCCT GGCCCCTCTG GAGCAAGAGG CGAGAGAGGT
TTCCCTGGCG AGCGTGGTGT GCAAGGTCCC CCTGGTCCTG CTGGTCCCCG AGGGGCCAAC
GGTGCTCCCG GCAACGATGG TGCTAAGGGT GATGCTGGTG CCCCTGGAGC TCCCGGTAGC
CAGGGCGCCC CTGGCCTTCA GGGAATGCCT GGTGAACGTG GTGCAGCTGG TCTTCCAGGG
CCTAAGGGTG ACAGAGGTGA TGCTGGTCCC AAAGGTGCTG ATGGCTCTCC TGGCAAAGAT
GGCGTCCGTG GTCTGACTGG CCCCATTGGT CCTCCTGGCC CTGCTGGTGC CCCTGGTGAC
AAGGGTGAAA GTGGTCCCAG CGGCCCTGCT GGTCCCACTG GAGCTCGTGG TGCCCCCGGA
GACCGTGGTG AGCCTGGTCC CCCCGGCCCT GCTGGCTTTG CTGGCCCCCC TGGTGCTGAC
GGCCAACCTG GTGCTAAAGG CGAACCTGGT GATGCTGGTG CTAAAGGCGA TGCTGGTCCC
CCTGGCCCTG CCGGACCCGC TGGACCCCCT GGCCCCATTG GTAATGTTGG TGCTCCTGGA
GCCAAAGGTG CTCGCGGCAG CGCTGGTCCC CCTGGTGCTA CTGGTTTCCC TGGTGCTGCT
GGCCGAGTCG GTCCTCCTGG CCCCTCTGGA AATGCTGGAC CCCCTGGCCC TCCTGGTCCT
GCTGGCAAAG AAGGCGGCAA AGGTCCCCGT GGTGAGACTG GCCCTGCTGG ACGTCCTGGT
GAAGTTGGTC CCCCTGGTCC CCCTGGCCCT GCTGGCGAGA AAGGATCCCC TGGTGCTGAT
GGTCCTGCTG GTGCTCCTGG TACTCCCGGG CCTCAAGGTA TTGCTGGACA GCGTGGTGTG
GTCGGCCTGC CTGGTCAGAG AGGAGAGAGA GGCTTCCCTG GTCTTCCTGG CCCCTCTGGT
GAACCTGGCA AACAAGGTCC CTCTGGAGCA AGTGGTGAAC GTGGTCCCCC TGGTCCCATG
GGCCCCCCTG GATTGGCTGG ACCCCCTGGT GAATCTGGAC GTGAGGGGGC TCCTGGTGCC
GAAGGTTCCC CTGGACGAGA CGGTTCTCCT GGCGCCAAGG GTGACCGTGG TGAGACCGGC
CCCGCTGGAC CCCCTGGTGC TCCTGGTGCT CCTGGTGCCC CTGGCCCCGT TGGCCCTGCT
GGCAAGAGTG GTGATCGTGG TGAGACTGGT CCTGCTGGTC CCGCCGGTCC TGTCGGCCCT
GTTGGCGCCC GTGGCCCCGC CGGACCCCAA GGCCCCCGTG GTGACAAGGG TGAGACAGGC
GAACAGGGCG ACAGAGGCAT AAAGGGTCAC CGTGGCTTCT CTGGCCTCCA GGGTCCCCCT
GGCCCTCCTG GCTCTCCTGG TGAACAAGGT CCCTCTGGAG CCTCTGGTCC TGCTGGTCCC
CGAGGTCCCC CTGGCTCTGC TGGTGCTCCT GGCAAAGATG GACTCAACGG TCTCCCTGGC
CCCATTGGGC CCCCTGGTCC TCGCGGTCGC ACTGGTGATG CTGGTCCTGT TGGTCCCCCC
GGCCCTCCTG GACCTCCTGG TCCCCCTGGT CCTCCCAGCG CTGGTTTCGA CTTCAGCTTC
CTGCCCCAGC CACCTCAAGA GAAGGCTCAC GATGGTGGCC GCTACTACCG GGCTGATGAT
GCCAATGTGG TTCGTGACCG TGACCTCGAG GTGGACACCA CCCTCAAGAG CCTGAGCCAG
CAGATCGAGA ACATCCGGAG CCCAGAGGGC AGCCGCAAGA ACCCCGCCCG CACCTGCCGT
GACCTCAAGA TGTGCCACTC TGACTGGAAG AGTGGAGAGT ACTGGATTGA CCCCAACCAA
GGCTGCAACC TGGATGCCAT CAAAGTCTTC TGCAACATGG AGACTGGTGA GACCTGCGTG
TACCCCACTC AGCCCAGTGT GGCCCAGAAG AACTGGTACA TCAGCAAGAA CCCCAAGGAC
AAGAGGCATG TCTGGTTCGG CGAGAGCATG ACCGATGGAT TCCAGTTCGA GTATGGCGGC
CAGGGCTCCG ACCCTGCCGA TGTGGCCATC CAGCTGACCT TCCTGCGCCT GATGTCCACC
GAGGCCTCCC AGAACATCAC CTACCACTGC AAGAACAGCG TGGCCTACAT GGACCAGCAG
ACTGGCAACC TCAAGAAGGC CCTGCTCCTC CAGGGCTCCA ACGAGATCGA GATCCGCGCC
GAGGGCAACA GCCGCTTCAC CTACAGCGTC ACTGTCGATG GCTGCACGAG TCACACCGGA
GCCTGGGGCA AGACAGTGAT TGAATACAAA ACCACCAAGA CCTCCCGCCT GCCCATCATC
GATGTGGCCC CCTTGGACGT TGGTGCCCCA GACCAGGAAT TCGGCTTCGA CGTTGGCCCT
GTCTGCTTCC TGTAA

The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

 
*
*
*
*