FB2024_04 , released June 25, 2024
Gene: Dmel\SmE
Open Close
General Information
Symbol
Dmel\SmE
Species
D. melanogaster
Name
Small ribonucleoprotein particle protein SmE
Annotation Symbol
CG18591
Feature Type
FlyBase ID
FBgn0261790
Gene Model Status
Stock Availability
Gene Summary
Small ribonucleoprotein particle protein SmE (SmE) encodes an RNA binding protein that functions as part of a heteroheptameric ring structure called the Sm core. This complex binds to spliceosomal small nuclear RNAs (e.g. U1, U2, U4 and U5) and helps to carry out pre-mRNA splicing. It also forms a heterotrimeric sub-complex with the products of SmF and SNRPG. [Date last reviewed: 2019-03-14] (FlyBase Gene Snapshot)
Also Known As

snRNPE

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
2-29
RefSeq locus
NT_033779 REGION:7884930..7885450
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (14 terms)
Molecular Function (1 term)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
enables RNA binding
inferred from electronic annotation with InterPro:IPR047575
Biological Process (2 terms)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (2 terms)
CV Term
Evidence
References
inferred from sequence or structural similarity with SGD:S000005685
inferred from electronic annotation with InterPro:IPR027078
inferred by curator from GO:0071011,GO:0071013
inferred from biological aspect of ancestor with PANTHER:PTN000127555
Cellular Component (11 terms)
Terms Based on Experimental Evidence (4 terms)
CV Term
Evidence
References
inferred from high throughput direct assay
located_in nucleus
inferred from direct assay
inferred from high throughput direct assay
Terms Based on Predictions or Assertions (8 terms)
CV Term
Evidence
References
inferred from biological aspect of ancestor with PANTHER:PTN000127555
inferred from biological aspect of ancestor with PANTHER:PTN000127555
inferred from sequence or structural similarity with SGD:S000005685
inferred from electronic annotation with InterPro:IPR027078
part_of U1 snRNP
inferred from biological aspect of ancestor with PANTHER:PTN000127555
part_of U2 snRNP
inferred from biological aspect of ancestor with PANTHER:PTN000127555
part_of U4 snRNP
inferred from biological aspect of ancestor with PANTHER:PTN000127555
inferred from biological aspect of ancestor with PANTHER:PTN000127555
part_of U5 snRNP
inferred from biological aspect of ancestor with PANTHER:PTN000127555
Protein Family (UniProt)
Belongs to the snRNP Sm proteins family. (Q9VLV5)
Summaries
Gene Snapshot
Small ribonucleoprotein particle protein SmE (SmE) encodes an RNA binding protein that functions as part of a heteroheptameric ring structure called the Sm core. This complex binds to spliceosomal small nuclear RNAs (e.g. U1, U2, U4 and U5) and helps to carry out pre-mRNA splicing. It also forms a heterotrimeric sub-complex with the products of SmF and SNRPG. [Date last reviewed: 2019-03-14]
Gene Group (FlyBase)
U1 SMALL NUCLEAR RIBONUCLEOPROTEIN PARTICLE -
The U1 small nuclear ribonucleoprotein particle (U1 snRNP) contains U1 snRNA and initiates spliceosome assembly by binding to the 5' splice site in pre-mRNA. (Adapted from PMID:11206553 and PMID:21441581).
U2 SMALL NUCLEAR RIBONUCLEOPROTEIN PARTICLE -
The U2 small nuclear ribonucleoprotein particle (U2 snRNP) contains U2 snRNA. It is recruited to the spliceosome after U1 snRNP and forms a stable interaction with the branch site and 3' splice site in pre-mRNA. (Adapted from PMID:23829528 and PMID:21441581).
U5 SMALL NUCLEAR RIBONUCLEOPROTEIN PARTICLE -
The U5 small nuclear ribonucleoprotein particle (U5 snRNP) contains U5 RNA and assembles with U4-U6 snRNP to form the U4-U6.U5 tri-snRNP. U5 snRNA interacts with the 5' and 3' exons. (Adapted from PMID:21441581).
U4-U6 SMALL NUCLEAR RIBONUCLEOPROTEIN PARTICLE -
The U4/U6 small nuclear ribonucleoprotein particle (U4/U6 snRNP) is a complex that contains base-paired U4 and U6 snRNAs. U4/U6 snRNP assembles with U5 to form the U4/U6.U5 tri-snRNP. (Adapted from PMID:21441581).
U4-U6-U5 SMALL NUCLEAR RIBONUCLEOPROTEIN PARTICLE -
U4/U6.U5 tri-snRNP is assembled from U5 and U4/U6 snRNPs. The complex contains U4 and U6 snRNAs base-paired with each other and U6 snRNA. This complex is recruited to the pre-catalytic splicing intermediate, complex B. (Adapted from PMID:21441581).
SPLICEOSOMAL SM PROTEINS -
The spliceosomal Sm proteins are common to all spliceosomal small nuclear ribonucleoproteins (snRNPs). Sm proteins assemble in a stepwise manner onto the Sm site element of the U1, U2, U4 and U5 spliceosomal snRNAs forming a ring-shaped core RNP structure. (Adapted from PMID:11226169).
SPLICEOSOME COMPLEX A -
Nuclear pre-mRNA splicing is catalyzed by the spliceosome which assembles in a step-wise manner. Complex A, the pre-spliceosome, is composed of U1 and U2 snRNPs and proteins involved in recognition of the 5' splice site and branch point. (Adapted from PMID:24452469 and PMID:23118483).
SPLICEOSOME COMPLEX B -
Nuclear pre-mRNA splicing is catalyzed by the spliceosome which assembles in a step-wise manner. After the assembly of complex A on pre-mRNA, the U4-U6 and U5 snRNPs are recruited as a preassembled tri-snRNP to form complex B. U5 snRNP binds exons at the 5' site. (Adapted from PMID:24452469 and PMID:23118483).
SPLICEOSOME COMPLEX C -
Nuclear pre-mRNA splicing is catalyzed by the spliceosome which assembles in a step-wise manner. Complex B undergoes a number of structural rearrangements and U4 and U1 snRNPs dissociate to generate complex C, the catalytic spliceosome. (Adapted from PMID:24452469 and PMID:23118483).
SPLICEOSOME COMPLEX P -
Nuclear pre-mRNA splicing is catalyzed by the spliceosome which assembles in a step-wise manner. Complex P, the post-spliceosomal complex, is formed after the catalytic removal of the intron by complex C. Complex P contains the lariat intron and spliced exons. (Adapted from PMID:24452469 and PMID:23118483).
U7 SMALL NUCLEAR RIBONUCLEOPROTEIN PARTICLE -
The U7 snRNP involved in histone pre-mRNA 3' end processing consists of two core components: an U7 snRNA and an unusual heptameric Sm ring. (Adapted from FBrf0168021 and FBrf0208608).
U7 SM PROTEINS -
The U7 snRNP is involved in histone mRNA 3' end processing. Sm proteins are common to all small nuclear ribonucleoproteins (snRNPs), assembling in a stepwise manner onto snRNAs, forming a ring-shaped core RNP structure. (Adapted from FBrf0168021 and FBrf0208608).
Protein Function (UniProtKB)
Plays a role in pre-mRNA splicing as a core component of the spliceosomal U1, U2, U4 and U5 small nuclear ribonucleoproteins (snRNPs), the building blocks of the spliceosome.
(UniProt, Q9VLV5)
Gene Model and Products
Number of Transcripts
1
Number of Unique Polypeptides
1

Please see the JBrowse view of Dmel\SmE for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry Q9VLV5)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
Comments on Gene Model

Gene model reviewed during 5.52

Sequence Ontology: Class of Gene
Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0079560
521
94
Additional Transcript Data and Comments
Reported size (kB)
Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0079182
11.1
94
10.10
Polypeptides with Identical Sequences

There is only one protein coding transcript and one polypeptide associated with this gene

Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Subunit Structure (UniProtKB)

Core component of the spliceosomal U1, U2, U4 and U5 small nuclear ribonucleoproteins (snRNPs), the building blocks of the spliceosome (By similarity). Interacts with the SMN complex (PubMed:18621711).

(UniProt, Q9VLV5)
Crossreferences
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\SmE using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

0.16

Transcript Expression
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
mass spectroscopy
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data
Marker for
 
Subcellular Localization
CV Term
Evidence
References
inferred from high throughput direct assay
located_in nucleus
inferred from direct assay
inferred from high throughput direct assay
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\SmE in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 4 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 6 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of SmE
Transgenic constructs containing regulatory region of SmE
Aberrations (Deficiencies and Duplications) ( 1 )
Inferred from experimentation ( 1 )
Gene disrupted in
Inferred from location ( 0 )
Variants
Variant Molecular Consequences
Alleles Representing Disease-Implicated Variants
Phenotypes
Orthologs
Human Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Homo sapiens (Human) (19)
13 of 14
Yes
Yes
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1  
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
Model Organism Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Rattus norvegicus (Norway rat) (21)
12 of 14
Yes
Yes
11 of 14
No
Yes
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
Yes
1 of 14
No
No
1 of 14
No
No
1 of 14
No
Yes
Mus musculus (laboratory mouse) (16)
13 of 14
Yes
Yes
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Xenopus tropicalis (Western clawed frog) (8)
11 of 13
Yes
Yes
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Danio rerio (Zebrafish) (13)
14 of 14
Yes
Yes
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Caenorhabditis elegans (Nematode, roundworm) (13)
14 of 14
Yes
Yes
2 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Anopheles gambiae (African malaria mosquito) (16)
12 of 12
Yes
Yes
Arabidopsis thaliana (thale-cress) (25)
13 of 13
Yes
Yes
13 of 13
Yes
Yes
2 of 13
No
Yes
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Saccharomyces cerevisiae (Brewer's yeast) (15)
13 of 13
Yes
Yes
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
Schizosaccharomyces pombe (Fission yeast) (13)
12 of 12
Yes
Yes
2 of 12
No
No
2 of 12
No
No
2 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
1 of 12
No
No
Escherichia coli (enterobacterium) (0)
Other Organism Orthologs (via OrthoDB)
Data provided directly from OrthoDB:SmE. Refer to their site for version information.
Paralogs
Paralogs (via DIOPT v9.1)
Drosophila melanogaster (Fruit fly) (15)
2 of 13
2 of 13
2 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
1 of 13
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 1 )
    Human Ortholog
    Disease
    Evidence
    References
    Modifiers Based on Experimental Evidence ( 2 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Homo sapiens (Human)
    Gene name
    Score
    OMIM
    OMIM Phenotype
    DO term
    Complementation?
    Transgene?
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    Summary of Genetic Interactions
    esyN Network Diagram
    Other Interaction Browsers
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Subunit Structure (UniProtKB)
    Core component of the spliceosomal U1, U2, U4 and U5 small nuclear ribonucleoproteins (snRNPs), the building blocks of the spliceosome (By similarity). Interacts with the SMN complex (PubMed:18621711).
    (UniProt, Q9VLV5 )
    Linkouts
    DroID - A comprehensive database of gene and protein interactions.
    MIST (protein-protein) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    External Data
    Linkouts
    KEGG Pathways - A collection of manually drawn pathway maps representing knowledge of molecular interaction, reaction and relation networks.
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    2L
    Recombination map
    2-29
    Cytogenetic map
    Sequence location
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    28D2-28D2
    Limits computationally determined from genome sequence between P{PZ}mts02496 and P{EP}CG7231EP2510
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    Experimentally Determined Recombination Data
    Location
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (9)
    Genomic Clones (8)
     

    Please Note FlyBase no longer curates genomic clone accessions so this list may not be complete

    cDNA Clones (22)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Other clones
      Drosophila Genomics Resource Center cDNA clones

      For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      BDGP DGC clones
        RNAi and Array Information
        Linkouts
        DRSC - Results frm RNAi screens
        Antibody Information
        Laboratory Generated Antibodies
         
        Commercially Available Antibodies
         
        Cell Line Information
        Publicly Available Cell Lines
         
          Other Stable Cell Lines
           
            Other Comments

            Identified as a potential component of the hh signalling pathway in a genome-wide RNAi screen. dsRNA made from templates generated with primers directed affects the extent of expression of a hh signaling reporter construct in Clone 8 cells.

            Identified as a protein which is common to all small nuclear ribonucleoprotein particles (snRNPs).

            Relationship to Other Genes
            Source for database merge of

            Source for merge of: snRNPE CG18591

            Source for merge of: CG18591 BcDNA:GM19936

            Additional comments

            Source for merge of CG18591 BcDNA:GM19936 was a shared cDNA ( date:030728 ).

            Nomenclature History
            Source for database identify of
            Nomenclature comments
            Etymology
            Synonyms and Secondary IDs (8)
            Datasets (0)
            Study focus (0)
            Experimental Role
            Project
            Project Type
            Title
            Study result (0)
            Result
            Result Type
            Title
            External Crossreferences and Linkouts ( 37 )
            Sequence Crossreferences
            NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
            GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
            GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
            RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
            UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
            UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
            Other crossreferences
            AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
            DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
            EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
            FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
            KEGG Genes - Molecular building blocks of life in the genomic space.
            MARRVEL_MODEL - MARRVEL (model organism gene)
            Linkouts
            Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
            DroID - A comprehensive database of gene and protein interactions.
            DRSC - Results frm RNAi screens
            Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
            FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
            FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
            Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
            Flygut - An atlas of the Drosophila adult midgut
            FlyMet - A comprehensive tissue-specific metabolomics resource for Drosophila.
            iBeetle-Base - RNAi phenotypes in the red flour beetle (Tribolium castaneum)
            KEGG Pathways - A collection of manually drawn pathway maps representing knowledge of molecular interaction, reaction and relation networks.
            MIST (protein-protein) - An integrated Molecular Interaction Database
            References (65)