FB2024_03 , released June 25, 2024
Gene: Dmel\grn
Open Close
General Information
Symbol
Dmel\grn
Species
D. melanogaster
Name
grain
Annotation Symbol
CG9656
Feature Type
FlyBase ID
FBgn0001138
Gene Model Status
Stock Availability
Gene Summary
grain (grn) encodes a transcription factor from the GATA family. It regulates the expression of receptors and adhesion molecules such as those encoded by unc-5 and Fas2 involved in axon guidance. It contributes to multiple other developmental processes including leg and larval spiracle morphogenesis. [Date last reviewed: 2019-03-07] (FlyBase Gene Snapshot)
Also Known As

GATAc, gra, dGATAc, l(3)84Fa, Gata-c

Key Links
Genomic Location
Cytogenetic map
Sequence location
Recombination map
3-48
RefSeq locus
NT_033777 REGION:8147368..8181444
Sequence
Genomic Maps
Other Genome Views
The following external sites may use different assemblies or annotations than FlyBase.
Function
Gene Ontology (GO) Annotations (13 terms)
Molecular Function (5 terms)
Terms Based on Experimental Evidence (1 term)
CV Term
Evidence
References
inferred from direct assay
Terms Based on Predictions or Assertions (5 terms)
CV Term
Evidence
References
Biological Process (7 terms)
Terms Based on Experimental Evidence (3 terms)
CV Term
Evidence
References
inferred from mutant phenotype
inferred from mutant phenotype
involved_in tissue development
inferred from mutant phenotype
Terms Based on Predictions or Assertions (4 terms)
CV Term
Evidence
References
Cellular Component (1 term)
Terms Based on Experimental Evidence (0 terms)
Terms Based on Predictions or Assertions (1 term)
CV Term
Evidence
References
is_active_in nucleus
inferred from biological aspect of ancestor with PANTHER:PTN001600628
Gene Group (FlyBase)
Protein Family (UniProt)
-
Summaries
Gene Snapshot
grain (grn) encodes a transcription factor from the GATA family. It regulates the expression of receptors and adhesion molecules such as those encoded by unc-5 and Fas2 involved in axon guidance. It contributes to multiple other developmental processes including leg and larval spiracle morphogenesis. [Date last reviewed: 2019-03-07]
Gene Group (FlyBase)
GATA TRANSCRIPTION FACTORS -
GATA transcription factors contain one or two zinc fingers with the amino acid sequence CX2CX17CX2C (where x is any other amino acid) that can bind the consensus DNA sequence (A/T) GATA (A/G). (Adapted from FBrf0195200).
Protein Function (UniProtKB)
Transcription factor that is vital to the development of multiple organ systems. Binds to the core consensus sequence 5'-WGATAR-3'.
(UniProt, P91623)
Phenotypic Description (Red Book; Lindsley and Zimm 1992)
grn: grain
Homozygous lethal; filzkorper or embryo not elongated; head skeleton defective.
Summary (Interactive Fly)

zinc finger - GATA family - required during development for shaping the adult legs and the larval posterior spiracles

Gene Model and Products
Number of Transcripts
3
Number of Unique Polypeptides
3

Please see the JBrowse view of Dmel\grn for information on other features

To submit a correction to a gene model please use the Contact FlyBase form

Protein Domains (via Pfam)
Isoform displayed:
Pfam protein domains
InterPro name
classification
start
end
Protein Domains (via SMART)
Isoform displayed:
SMART protein domains
InterPro name
classification
start
end
Structure
Protein 3D structure   (Predicted by AlphaFold)   (AlphaFold entry P91623)

If you don't see a structure in the viewer, refresh your browser.
Model Confidence:
  • Very high (pLDDT > 90)
  • Confident (90 > pLDDT > 70)
  • Low (70 > pLDDT > 50)
  • Very low (pLDDT < 50)

AlphaFold produces a per-residue confidence score (pLDDT) between 0 and 100. Some regions with low pLDDT may be unstructured in isolation.

Experimentally Determined Structures
Crossreferences
Comments on Gene Model

Stop-codon suppression (UGA) postulated; FBrf0216884.

Gene model reviewed during 5.44

Gene model reviewed during 5.49

Sequence Ontology: Class of Gene
Transcript Data
Annotated Transcripts
Name
FlyBase ID
RefSeq ID
Length (nt)
Assoc. CDS (aa)
FBtr0081808
3451
486
FBtr0300040
4090
699
FBtr0330188
4090
712
Additional Transcript Data and Comments
Reported size (kB)

3.112 (longest cDNA)

Comments
External Data
Crossreferences
Polypeptide Data
Annotated Polypeptides
Name
FlyBase ID
Predicted MW (kDa)
Length (aa)
Theoretical pI
UniProt
RefSeq ID
GenBank
FBpp0081304
50.6
486
9.72
FBpp0289317
71.0
699
9.70
Polypeptides with Identical Sequences

None of the polypeptides share 100% sequence identity.

Additional Polypeptide Data and Comments
Reported size (kDa)
Comments
External Data
Crossreferences
InterPro - A database of protein families, domains and functional sites
Linkouts
Sequences Consistent with the Gene Model
Mapped Features

Click to get a list of regulatory features (enhancers, TFBS, etc.) and gene disruptions (point mutations, indels, etc.) within or overlapping Dmel\grn using the Feature Mapper tool.

External Data
Crossreferences
Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
Linkouts
Expression Data
Testis-specificity index

The testis specificity index was calculated from modENCODE tissue expression data by Vedelek et al., 2018 to indicate the degree of testis enrichment compared to other tissues. Scores range from -2.52 (underrepresented) to 5.2 (very high testis bias).

-0.62

Transcript Expression
in situ
Stage
Tissue/Position (including subcellular localization)
Reference
dorsal ectoderm anlage

Comment: anlage in statu nascendi

dorsal head epidermis anlage in statu nascendi

Comment: reported as procephalic ectoderm anlage in statu nascendi

antennal anlage in statu nascendi

Comment: reported as procephalic ectoderm anlage in statu nascendi

visual anlage in statu nascendi

Comment: reported as procephalic ectoderm anlage in statu nascendi

embryonic head | dorsal | precursor

Comment: late embryonic stage 5

organism | 40-60% egg length | dorsal

Comment: late embryonic stage 5

organism | ventral

Comment: late embryonic stage 5

organism | 10-30% egg length | dorsal

Comment: late stage 5; reference states 15-25% egg length

antennal anlage

Comment: reported as procephalic ectoderm anlage

central brain anlage

Comment: reported as procephalic ectoderm anlage

dorsal head epidermis anlage

Comment: reported as procephalic ectoderm anlage

visual anlage

Comment: reported as procephalic ectoderm anlage

antennal primordium

Comment: reported as procephalic ectoderm primordium

central brain primordium

Comment: reported as procephalic ectoderm primordium

visual primordium

Comment: reported as procephalic ectoderm primordium

dorsal head epidermis primordium

Comment: reported as procephalic ectoderm primordium

lateral head epidermis primordium

Comment: reported as procephalic ectoderm primordium

ventral head epidermis primordium

Comment: reported as procephalic ectoderm primordium

dorsal epidermis primordium

Comment: reported as dorsal epidermis anlage

Additional Descriptive Data

grn expression is most prominent in the procephalic region at stages 6-10 and in the posterior spiracles, gut and central nervous system at stage 11-13.

Marker for
 
Subcellular Localization
CV Term
Polypeptide Expression
immunolocalization
Stage
Tissue/Position (including subcellular localization)
Reference
mass spectroscopy
Stage
Tissue/Position (including subcellular localization)
Reference
Additional Descriptive Data

grn protein is expressed in the posterior spiracles, the midgut, and in a patch of cells in the lateral ectoderm as well as in a diverse set of interneurons and motorneurons that extend axons along the major axon tracts.

Marker for
 
Subcellular Localization
CV Term
Evidence
References
Expression Deduced from Reporters
High-Throughput Expression Data
Associated Tools

JBrowse - Visual display of RNA-Seq signals

View Dmel\grn in JBrowse
RNA-Seq by Region - Search RNA-Seq expression levels by exon or genomic region
Reference
See Gelbart and Emmert, 2013 for analysis details and data files for all genes.
Developmental Proteome: Life Cycle
Developmental Proteome: Embryogenesis
External Data and Images
Linkouts
BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
Flygut - An atlas of the Drosophila adult midgut
Images
Alleles, Insertions, Transgenic Constructs, and Aberrations
Classical and Insertion Alleles ( 11 )
For All Classical and Insertion Alleles Show
 
Other relevant insertions
Transgenic Constructs ( 16 )
For All Alleles Carried on Transgenic Constructs Show
Transgenic constructs containing/affecting coding region of grn
Transgenic constructs containing regulatory region of grn
Aberrations (Deficiencies and Duplications) ( 21 )
Variants
Variant Molecular Consequences
Alleles Representing Disease-Implicated Variants
Phenotypes
For more details about a specific phenotype click on the relevant allele symbol.
Lethality
Allele
Other Phenotypes
Allele
Phenotype manifest in
Allele
Orthologs
Human Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Homo sapiens (Human) (8)
11 of 14
Yes
Yes
10 of 14
No
Yes
7 of 14
No
Yes
3 of 14
No
No
1  
3 of 14
No
No
3 of 14
No
No
2 of 14
No
No
1  
2 of 14
No
No
Model Organism Orthologs (via DIOPT v9.1)
Species\Gene Symbol
Score
Best Score
Best Reverse Score
Alignment
Complementation?
Transgene?
Rattus norvegicus (Norway rat) (8)
11 of 14
Yes
Yes
10 of 14
No
Yes
7 of 14
No
Yes
3 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
1 of 14
No
No
Mus musculus (laboratory mouse) (8)
10 of 14
Yes
Yes
10 of 14
Yes
Yes
7 of 14
No
Yes
3 of 14
No
No
3 of 14
No
No
2 of 14
No
No
1  
1 of 14
No
No
1 of 14
No
No
Xenopus tropicalis (Western clawed frog) (12)
9 of 13
Yes
Yes
6 of 13
No
Yes
6 of 13
No
Yes
5 of 13
No
No
2 of 13
No
No
2 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
Yes
Danio rerio (Zebrafish) (9)
11 of 14
Yes
Yes
10 of 14
No
Yes
10 of 14
No
Yes
8 of 14
No
Yes
7 of 14
No
Yes
3 of 14
No
No
2 of 14
No
No
2 of 14
No
No
1 of 14
No
No
Caenorhabditis elegans (Nematode, roundworm) (12)
6 of 14
Yes
Yes
4 of 14
No
No
4 of 14
No
Yes
3 of 14
No
Yes
3 of 14
No
No
3 of 14
No
No
3 of 14
No
Yes
3 of 14
No
Yes
2 of 14
No
No
2 of 14
No
Yes
2 of 14
No
Yes
1 of 14
No
Yes
Anopheles gambiae (African malaria mosquito) (4)
10 of 12
Yes
Yes
Arabidopsis thaliana (thale-cress) (78)
3 of 13
Yes
Yes
3 of 13
Yes
Yes
3 of 13
Yes
Yes
3 of 13
Yes
Yes
3 of 13
Yes
Yes
2 of 13
No
Yes
2 of 13
No
Yes
2 of 13
No
Yes
2 of 13
No
Yes
2 of 13
No
Yes
2 of 13
No
Yes
2 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
No
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
1 of 13
No
Yes
Saccharomyces cerevisiae (Brewer's yeast) (9)
6 of 13
Yes
No
6 of 13
Yes
No
5 of 13
No
No
4 of 13
No
No
2 of 13
No
Yes
2 of 13
No
Yes
2 of 13
No
Yes
2 of 13
No
Yes
1 of 13
No
No
Schizosaccharomyces pombe (Fission yeast) (4)
7 of 12
Yes
Yes
6 of 12
No
Yes
3 of 12
No
No
1 of 12
No
No
Escherichia coli (enterobacterium) (0)
Other Organism Orthologs (via OrthoDB)
Data provided directly from OrthoDB:grn. Refer to their site for version information.
Paralogs
Paralogs (via DIOPT v9.1)
Drosophila melanogaster (Fruit fly) (4)
8 of 13
7 of 13
7 of 13
6 of 13
Human Disease Associations
FlyBase Human Disease Model Reports
    Disease Ontology (DO) Annotations
    Models Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Evidence
    References
    Potential Models Based on Orthology ( 8 )
    Modifiers Based on Experimental Evidence ( 0 )
    Allele
    Disease
    Interaction
    References
    Disease Associations of Human Orthologs (via DIOPT v9.1 and OMIM)
    Note that ortholog calls supported by only 1 or 2 algorithms (DIOPT score < 3) are not shown.
    Homo sapiens (Human)
    Gene name
    Score
    OMIM
    OMIM Phenotype
    DO term
    Complementation?
    Transgene?
    Functional Complementation Data
    Functional complementation data is computed by FlyBase using a combination of the orthology data obtained from DIOPT and OrthoDB and the allele-level genetic interaction data curated from the literature.
    Interactions
    Summary of Physical Interactions
    esyN Network Diagram
    Show neighbor-neighbor interactions:
    Show/hide secondary interactors 
    (data from AllianceMine provided by esyN)
    Select Layout:
    Legend:
    Protein
    RNA
    Selected Interactor(s)
    Other Interaction Browsers

    Please see the Physical Interaction reports below for full details
    protein-protein
    Physical Interaction
    Assay
    References
    Summary of Genetic Interactions
    esyN Network Diagram
    Show/hide secondary interactors 
    (data from AllianceMine provided by esyN)
    esyN Network Key:
    Suppression
    Enhancement
    Other Interaction Browsers

    Please look at the allele data for full details of the genetic interactions
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    Starting gene(s)
    Interaction type
    Interacting gene(s)
    Reference
    External Data
    Linkouts
    DroID - A comprehensive database of gene and protein interactions.
    MIST (genetic) - An integrated Molecular Interaction Database
    Pathways
    Signaling Pathways (FlyBase)
    Metabolic Pathways
    External Data
    Genomic Location and Detailed Mapping Data
    Chromosome (arm)
    3R
    Recombination map
    3-48
    Cytogenetic map
    Sequence location
    FlyBase Computed Cytological Location
    Cytogenetic map
    Evidence for location
    84F1-84F1
    Limits computationally determined from genome sequence between P{EP}EP3060EP3060 and P{PZ}grn05930
    Experimentally Determined Cytological Location
    Cytogenetic map
    Notes
    References
    84F1-84F2
    (determined by in situ hybridisation)
    Experimentally Determined Recombination Data
    Left of (cM)
    Right of (cM)
    Notes
    Stocks and Reagents
    Stocks (50)
    Genomic Clones (38)
    cDNA Clones (21)
     

    Please Note This section lists cDNAs and ESTs that fall within the genomic extent of the gene model, which may include cDNAs and ESTs of genes within introns, or of overlapping genes. Please see JBrowse for alignment of the cDNAs and ESTs to the gene model.

    cDNA clones, fully sequenced
    BDGP DGC clones
    Other clones
      Drosophila Genomics Resource Center cDNA clones

      For each fully sequenced cDNA the DGRC maintains various forms of the cDNA (e.g tagged or untagged) in several different host vectors for subsequent cloning and expression in Drosophila and Drosophila cell lines.

      cDNA Clones, End Sequenced (ESTs)
      BDGP DGC clones
      RNAi and Array Information
      Linkouts
      DRSC - Results frm RNAi screens
      Antibody Information
      Laboratory Generated Antibodies
       
      Commercially Available Antibodies
       
      Cell Line Information
      Publicly Available Cell Lines
       
        Other Stable Cell Lines
         
          Other Comments

          grn acts cell-autonomously in intersegmental nerve motoneurons to ensure proper axon pathfinding to the dorsal most muscles.

          grn affects organ shape by locally controlling cell rearrangement.

          grn has a distinct expression pattern in embryos; most prominent in the procephalic region at stages 6-10 and in the posterior spiracles, gut and central nervous system at stage 11-13.

          Zygotically active locus involved in the terminal developmental program in the embryo.

          grn mutants display non-elongated filzkorper and a defective head skeleton.

          Relationship to Other Genes
          Source for database merge of

          Source for merge of: grn GATAc

          Source for merge of: grn l(3)84Fa

          Additional comments

          pnr appears to be a chimeric gene, with the 5' part of the gene derived from grn and the 3' part of the gene derived from GATAe.

          "pnr" is a putative chimeric gene derived from the "grn" and "GATAe" genes (where coding sequences of the two parental genes contribute to the coding sequence of the chimeric gene).

          Nomenclature History
          Source for database identify of

          Source for identity of: grn CG9656

          Nomenclature comments
          Etymology
          Synonyms and Secondary IDs (17)
          Reported As
          Symbol Synonym
          Gata2
          Secondary FlyBase IDs
          • FBgn0004213
          • FBgn0015228
          • FBgn0019991
          Datasets (0)
          Study focus (0)
          Experimental Role
          Project
          Project Type
          Title
          Study result (0)
          Result
          Result Type
          Title
          External Crossreferences and Linkouts ( 48 )
          Sequence Crossreferences
          NCBI Gene - Gene integrates information from a wide range of species. A record may include nomenclature, Reference Sequences (RefSeqs), maps, pathways, variations, phenotypes, and links to genome-, phenotype-, and locus-specific resources worldwide.
          GenBank Nucleotide - A collection of sequences from several sources, including GenBank, RefSeq, TPA, and PDB.
          GenBank Protein - A collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB.
          RefSeq - A comprehensive, integrated, non-redundant, well-annotated set of reference sequences including genomic, transcript, and protein.
          UniProt/GCRP - The gene-centric reference proteome (GCRP) provides a 1:1 mapping between genes and UniProt accessions in which a single 'canonical' isoform represents the product(s) of each protein-coding gene.
          UniProt/Swiss-Prot - Manually annotated and reviewed records of protein sequence and functional information
          UniProt/TrEMBL - Automatically annotated and unreviewed records of protein sequence and functional information
          Other crossreferences
          AlphaFold DB - AlphaFold provides open access to protein structure predictions for the human proteome and other key proteins of interest, to accelerate scientific research.
          BDGP expression data - Patterns of gene expression in Drosophila embryogenesis
          DRscDB - A single-cell RNA-seq resource for data mining and data comparison across species
          EMBL-EBI Single Cell Expression Atlas - Single cell expression across species
          FlyAtlas2 - A Drosophila melanogaster expression atlas with RNA-Seq, miRNA-Seq and sex-specific data
          FlyMine - An integrated database for Drosophila genomics
          InterPro - A database of protein families, domains and functional sites
          KEGG Genes - Molecular building blocks of life in the genomic space.
          MARRVEL_MODEL - MARRVEL (model organism gene)
          Linkouts
          Drosophila Genomics Resource Center - Drosophila Genomics Resource Center (DGRC) cDNA clones
          DroID - A comprehensive database of gene and protein interactions.
          DRSC - Results frm RNAi screens
          Eukaryotic Promoter Database - A collection of databases of experimentally validated promoters for selected model organisms.
          FlyAtlas - Adult expression by tissue, using Affymetrix Dros2 array
          FlyCyc Genes - Genes from a BioCyc PGDB for Dmel
          Fly-FISH - A database of Drosophila embryo and larvae mRNA localization patterns
          Flygut - An atlas of the Drosophila adult midgut
          FlyMet - A comprehensive tissue-specific metabolomics resource for Drosophila.
          iBeetle-Base - RNAi phenotypes in the red flour beetle (Tribolium castaneum)
          Interactive Fly - A cyberspace guide to Drosophila development and metazoan evolution
          MIST (genetic) - An integrated Molecular Interaction Database
          References (135)