geoseriesread

Read Gene Expression Omnibus (GEO) Series (GSE) format data

Syntax

GEOData = geoseriesread(File)

Input Arguments

File

Either of the following:

  • String specifying a file name, a path and file name, or a URL pointing to a file. The referenced file is a Gene Expression Omnibus (GEO) Series (GSE) format file. If you specify only a file name, that file must be on the MATLAB® search path or in the MATLAB Current Folder.

  • MATLAB character array that contains the text of a GEO Series (GSE) format file.

    Tip   You can use the getgeodata function with the 'ToFile' property to retrieve GEO Series (GSE) format data from the GEO database and create a GEO Series (GSE) format file.
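The second input form can be sketched as follows. This is a minimal illustration, assuming a GEO Series (GSE) format file named GSE11287.txt (the name used in the Examples section) already exists in the current folder. It uses the standard MATLAB fileread function to load the file text into a character array.

```matlab
% Assumption: GSE11287.txt is a GEO Series (GSE) format file in the current folder.
gseText = fileread('GSE11287.txt');   % character array holding the file contents

% Pass the text itself to geoseriesread instead of a file name.
geodata = geoseriesread(gseText);
```

Passing a file name directly is usually simpler. The character array form may be useful when the text is already in memory.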

Output Arguments

GEOData

MATLAB structure containing the following fields:

  • Header — Header text from the GEO Series (GSE) format file, typically containing a description of the data or experiment information.

  • Data — DataMatrix object containing the data from a GEO Series (GSE) format file. The columns and rows of the DataMatrix object correspond to the sample IDs and Ref IDs, respectively, from the GEO Series (GSE) format file.

Description

GEOData = geoseriesread(File) reads a Gene Expression Omnibus (GEO) Series (GSE) format file, and then creates a MATLAB structure, GEOData, with the following fields.

Field     Description
Header    Header text from the GEO Series (GSE) format file, typically containing a description of the data or experiment information.
Data      DataMatrix object containing the data from a GEO Series (GSE) format file. The columns and rows of the DataMatrix object correspond to the sample IDs and Ref IDs, respectively, from the GEO Series (GSE) format file.
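As a rough sketch of how the two fields might be used, the following reads a file and pulls out the header, the data, and the row and column labels. It assumes a local file named GSE11287.txt (as in the Examples section). The variable names are illustrative only, and size, rownames, and colnames are the DataMatrix methods for dimensions and labels.

```matlab
% Assumption: GSE11287.txt is a GEO Series (GSE) format file available locally.
geodata = geoseriesread('GSE11287.txt');

headerInfo = geodata.Header;            % structure holding the header text
exprData   = geodata.Data;              % DataMatrix object

[nRefIDs, nSamples] = size(exprData);   % rows are Ref IDs, columns are sample IDs
refIDs    = rownames(exprData);         % Ref IDs (row names)
sampleIDs = colnames(exprData);         % sample IDs (column names)
```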

Examples

  1. Retrieve Series (GSE) data from the GEO Web site and save it to a file.

    geodata = getgeodata('GSE11287','ToFile','GSE11287.txt');
    
  2. In a subsequent MATLAB session, you can access the Series (GSE) data from your local file, instead of retrieving it from the GEO Web site.

    geodata = geoseriesread('GSE11287.txt')
    
    geodata = 
    
        Header: [1x1 struct]
          Data: [45101x6 bioma.data.DataMatrix]

  3. Access the sample IDs using the colnames property of a DataMatrix object.

    sampleIDs = geodata.Data.colnames
    
    sampleIDs = 
    
      'GSM284935'  'GSM284936'  'GSM284937'  'GSM284938'  'GSM284939'  'GSM284940'
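Steps 1 and 2 can be combined into a single pattern that downloads the data only when no local copy exists. This is a sketch rather than a prescribed workflow: the variable name gseFile is illustrative, and the check uses the standard MATLAB exist function, which returns 2 when the name refers to a file.

```matlab
gseFile = 'GSE11287.txt';   % local file name used in the steps above

if exist(gseFile, 'file') == 2
    % A local copy exists: read it without contacting the GEO Web site.
    geodata = geoseriesread(gseFile);
else
    % No local copy yet: retrieve the data and save it for later sessions.
    geodata = getgeodata('GSE11287', 'ToFile', gseFile);
end
```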