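Hi everyone. I'm trying to build a genome index with STAR. The FASTA comes from https://ftp.ensembl.org/pub/release-109/fasta/mus_musculus/dna/ and the GTF comes from https://ftp.ensembl.org/pub/release-109/gtf/mus_musculus/
However, the run fails on the GTF file with an error saying there are no exon lines in the third column. The GTF I'm using does have exon entries in the third column.
Any help would be appreciated.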
STAR --runMode genomeGenerate --genomeDir ~/reference_genome/index/star/mm39 --genomeFastaFiles Mus_musculus.GRCm39.dna.primary_assembly.fa --sjdbGTFfile Mus_musculus.GRCm39.109.chr.gtf.gz --sjdbOverhang 150 --runThreadN 5 --readFilesCommand zcat
STAR version: 2.7.10b compiled: 2022-11-01T09:53:26-04:00 :/home/dobin/data/STAR/STARcode/STAR.master/source
Jul 07 18:24:08 ..... started STAR run
Jul 07 18:24:08 ... starting to generate Genome files
Jul 07 18:24:47 ..... processing annotations GTF
Fatal INPUT FILE error, no exon lines in the GTF file: Mus_musculus.GRCm39.109.chr.gtf.gz
Solution: check the formatting of the GTF file, it must contain some lines with exon in the 3rd column.
Make sure the GTF file is unzipped.
If exons are marked with a different word, use --sjdbGTFfeatureExon .
Jul 07 18:24:47 ...... FATAL ERROR, exiting
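According to the error message, STAR did not find any lines with "exon" in the third column while processing the GTF file. This is usually caused by a malformed GTF file or by a GTF file that has not been decompressed.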
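The command in the question passes Mus_musculus.GRCm39.109.chr.gtf.gz to --sjdbGTFfile, and the error output itself says to make sure the GTF file is unzipped, so the compressed file is the most likely culprit. As far as I know, --readFilesCommand only affects the read files given with --readFilesIn, not the annotation. Below is a minimal shell sketch for confirming that the exon lines exist and for producing an uncompressed copy. The helper names count_exon_lines and decompress_gtf are made up for illustration, and the sketch assumes gzip and awk are available.

```shell
# Count feature lines whose 3rd column is exactly "exon".
# "gzip -cdf" decompresses gzipped input and passes plain text through unchanged.
count_exon_lines() {
    gzip -cdf "$1" | awk -F'\t' '!/^#/ && $3 == "exon" { n++ } END { print n + 0 }'
}

# Write an uncompressed copy next to the .gz file (the .gz is kept)
# and print the path of the new file.
decompress_gtf() {
    case "$1" in
        *.gz) ;;
        *) echo "expected a .gz file: $1" >&2; return 1 ;;
    esac
    gzip -cd "$1" > "${1%.gz}" && printf '%s\n' "${1%.gz}"
}

# Usage with the file names from the question (not executed here):
#   count_exon_lines Mus_musculus.GRCm39.109.chr.gtf.gz
#   gtf=$(decompress_gtf Mus_musculus.GRCm39.109.chr.gtf.gz)
#   STAR --runMode genomeGenerate \
#        --genomeDir ~/reference_genome/index/star/mm39 \
#        --genomeFastaFiles Mus_musculus.GRCm39.dna.primary_assembly.fa \
#        --sjdbGTFfile "$gtf" \
#        --sjdbOverhang 150 --runThreadN 5
```

If count_exon_lines prints a number greater than zero for the .gz file, the annotation itself is fine and only the compression was getting in the way. The FASTA must be uncompressed as well, which already appears to be the case in the original command.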
Here are a few things you can check: