Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Align step fails with could not open input file /geneInfo.tab #83

Open
benostendorf opened this issue Jun 6, 2024 · 5 comments
Open

Comments

@benostendorf
Copy link

I have deployed the latest version of this workflow (v2.1.2) and only adjusted the config files. I use the ENSEMBL GRCm38 mouse reference genome (release 102).
Unfortunately, the mapping step fails with the following error:

Transcriptome.cpp:18:Transcriptome: exiting because of *INPUT FILE* error: could not open input file /geneInfo.tab
Solution: check that the file exists and you have read permission for this file
          SOLUTION: utilize --sjdbGTFfile /path/to/annotations.gtf option at the genome generation step or mapping step

Jun 06 19:42:37 ...... FATAL ERROR, exiting

This seems to be the same error as reported here.
Thank you for looking into this!

@dlaehnemann
Copy link
Contributor

From what I understand from the issue you linked, this is a bug in STAR and we will have to wait for the fix that the maintainer of STAR has already created (and links to in the bug report) is properly released (in the main STAR repository, not the separate pre-release repository). You could ask the maintainer in the issue to create a new release, noting that you also need this. Once he does this, I am glad to help with the necessary follow-up, which will be:

  1. Release the new version on bioconda.
  2. Update the STAR wrapper(s) in the snakemake wrappers repository.
  3. Switch to the newest STAR wrapper versions in the workflow.

The only other possibility I see, if the maintainer doesn't create a release soon, would be to patch up the bioconda recipe with bugfix. This could be very straightforward, but I don't currently have the bandwidth to investigate this further. But feel free to ping me with questions regarding this, if you want to pursue that solution further.

@benostendorf
Copy link
Author

Thanks for the prompt reply! I asked for creation of a new release, hopefully this will happen soon.

@dlaehnemann
Copy link
Contributor

Maybe, just to be sure, a two more questions:

  1. Did you just use the automatic GTF download that the workflow does, or did you manually put in place a GTF file. I guess that mismatches due to a manually added GTF might lead to issues.
  2. And in either case, could you check that the GTF file contains the gene_id tag? The maintainer was mentioning this as a requirement in the issue you linked to:
    STAR FATAL ERROR, exiting / Transcriptome.cpp:18:Transcriptome: exiting because of *INPUT FILE* error alexdobin/STAR#1953 (comment)

Because you never know, the error could be triggered by something different than the bug reported there. And also, the GitHub Actions continuous integration tests here just ran through these past days with a little fix to the STAR rules. So it's a bit funny that this fails on your end.

@benostendorf
Copy link
Author

benostendorf commented Jun 7, 2024 via email

@dlaehnemann
Copy link
Contributor

OK, at least this is not broken, then... 😅

So it is probably some weird interaction of this bug with the software stack... 🤷

Fingers crossed, that the release of the bug fix makes this go away.

Otherwise, we more regularly use the rna-seq-kallisto-sleuth workflow ourselves, and maintain and improve it constantly. So maybe this is an alternative you might want to try for your RNAseq Analysis:
https://snakemake.github.io/snakemake-workflow-catalog/?repo=snakemake-workflows/rna-seq-kallisto-sleuth

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants