Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

no output when using process substitution as input #6

Open
rec3141 opened this issue Nov 24, 2017 · 1 comment
Open

no output when using process substitution as input #6

rec3141 opened this issue Nov 24, 2017 · 1 comment

Comments

@rec3141
Copy link

rec3141 commented Nov 24, 2017

I tried the following, which seems to process reads ok but doesn't output anything

filtlong -1 R1.fastq.gz --min_length 1000 --target_bases 100000 <(cat *.fq)

I can cat the fq separately but that takes forever because the libraries are many Gb each and I'll just delete the tmp file immediately anyway

the other alternative is to run filtlong separately for each library, but reading in the references also takes a long time for each one. It would be nice if it output data when using the process substitution on input.

thanks

@rrwick
Copy link
Owner

rrwick commented Nov 26, 2017

This is an interesting one! I suspect the problem comes from the fact that Filtlong currently reads through the input file twice: once to score the reads and once to output them. I did it so Filtlong didn't have to keep the read sequences in memory, just info about the reads. So a 10 GB read file should work on a machine with 8 GB of memory.

I think that with your process substitution, Filtlong is reading through the reads fine the first time through but is getting nothing the second time through. I'm no Linux whiz, but I don't see a way around this one.

The best solution that comes to mind is to have Filtlong accept multiple input files in a single command. Then you could achieve what you want without process substitution.

Ryan

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants