| http://www.w3.org/ns/prov#value | - Since there is so much overlap between the sequences, we expect to see each k-mer about 10 times. The problem is that you run out of memory, even with 10G. The reason is that the new fancy high-throughput machines have higher error rates, so every time there is an error you get a unique, never-seen-again k-mer.
|
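The effect described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the random genome, the read length, the 1% substitution rate, k = 21, and the helper names `count_kmers`, `simulate_reads`, and `demo` are all hypothetical choices made for illustration, not anything taken from the source.

```python
import random
from collections import Counter

BASES = "ACGT"


def count_kmers(reads, k):
    """Count every length-k substring across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def simulate_reads(genome, read_len, n_reads, error_rate, rng):
    """Sample reads at random positions; substitute each base with probability error_rate."""
    reads = []
    for _ in range(n_reads):
        start = rng.randrange(len(genome) - read_len + 1)
        read = list(genome[start:start + read_len])
        for i, base in enumerate(read):
            if rng.random() < error_rate:
                # Always pick a different base, so every hit is a real error.
                read[i] = rng.choice([b for b in BASES if b != base])
        reads.append("".join(read))
    return reads


def demo(seed=0, genome_len=2000, read_len=100, coverage=10, k=21, error_rate=0.01):
    """Return (distinct k-mers without errors, distinct k-mers with errors, singletons with errors)."""
    rng = random.Random(seed)
    # Hypothetical toy genome: uniformly random bases.
    genome = "".join(rng.choice(BASES) for _ in range(genome_len))
    n_reads = coverage * genome_len // read_len
    clean = count_kmers(simulate_reads(genome, read_len, n_reads, 0.0, rng), k)
    noisy = count_kmers(simulate_reads(genome, read_len, n_reads, error_rate, rng), k)
    singletons = sum(1 for c in noisy.values() if c == 1)
    return len(clean), len(noisy), singletons


if __name__ == "__main__":
    clean_n, noisy_n, singletons = demo()
    print("distinct k-mers, error-free reads:", clean_n)
    print("distinct k-mers, reads with errors:", noisy_n)
    print("k-mers seen exactly once (with errors):", singletons)
```

Without errors, the number of distinct k-mers is bounded by the genome length, no matter how deep the coverage. With errors, a single wrong base can corrupt up to k overlapping k-mers, so the table of distinct k-mers grows with the amount of data rather than with the genome, which is where the memory goes.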