I don’t think you understand the difference between sequencing and assembly, nor how whole genome shotgun methods work.
What they did with the initial chimp sequence is physically chop up the genome into random chunks that were hundreds of bases long, and then sequenced those little chunks. If you do this for many copies of the genome you get chunks of sequence that over lap each other. For example, let’s say I had these chunks of a sentence:
The little
brown fox jumped
little brown fox
jumped over the
over the white
the white fence
fence over the
over the hill.
By overlaying the common words I can rebuild the whole sentence:
The little brown fox jumped over the white fence over the hill.
The little
little brown fox
brown fox jumped
jumped over the
over the white
the white fence
fence over the hill.
That is how the whole genome shotgun method works, in essence. The tough part is assembling all of the tiny pieces into the larger whole, and that is where they used the human genome as a backbone to help put those pieces together. Think of it as using an already assembled jigsaw puzzle to help put together another jigsaw puzzle that is 98% similar. By comparing the puzzles you have a good starting point for figuring out where the sequences fit together.
The important thing here is that they are not changing the actual sequence reads from the chimp genome to match those in the human genome. They are simply using the human genome to figure out how the small chunks of chimp sequence fit together.
When scientists say that certain sequence could not be aligned or assembled they are saying that they don’t know where that chunk of sequenced DNA fits into the larger genome. When you are comparing two genomes from two species you want to compare DNA that is orthologous, so if you don’t know where DNA reads fit into the larger genome then you can’t compare them, even if two chunks of unaligned sequence match 100%. Like realty, it is all about location, location, location. You can’t compare DNA if you don’t know where it fits into the genome.
Repetitive DNA is notoriously hard to align because there are many places where the DNA chunks can overlap. For example:
These two chunks could over lap like this:
ATTTATTTATTTATTTATTTATTTAGACCCGAGCGG
CCGAGCCATTTATTTATTTATTTATTTATTTA
like this:
ATTTATTTATTTATTTATTTATTTAGACCCGAGCGG
CCGAGCCATTTATTTATTTATTTATTTATTTA
like this:
ATTTATTTATTTATTTATTTATTTAGACCCGAGCGG
CCGAGCCATTTATTTATTTATTTATTTATTTA
There is no one unique way in which those two sequences align, so they are unaligned and kept out of a comparison between species.