Genomes and results

From all we know, the genome influences a large portion of the organism's features. We also know that the human genome can be encoded using around 750 MB of data. This data is largely duplicated inside itself, therefore in practice around 375 MB should be sufficient per genome. Finally, genomes differ between people by about 0.1%.

Several simple conclusions can be drawn from these facts. Firstly, the human genome can be stored in the form of a single sample for reference coupled with relatively short records of just the differences between the given sample and the reference one.

Storing DNA in this form would result in differences of size 0.375 MB, which if we take every single bit as a separate variable, results in 375 000 variables. While the range of possible combinations of these variables is difficult to imagine - in the order of $1.40625 * 10^{11}$ - differences between particular people should be possible to recognize and assign to particular differences between their genomes.

Assuming that people are the set of the most complex organisms on our planet we infer that differences between all other organisms should be smaller than differences between representatives of homo sapiens. Those organisms should simplify comparison and correlation between their genomes and differences between respective representatives even further.

How do we compare genomes of some representatives of animals? One placeto start may be the base of https://www.ncbi.nlm.nih.gov/genome.