Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

KL-divergence between distributions for fixed length sequences #57

Open
NeonNeon opened this issue Mar 29, 2018 · 1 comment
Open

KL-divergence between distributions for fixed length sequences #57

NeonNeon opened this issue Mar 29, 2018 · 1 comment
Assignees
Labels

Comments

@NeonNeon
Copy link
Collaborator

For sequence length 10,
Compute the probability distribution of all sequences s in alphabet^10-space for each vlmc, then use KL to measure divergence between these distributions.

@NeonNeon NeonNeon self-assigned this Mar 29, 2018
@NeonNeon
Copy link
Collaborator Author

For a sequence length as low as 6 we get similar results as the frobenius distance on the intersection.

box

Average procent of genus in top #genus: 0.85089	Average procent of family in top #family 0.55093
Average distance fraction to genus: 0.17051	Average distance fraction to family 0.65487	Average distance: 1.00000```

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant