Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Initial feedback #1

Open
marfox opened this issue Oct 5, 2019 · 0 comments
Open

Initial feedback #1

marfox opened this issue Oct 5, 2019 · 0 comments

Comments

@marfox
Copy link

marfox commented Oct 5, 2019

Hi @vrandezo , I'm pasting below my first thoughts posted on the Wikidata mailing list:

  1. in general, how can we compare datasets with totally different time stamps? Wikidata is alive, Freebase is dead, and the latest DBpedia dump is old;
  2. given that all datasets contain Wikipedia links, perhaps we could use them as a bridge for the comparison, instead of Wikidata mappings. I'm assuming that Freebase and DBpedia entities with Wikidata mappings are subsets of the whole datasets (but this should be verified);
  3. we could use record linkage techniques to connect Wikidata entities with Freebase and DBpedia ones, then assess the agreement in terms of statements per entity. There has been some experimental work (different use case and goal) in the soweego project:
    https://soweego.readthedocs.io/en/latest/validator.html
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant