Is the original dataset available? #1

Open
limjcst opened this issue Nov 18, 2019 · 0 comments

Thanks for your brilliant work!

I followed the link to sealuzh/msr18-docker-dataset and downloaded the database dump.
However, the following query returns no rows :(

# select * from project where giturl like '%9seconds/homebrew-q';
 project_id | git_url | created_at | i_forks | giturl | i_network_count | i_open_issues | i_owner_type | repo_id | repo_path | i_size | i_stargazers | i_subscribers | i_watchers 
------------+---------+------------+---------+--------+-----------------+---------------+--------------+---------+-----------+--------+--------------+---------------+------------
(0 rows)

9seconds/homebrew-q is listed in the file below, and the repository does exist at present.

I suspect the search engine had not indexed this project when you retrieved repositories containing a Dockerfile in 2018.
It would be very kind of you to provide the original dataset.
(What I am actually trying to get is the exact commit from which you built each Docker image.)

PS:
Of the 560 projects you built, 469 cannot be found in the new dataset, and 3 projects have two entries each. Here is an example:

# select * from project where giturl like '%nicferrier/elnode';
 project_id |                git_url                 | created_at | i_forks |                giturl                | i_network_count | i_open_issues | i_owner_type | repo_id |     repo_path     | i_size | i_stargazers | i_subscribers | i_watchers 
------------+----------------------------------------+------------+---------+--------------------------------------+-----------------+---------------+--------------+---------+-------------------+--------+--------------+---------------+------------
       2649 | git://github.com/nicferrier/elnode.git |          0 |      48 | https://github.com/nicferrier/elnode |              48 |            20 | User         |  962447 | nicferrier/elnode |   2084 |          445 |            44 |        445
      11304 | git://github.com/nicferrier/elnode.git |          0 |      48 | https://github.com/nicferrier/elnode |              48 |            20 | User         |  962447 | nicferrier/elnode |   2084 |          442 |            44 |        442
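
For anyone who wants to reproduce counts like these, here is a minimal sketch of one way to check a list of repositories against the `project` table. It is an illustration under assumptions, not the exact procedure used above: it matches on `repo_path` by equality rather than on `giturl` with `like`, the helper name `check_projects` is hypothetical, and an in-memory SQLite table with only two columns stands in for the real PostgreSQL dump.

```python
import sqlite3


def check_projects(conn, built_repos):
    """Return (missing, duplicated) for the given "owner/name" repo paths.

    missing:    repos with no row in `project`
    duplicated: repos with more than one row in `project`
    """
    missing, duplicated = [], []
    for repo in built_repos:
        (count,) = conn.execute(
            "SELECT COUNT(*) FROM project WHERE repo_path = ?", (repo,)
        ).fetchone()
        if count == 0:
            missing.append(repo)
        elif count > 1:
            duplicated.append(repo)
    return missing, duplicated


# Stand-in for the real dump: only the columns needed for this check.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE project (project_id INTEGER, repo_path TEXT)")
conn.executemany(
    "INSERT INTO project VALUES (?, ?)",
    [(2649, "nicferrier/elnode"), (11304, "nicferrier/elnode")],
)

# Hypothetical input list; the real one would hold the 560 built projects.
missing, duplicated = check_projects(
    conn, ["9seconds/homebrew-q", "nicferrier/elnode"]
)
print("missing:", missing)
print("duplicated:", duplicated)
```

Against the real dump, the same two queries could be run through a PostgreSQL client instead of SQLite; only the connection and the parameter placeholder style would change.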