For research on open-source contributions, we need scraped data from Drupal.org.
Source: [login to view URL]
Configurable fields for the scraper:
I. 'Core compatibility'; all other filters stay at their defaults (sort by 'Most installed', etc.)
II. Number of scraping results we want: X (the maximum is around 12,000 for Drupal 7 at the moment). There are 25 results per page, then a pager.
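A rough sketch of how X could translate into pages to fetch, given the 25 results per page mentioned above. The function names are hypothetical, and the zero-based page index is an assumption about the pager that should be checked against the actual overview URLs.

```python
import math

# From the posting: 25 results per page, then a pager.
RESULTS_PER_PAGE = 25


def pages_needed(wanted_results: int, per_page: int = RESULTS_PER_PAGE) -> int:
    """Return how many overview pages must be fetched to collect X results."""
    if wanted_results <= 0:
        return 0
    return math.ceil(wanted_results / per_page)


def page_indices(wanted_results: int, per_page: int = RESULTS_PER_PAGE) -> list[int]:
    """List the page indices to visit (assumed zero-based: first page is 0)."""
    return list(range(pages_needed(wanted_results, per_page)))
```

The last page may hold more teasers than needed, so the scraper should stop collecting once X rows have been written.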
See the attached .csv file for an example of the scraped data. Please note:
- The rank is the position of the teaser in the (filtered) Module overview (/project/project_module)
- 'Reported installs' and 'Downloads' need to be integers without commas
- The 'Posted by' date is formatted as a date
- There can be one or more 'Maintainers'
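The field rules above could be handled by a few small helpers like the ones below. This is only a sketch: all function names are hypothetical, and the input date format ("12 March 2014") and the "|" separator for multiple maintainers are assumptions that need to be matched to the real page markup and the attached .csv example.

```python
import re
from datetime import datetime

RESULTS_PER_PAGE = 25  # from the posting


def rank_of(page_index: int, position_on_page: int,
            per_page: int = RESULTS_PER_PAGE) -> int:
    """1-based rank of a teaser, given zero-based page and position indices."""
    return page_index * per_page + position_on_page + 1


def parse_count(text: str) -> int:
    """Turn a count such as '1,234,567' into the integer 1234567.

    Every non-digit character is dropped, so this is only suitable for
    whole-number counts, not decimals.
    """
    digits = re.sub(r"\D", "", text)
    if not digits:
        raise ValueError(f"no digits found in {text!r}")
    return int(digits)


def parse_posted_date(text: str, fmt: str = "%d %B %Y") -> str:
    """Parse a date string (format is an assumption) into ISO 'YYYY-MM-DD'."""
    return datetime.strptime(text.strip(), fmt).date().isoformat()


def join_maintainers(names: list[str], sep: str = "|") -> str:
    """Store one or more maintainers in a single CSV cell (separator assumed)."""
    return sep.join(name.strip() for name in names if name.strip())
```

For example, `parse_count("1,234,567")` gives `1234567`, and `join_maintainers(["alice", " bob "])` gives `"alice|bob"`.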
Please make sure your app doesn't make too many requests to [login to view URL]; we don't want to hurt the servers!
Not too complex for someone with scraper experience. Please make sure you reply with a custom message; default replies will not be read.
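One simple way to keep the request rate low is to enforce a minimum pause between consecutive requests. The sketch below is one possible approach, not a requirement: the `Throttle` class and the 2-second default are hypothetical and should be tuned to what the site tolerates.

```python
import time


class Throttle:
    """Enforce a minimum interval between consecutive calls to wait()."""

    def __init__(self, min_interval: float = 2.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval  # seconds between requests (assumed value)
        self._clock = clock               # injectable so the logic can be tested
        self._sleep = sleep
        self._last = None                 # time of the previous request, if any

    def wait(self) -> float:
        """Sleep if the previous request was too recent; return the delay used."""
        delay = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            delay = max(0.0, self.min_interval - elapsed)
            if delay > 0:
                self._sleep(delay)
        self._last = self._clock()
        return delay
```

Calling `throttle.wait()` immediately before each page fetch keeps the scraper to at most one request per `min_interval` seconds.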
Hi there, I have read the project description and it can be done in a day. I have good web scraping skills in Python and can write this script according to the project description. Hope to hear from you.
7 freelancers are bidding an average of €151 for this job
Hi, I have read your job posting carefully. I am looking for a job like this. I can do your job perfectly and assure you that you will be 100% satisfied with my work.
Hi, I would like to work on your project. I can show source code from previous work to confirm my skills. I use a logging framework and CSS selectors in my parsers. Feel free to ask any questions.