In this project you should build the parser for the website. Website has table structure which parser will have to parse.
Website - [url removed, login to view]
Please see the attached video file.
Parser should be able to complete the following actions.
1. When table shows first time parser should click to display 100 results ( for further speed ) ( see video )
2. When page will display parser should click the button " Show More Columns " ( see video )
3. Parser should record header ( Gene Symbol Tissue Organism ) ( see video )
4. Parser should start parsing the table, when table is parsed parser should click the next button to parse next page ( » ) ( see video ). Parse should repeat 4. point till there are all records parsed.
if data inside the parsed cell of the table is html link element - parser can store html link element itself. (
<a class="external" target="_blank" href="[url removed, login to view]">SYNJ1</a> )
parser can store parsed data as tsv file, ( same as csv, only tab separated )
Parser should work with other links of this website, please check it also on this link:
[url removed, login to view]*
Can be written in any programming language, but preference is in Java.
29 pekerja bebas membida secara purata €141 untuk pekerjaan ini
Hi Sir/Madam, I'm expert in Python programming and I have a lot of experience with web scraping, so I can help You with this task. I would use requests&BeautifulSoup for parsing this site. Best regards, Fejs.