Find Jobs
Hire Freelancers

Data Search and Parsing - Schools and Universities Globally

$30-100 USD

Ditutup
Disiarkan lebih dari 18 tahun yang lalu

$30-100 USD

Dibayar semasa penghantaran
for [login to view URL] - please get an account and create a private test journal to familiarise yourself with how this site works. SKILLS SUMMARY ============== You must have the following skills: * good search and research skills - finding datasets * any scripting language (your choice) ideally with Linux * web knowledge for retrieving and parsing data from sites (e.g. using the CURL library) * good understanding of SQL - preferably Postgresql * knowledge of character sets and encodings (we use UTF-8 for our database and text) * be tidy, meticulous and produce high quality data and code We will keep in reserve up to 25% on top of the bid price as a bonus which will be delivered depending on the quality. You will also be considered for future work, as we are an on-going operation and have several requirements. This can be considered your trial. Note that a lot of the research has already been done, so please look at the attached files, resources etc before putting in your quote. You are more likely to be the successful bid if you can send some code samples, to check for formatting, style, quality and structure. PROJECT DESCRIPTION =================== I would like to gather a list of all schools, colleges and universities globally. Your deliverable will be the script to gather this information from the respective websites, or from downloaded files and the resultant .SQL files containing the INSERT statements for the data. You will need to: * take the existing online resources (see below) and search for other additional data sources and resources to include educational institutions globally * ensure that these data sources contain the mandatory fields * download these data sources, or write scripts to harvest the data from the web pages where the information is present * deliver the resulting .SQL INSERT statements for the postgres database to insert these rows into the database table, and a list of sources, codes and references explaining where you obtained them MANDATORY fields ================ Each educational institution MUST HAVE THE FOLLOWING: * name - (UTF-8 encoding please) * two letter country code - e.g. US, NZ, AU, IN, TH etc - (ISO 3166 country code, see the attached base_country SQL file) * longitude and latitude (geographical coordinates) - these must be _decimal formatted_ and to at least three decimal places ie [login to view URL] - by decimal formatted, i mean where the decimal is out of 60 seconds, it needs to be converted to be out of 100 * type (either 'primary', 'secondary' or 'tertiary') - primary schools are for up to ages of approximately 10-14 - secondary schools are more normally known as high schools - tertiary is university, (or US: college), or polytechnic, technical school, MBA course, medical schools, professional training schools, OPTIONAL fields (where available) ================================= In decreasing order of importance: * name of town/city * ideally: the state, region, province or territory where the institution is found (e.g. CA for California, or Surat Thani in Thailand) - any format, e.g. text * type of school (e.g. IT, medical studies, arts, physics, engineering etc) * address of the educational institution * website of the institution * phone, email, fax of the institution Coverage ======== It would be good if you could find at least secondary and tertiary coverage globally, ie for each and every country, but we must have full coverage (primary, secondary, tertiary) for the following countries: United States, Canada, South Africa, Australia, New Zealand, EU countries (Britain, France, Germany, Sweden, Denmark, Norway, Spain, Italy, Poland etc), Asian countries (Japan, Korea, India, Thailand, ideally China, Hong Kong, Taiwan) and Ukraine/Russia (if available). Other countries (Bangladesh, Indonesia, African countries etc) will help earn your bonus, if you can manage them. Note that the trickiest ones here are the ones with foreign character sets (Chinese, Korean, Japanese, Russian). You can make a separate quote for this if you wish, after looking at what is available online, but you may well find a list that covers these places anyway. Resources and Reading ===================== List of colleges and universities (tertiary institutions) by country: [login to view URL] [login to view URL] Colleges and Universities by Country (seems to lack coordinates, pls check) [login to view URL] Schools in the World (seems to lack Country, and not have type) [login to view URL] [login to view URL] Site with lists of these things: [login to view URL] Google for other data sets, A function which converts decimal latitude and longitude into traditional (degrees/60 seconds) format:
ID Projek: 24938

Tentang projek

10 cadangan
Projek jarak jauh
Aktif 17 tahun yang lalu

Ingin menjana wang?

Faedah membida di Freelancer

Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan

Tentang klien

Bendera UNITED KINGDOM
London, United Kingdom
0.0
2
Ahli sejak Ogo 23, 2005

Pengesahan Klien

Terima kasih! Kami telah menghantar pautan melalui e-mel kepada anda untuk menuntut kredit percuma anda.
Sesuatu telah berlaku semasa menghantar e-mel anda. Sila cuba lagi.
Pengguna Berdaftar Jumlah Pekerjaan Disiarkan
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Memuatkan pratonton
Kebenaran diberikan untuk Geolocation.
Sesi log masuk anda telah luput dan telah dilog keluar. Sila log masuk sekali lagi.