DESCRIPTION
We are currently seeking a Python Developer who is able to work independently or in a team, and can meet deadlines while producing high-quality code.
The ideal candidate is proactive, professional, self-motivated, and enjoys working on data projects.
Responsibilities include but are not limited to:
Building data pipelines to fetch data from multiple websites
Identifying which websites to gather data from, determining the type of data needed, and understanding the structure of each website to be scraped (HTML, APIs, web services)
Cleaning and transforming extracted data for further processing and analysis
Ensuring that the storage solution can handle the volume and variety of collected data
What do you need to know?
Excellent programming skills in Python and HTML
Knowledge of Web services and APIs
Familiarity with Python libraries for scraping and crawling, such as Beautiful Soup and Scrapy
Experience with unit testing to ensure high-quality software (experience with Selenium is a plus)
Knowledge of SQL and NoSQL databases, such as MongoDB
Basic understanding of Machine Learning, particularly for Natural Language Processing
Experience with R and Power BI is a plus
We are an equal opportunity employer and welcome all applicants regardless of age, race, color, religion, sex, sexual orientation, gender identity, or disability.