Freshersworld does not charge any amount for job placement. Beware of fraudsters who ask you to pay on the pretext of giving a job. Know More

Post A Job

Data Integration Engineer Jobs in Gurgaon - ClimaCell

Data Integration Engineer

ClimaCell
experience-icon 0 to 3 Years
salary-icon Not disclosed
qualification-icon BE/B.Tech, Other Course
Expired

Posted: 01 Mar 21

Job Description

Climacell's R&D Team is a mixture of scientists and engineers committed to generating the best and most novel data and models across all times: historical, real-time and forecast.

We also focus on making everything become a weather station (from cars to microwave links to IoT devices).

The story just begins when the data hits our ingest and post-processing services.

Every product that the user sees is the result of a pipeline of algorithms that needs to be run quickly and continuously. We are the team that builds the architecture behind the data and the models, to prepare the weather analyses, for the Product and Engineering team to serve to the masses.

As Data Integration Engineers, you'll ensure we are getting high quality and current data from the source systems, and that this data is reliably uploaded to the cloud storage in a uniform format that can be consumed by the ClimaCell models.

The ideal candidate would possess several skill sets including proficiency with python, data technologies and SQL, passion for data analytics, and excellent communication skills.

You will be working closely with Meteorological Data Engineer, understand requirements and build seamless data pipes that include but are not limited to extraction of data directly from sources, normalize or transform into unified schemas and structure, load it into databases with highest possible data quality and lowest possible latency. And when required working with Sr. Data scientists to refine and align technical and scientific requirements.

This is a fantastic role for a data geek with awesome skills for someone that wants to jump into an extremely fast-cycle deployment world. Your energy and enthusiasm to work and grow alongside our high-trajectory team are essential!

What You'll Be Doing
  • Develops and maintains scalable data pipelines using third party platform or custom development using web scraping, python based data retrieval applications
  • Builds and maintain new API integrations to support continuing increases in data volume and complexity
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Collaborates with Meteorological Data Engineer and data scientists to improve observation collection methodologies, data pipelines, observation quality to increase data accessibility
  • Writes unit/integration tests, contributes to engineering wiki, and documents work.
  • Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
  • Works closely with all business units and engineering teams to develop strategy for long term data platform architecture.


What You Bring
  • 5+ years of experience in a Data Integration Engineer role, who has attained a Graduate and Post-Graduate degree in Computer Science, Statistics, Informatics, Information Systems or another quantitative field
  • 5+ years experience in Python with working knowledge of building ETL pipelines/processes supporting data transformation, data structures, metadata, dependency and workload management.
  • 3+ years Data Integration technologies and principles. Experience as a data engineer (working with databases,data pipelines, architectures and data sets)
  • Strong experience integrating data from structured and unstructured formats e.g. XML, EDI, JSON, CSVs
  • Experience working with files, databases and streamed data
  • Experience integrating data from various sources in multiple protocol environments such as REST, HTTP, FTP / SFTP, RSS meetds etc
  • Experience scraping and parsing HTML pages to extract embedded data
  • Working knowledge of message queuing, stream processing, and highly scalable 'big data' data stores
  • Experience with relational SQL and NoSQL databases, including Postgres.
  • Winning + Can-Do attitude, fire & forget personality, can take responsibilities, friendly and approachable, ready to take challenges individually, last but not the least excellent team player.

Bonus points
  • Experience on various cloud based data integration platforms such as Azure's data factory, Xplenty, Talent, Informatica power center etc.
  • Familiarity with meteorological file formats such as GRIB2, BUFR, and NetCDF
  • Data science background with automation experience.
  • Experience with AWS and Google cloud services: EC2, EMR, RDS, Redshift, Bigquery, Cloud SQL, Compute engine etc
  • Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.

So if Data is your middle name, you're a creative thinker and your friends call you when there's a big problem to solve and if you love rain - this is the team for you.

If you have reached this point and you are super excited but not sure you check all the boxes - we still want to speak with you! Your passion is priceless. Other things can be learned.

___________________________________________________________________________________________

About ClimaCell:

ClimaCell is the world's leading weather intelligence engine, powering the most compelling insights for teams to drive actions and collaboration around the world.

Building the most robust weather platform in the world, ClimaCell's technology enables partners like JetBlue, Ford, AWS, The New England Patriots, National Grid and more to understand how the weather impacts their business operations and implement predictive action plans to improve efficiency, safety, and revenue.

ClimaCell's stands apart through its AI powered Weather-of-Things technology, which offers hyper local weather insights for businesses via aggregation and data modeling from wireless networks, street cameras, drones, connected cars and more all around the globe.

Founded in 2015 by Harvard Business School and MIT Sloan former jet pilots and weather enthusiasts, ClimaCell is in hyper growth mode with 7x revenue growth in 2019 and offices in Boston, Boulder, Tel Aviv, and Singapore. ClimaCell also operates ClimaCell.org, a non-profit focused on improving access to weather data globally, helping save and transform billions of lives in developing countries.

How we roll: We work in an 'one office' environment. We believe that magic happens when people work together. Together also includes Zoom meetings, flexible hours and unlimited vacation days. Your success is achieved by your impact and deliveries and not by the hours you put in. We believe in transparency and directness, putting work before ego and empathy. We grow fast and move faster but we always see people first. Each person has their own career growth path for we believe that the only

Job Particulars

Education BE/B.Tech, Other Course
Who can apply Freshers and Experienced (0 to 3 Years )
Hiring Process Face to Face Interview
Employment Type0
Job Id1132984
Job Category Core Technical
Locality Address
Country India

About Company

ClimaCell
Jobs By Location
Others also searched for
Job & career videos
scroll-icon scroll-icon
scroll-icon youtube-img
scroll-icon youtube-img
scroll-icon youtube-img
scroll-icon youtube-img
scroll-icon youtube-img
scroll-icon youtube-img
scroll-icon youtube-img
scroll-icon youtube-img
scroll-icon youtube-img
ARE YOU A FRESHER? REGISTER NOW
Looking for your first Dream Job?
Update Resume
Upload Resume