Do you want to work on cutting-edge projects with the world’s best IT engineers? Do you wish you could control which projects to work on and choose your own pay rate? Are you interested in the future of work and how the cloud will form teams? If so, the Gigster Talent Network is for you.
Our clients rely on our Network for two main areas: Software Development and Cloud Services. In some cases, they need help building great new products; in others, they want our expertise in migrating, maintaining, and optimizing their cloud solutions.
At Gigster, whether working with entrepreneurs to realize ‘the next great vision’ or with Fortune 500 companies to deliver a big product launch, we build really cool enterprise software on cutting-edge technology.
We are seeking an experienced Data Engineer with deep expertise in data transformation at scale, particularly in integrating and processing data from third-party public APIs. This role is critical to enhancing and maintaining data pipelines that feed into Natural Language Processing (NLP) models.
Responsibilities
Design, build, and optimize scalable ETL/ELT data pipelines using Apache Spark, Apache Kafka, and orchestration tools such as Prefect or Airflow
Integrate external data sources and public APIs with internal data systems
Work with large-scale datasets to support NLP model training and inference
Analyze existing pipelines and recommend enhancements for performance, reliability, and scalability
Collaborate with cross-functional teams, including data scientists and ML engineers
Own the end-to-end engineering process—from planning and technical design to implementation
Regularly report progress and outcomes to client stakeholders
Requirements
Proficiency in Python and experience with data transformation and data engineering best practices
Strong experience with Apache Spark, Apache Kafka, and Google Cloud Platform (GCP)
Hands-on experience with workflow orchestration tools (e.g., Prefect, Airflow)
Demonstrated experience working with large datasets and real-time data processing
Experience building and maintaining ETL/ELT pipelines for analytical or machine learning use cases
Self-motivated, with excellent communication and project ownership skills
Nice to have
Familiarity with financial services data or regulated data environments
Experience with Snowflake or Google BigQuery
Exposure to NLP workflows and data requirements for machine learning models
Founded in 2014 and headquartered in San Francisco, California, Gigster is a website that allows users to get tech projects built on demand.