An foreign-invested technology company in Hanoi is recruiting a Data Engineer, working remotely. You will join the teams to serve clients from European countries. The job requires solid experience in data scraping using Python/Scrapy, data ETL, big data processing and good English communication skills. You wil report directly to the CEO(an expat).
To succeed in this job, you will have solid experience in Data ETL, knowledge of data science, experience with cloud platforms, and deploying cloud-based applications.
Responsibilities:
- Data collection: Efficient collection of large data sets either directly from clients through API access or through web scraping techniques (Scrapy).
- Data conversion: Converting complex formats such as PDFs into structured Markdown, processing textual and visual data extraction.
- Data processing and indexing: Optimize data for AI model ingestion and manage the indexing process to ensure data is searchable and used effectively in model training.
- Cloud and Infrastructure: Utilize Google cloud services to deploy and manage applications with a focus on automating data workflows and scalability of our cloud architecture.
Requirements:
- Strong proficiency in Python , especially for data-intensive applications. Knowledge of data science principles to process and prepare large amounts of data.
- Experience with cloud platforms, ideally Google Cloud, including cloud functions and storage solutions.
- Experienced in developing and deploying cloud-based tools, with a strong understanding of infrastructure automation and CI/CD practices.
- Good written and spoken English communication skills are essential.
- EU time zone alignment - for working hours.
Salary & Benefits:
- Basic salary: competitive(open to discuss)
- 13th month salary + other benefits
If you would like to apply for the job or find out more, please contact Hieu Nguyen (Henry) at TECHSPACES on