Senior Machine Learning Data Engineer

at

Photoroom

Worldwide
Full Time
2mo ago
✨ About us ✨
PhotoRoom develops cutting-edge technology that empowers entrepreneurs, small businesses, and merchants to easily create images that sell - leveraging deep learning. Our ambition: power the internet’s commerce images.

Our first consumer product, the PhotoRoom app, is already a leader in mobile photo editing: we serve 100M+ users across more than 180 countries monthly, and our app was named Editor’s Choice by Apple and Google. We launched our API in 2023 as well, allowing us to deploy our tech at scale through a B2B motion.

We are at an exciting stage of our journey, having successfully raised our Series A and looking to scale to $100M ARR in 2024.

Our company is profitable, and our lean team is made of world-class experts in deep learning, product, and marketing with backgrounds at Apple, Algolia, or Google. We are a diverse team of entrepreneurs building for entrepreneurs.

TL;DR
PhotoRoom is looking for a talented senior ML data engineer. You will work on the data backbone of all the machine learning efforts at PhotoRoom, including training dedicated models, using multimodal data and, when applicable, managing data annotation. You will own a large scope and create large dynamic training sets, in the 100M to 1B image-text pairs scale. You will develop novel methods to track and learn about data distribution, and adapt it through the ingestion pipeline so that it best suit our needs. Your work will help a tightly knit team and serve millions of users; it will be useful and remarked.

🇪🇺 We are flexible: you can work from anywhere in Europe and come once a month in Paris (fully reimbursed), or come to the office more often.
✈️ We offer substantial support for relocation (10k€ signing bonus), including finding an apartment in Paris and supporting you with the visa procedure.
💻 Technology - new MacBook Pro, monitor, keyboard, etc.
🏖️ Socials - Quarterly company retreats, weekly Happy Hour & Game Time
🇬🇧 PhotoRoom is an international team and we work in English. We offer language lessons for those who need them (English & French).

✨ About the role ✨

    • You'll be joining a small team of less than 10 people, working alongside great talent from various backgrounds and contributing to world class results in the ML space.
    • You will have full ownership of your ML work streams when applicable, and yet collaborate closely with your team mates. You have expertise, you have a say in how that goes.
    • You will master the whole data processing pipeline, which leads to training sets creation and improvements. You will train and use ML models at scale as the needs arise and be an architect and builder in the ML team. To give some numbers, we currently manage north of 100M images, and have processed in various forms around 1B images in total.
    • You will get fast feedback from our team members and will iterate quickly. You will follow the usage of the product and make decisions based on that.
    • You will work within a small multicultural team composed of ~50 passionate, friendly & committed folks.

✨ About you ✨

    • You have experience with managing data at scale (10M+ images, 1T+ tokens), including storage, data lifecycle, analytics, visualization and online processing.
    • You can design and write a data ingestion pipeline using the cloud ecosystem, train models and use them to extract multi-modal information, you’re familiar with GPU processing at scale.
    • You have data science expertise in a multi-modal domain
    • You have a down-to-earth, pragmatic approach and know how to leverage existing resources (frameworks and libraries) to focus on what really matters and ship fast. You favor speed of iteration over perfect, always leverage frameworks and libraries to avoid reinventing the wheel.
    • You have a strong sense of ownership. You take initiative, and you are at ease in making product & technical decisions.
    • You have worked as part of a talented team and have experience in a fast-growing startup.
    • You are open and honest about the current stage of your work. You keep learning in your field, can share with your peers, and learn from them.
    • You are fluent in English (French is not required!)

Other elements we will value

    • You have worked as part of a talented team and have experience in a fast-growing startup.
    • You have experience with: large-scale training, dynamic (ever-changing) datasets, multimodal datasets, managing data labeling efforts
🌈 Diversity, Equity, Inclusion and Belonging
We are committed to enabling everyone to feel included and valued at the workplace. We believe both the company and its culture are strongest when composed of diverse experiences and backgrounds.
That's why:
- We have flexible working hours
- We trust people to work remotely
- We extended the length of the parental leave
All qualified applicants will receive consideration for employment without regard to age, color, family, gender identity, marital status, national origin, physical or mental disability, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws.
Apply for this job

Click on apply will take you to the actual job site or will open email app.

Click above box to copy link
Copied
Get exclusive remote work stories and fresh remote jobs, weekly 👇
View all remote jobs
Onkar By: Onkar