Research Engineer: Superhuman Visual Generation

Atman Labs, London

About Atman Labs
At Atman Labs we are building software to emulate proactive human expertise. Emulating human experts who combine deep knowledge with proactive assistance has largely been impossible with standalone Artificial Intelligence techniques. As an applied research and commercialization company, we are deploying our products in a number of domains, from proactive shopping assistance to personal teachers to healthcare concierges, to demonstrate the value of our approach. This commercial focus advances our unique research, which lies at the intersection of Reinforcement Learning rewards, Large Scale Knowledge Representation, and Predictive Models inspired by biological priors.

The Next Frontier of Visual Foundation Models: Unlocking Human Engagement and New Consumer Behavior with Advanced Generative Interactions
We are hiring a founding research engineer responsible for designing and implementing pipelines that introduce superhuman and imaginative capabilities into our system and unlock complex commercial opportunities. This will involve deploying and fine-tuning an ensemble of state-of-the-art visual generation techniques and models, such as GANs and diffusion models. Human imagination is critical as the connective tissue between stored knowledge in the brain and vocalization in speech, using media like language, art, and sound to share and receive ideas. Our intelligent agents have a number of ways to interact with the world and other humans: they can imagine and communicate not just with language, but also by synthesizing immersive images that serve the task at hand.

As such, we seek to explore the frontiers of how agents can present information to humans across cutting-edge visual techniques, going beyond what human imagination and existing AI approaches can achieve in form factor, latency, and experience. For example, in our first commercial product, we are building personal shopping agents that have the ability to imagine what a human would look like wearing a new outfit instead of just describing it (“AI try-on”). Eventually, we can also imagine and create new visual mediums, such as video avatars, to interact with agents across any vertical.

You will be responsible for the cutting-edge deployment and fine-tuning of visual-generation algorithms, and will be familiar with ways to make them more performant and better tuned to specific use cases. Ideally, you have the technical experience or intuition to work with generative models, you have a deep understanding of the current state of the art across different types of GANs, Diffusion Models, NeRFs, and other Visual Foundation Models, and you are excited to work at the cutting edge of the interaction layer our agents have with the world. You will also be able to come up with ideas for multi-step pipelines that bring together different algorithms, knowing how to balance trade-offs between quality and computational efficiency.

About You
We are looking for ambitious and independent thinkers who have a deep desire to contribute and want to be part of the team that makes this a reality for humanity. In order to contribute, you should have all of these qualities:

You have a PhD in Computer Vision or equivalent industrial expertise in the training, deployment, and fine-tuning of visual generative models, including GANs, NeRFs, and diffusion models.
You are able to deeply evaluate the state of the art in the image-generation tasks relevant to a current commercial problem, like the aforementioned ‘AI try-on’ (source), understanding the architectural components of existing pipelines and hence why they achieve certain results. From first principles, research, and intuition, you can also generate your own ideas for a cutting-edge pipeline.
You have a strong creative imagination and high visual editorial standards for presentation of generated objects, which complements your technical intuition.
You consider yourself both a hacker and a painter.
You are eager to process and analyze large amounts of visual data.
You are intimately familiar with the nuances of fine-tuning or optimizing generative models for certain types of behaviors or subtasks, such as ControlNet for diffusion (source).
You understand and can critically communicate about state-of-the-art tools and frameworks to optimize model performance, including DeepSpeed, LoRA, 3D parallelism, or quantization.
You have 7+ years of programming experience in Python, have development experience with DL toolkits like PyTorch or TensorFlow, and can deploy models with clean APIs. You are as capable a software engineer as you are at formulating novel research ideas, and your code proves it.

Moreover, in order to deeply fit within our culture, you should embody the following:

You are capable of reasoning from first principles where there is no trodden path, as well as critically evaluating when existing ideas are worth considering.
You are articulate and can present your ideas in writing, in person, and in small groups, and are able to educate audiences at all levels on the novel applications and relevance of reinforcement learning.
You can easily distinguish authentic and high integrity thinkers from ‘posers’, while also critically evaluating truth from fiction in your own work.
Your colleagues consider you a highly positive personality; you amplify the energy of others rather than dampen the mood.
Your intensity goes from 0 to 1000 when you become authentically interested in a topic.
You not only have interests in reinforcement learning, but are deeply curious about a range of interdisciplinary topics, from knowledge graphs, recommendations, web-scale search, deep learning, generative AI models, and computer vision to the opportunity to build truly intelligent systems in software that are inspired by biology.
You can show high creativity and intensity in your personal pursuits, and your intelligence, creativity, and motivation are not limited to only one discipline.
You consider yourself an innovator and an original thinker, not a follower. You are looking for a way to contribute to the world, and want to join our team to do so.
You want to work in person in London. Don’t worry, we’ll sponsor your visa.

We’re excited to meet you. If you are too, send a short message with a list of your projects and highlights, as well as a brief paragraph of your life’s story, to shravan@atmanlabs.ai.