
Modal Labs is a platform that helps data science and machine learning teams accelerate development, reduce costs, and scale workloads using generative AI models and serverless GPUs. Our pay-per-use model ensures efficient resource utilization, giving users access to powerful compute on demand without idle costs. Modal supports a wide range of workloads, including LLM inference and fine-tuning, generative model training and inference, and computational biology, among others.
Modal is building a serverless compute platform to support AI companies and is seeking engineers to optimize workloads, negotiate with cloud vendors, and design improvements to its scheduling system. The role involves system-wide optimization, from GPU costs to product offerings, with a focus on finding new and efficient solutions.