Customer-obsessed science


Research areas
-
March 27, 2025Training separate models on different datasets and then merging them reduces computational costs by as much as 91%.
-
-
Featured news
-
OOPSLA 20252025Software updates, including bug repair and feature additions, are frequent in modern applications but they often leave test suites outdated, resulting in undetected bugs and increased chances of system failures. A recent study by Meta revealed that 14%-22% of software failures stem from outdated tests that fail to reflect changes in the codebase. This highlights the need to keep tests in sync with code
-
NeurIPS 2024 Workshop on Tackling Climate Change with Machine Learning, Environmental Science and Technology2025Accurately quantifying greenhouse gas (GHG) emissions is crucial for organizations to measure and mitigate their environmental impact. Life cycle assessment (LCA) estimates the environmental impacts throughout a product’s entire lifecycle, from raw material extraction to end-of-life. Measuring the emissions outside of a product owner’s control is challenging, and practitioners rely on emission factors (
-
2025Composed Image Retrieval (CIR) is a complex task that aims to retrieve images based on a multimodal query. Typical training data consists of triplets containing a reference image, a textual description of desired modifications, and the target image, which are expensive and time-consuming to acquire. The scarcity of CIR datasets has led to zero-shot approaches utilizing synthetic triplets or leveraging vision-language
-
2025Open domain question answering systems frequently rely on information retrieved from large collections of text (such as the Web) to answer questions. However, such collections of text often contain conflicting information, and indiscriminately depending on this information may result in untruthful and inaccurate answers. To understand the gravity of this problem, we collect a human-annotated dataset, Question
-
AISTATS 20252025We study the well-motivated problem of online distribution shift in which the data arrive in batches and the distribution of each batch can change arbitrarily over time. Since the shifts can be large or small, abrupt or gradual, the length of the relevant historical data to learn from may vary over time, which poses a major challenge in designing algorithms that can automatically adapt to the best “attention
Academia
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all