Filipowicz, Alex
ConjointNet: Enhancing Conjoint Analysis for Preference Prediction with Representation Learning
Zhang, Yanxia, Chen, Francine, Hakimi, Shabnam, Harinen, Totte, Filipowicz, Alex, Chen, Yan-Ying, Iliev, Rumen, Arechiga, Nikos, Murakami, Kalani, Lyons, Kent, Wu, Charlene, Klenk, Matt
Understanding consumer preferences is essential to product design and predicting market response to these new products. Choice-based conjoint analysis is widely used to model user preferences using their choices in surveys. However, traditional conjoint estimation techniques assume simple linear models. This assumption may lead to limited predictability and inaccurate estimation of product attribute contributions, especially on data that has underlying non-linear relationships. In this work, we employ representation learning to efficiently alleviate this issue. We propose ConjointNet, which is composed of two novel neural architectures, to predict user preferences. We demonstrate that the proposed ConjointNet models outperform traditional conjoint estimate techniques on two preference datasets by over 5%, and offer insights into non-linear feature interactions.
Using LLMs to Model the Beliefs and Preferences of Targeted Populations
Namikoshi, Keiichi, Filipowicz, Alex, Shamma, David A., Iliev, Rumen, Hogan, Candice L., Arechiga, Nikos
We consider the problem of aligning a large language model (LLM) to model the preferences of a human population. Modeling the beliefs, preferences, and behaviors of a specific population can be useful for a variety of different applications, such as conducting simulated focus groups for new products, conducting virtual surveys, and testing behavioral interventions, especially for interventions that are expensive, impractical, or unethical. Existing work has had mixed success using LLMs to accurately model human behavior in different contexts. We benchmark and evaluate two well-known fine-tuning approaches and evaluate the resulting populations on their ability to match the preferences of real human respondents on a survey of preferences for battery electric vehicles (BEVs). We evaluate our models against their ability to match population-wide statistics as well as their ability to match individual responses, and we investigate the role of temperature in controlling the trade-offs between these two. Additionally, we propose and evaluate a novel loss term to improve model performance on responses that require a numeric response.