Lessons from Archives: Strategies for Collecting Sociocultural Data in Machine Learning