Breakout Session [Hall 326B]
 in 
Affinity Workshop: 4th Women in Machine Learning (WiML) Un-Workshop
                        
                    
                    Data Diversity and Downstream impact
Judy Hanwen Shen · Paula Gradu
                        Abstract:
                        
                            
                    
                Discussion Questions
- How does data quality affect downstream trustworthy ML desiderata such as fairness, robustness, and explain- ability? 
- How do we measure data diversity and quantify the impacts of a lack of data diversity in downstream discriminative and generative tasks? 
- How do we weigh the importance of fairness and performance across training and deployment distributions in the presence of distribution shifts? 
- What common machine learning datasets are designed around features with may not predict the target label? How can we interrogate questions of predictability before developing algorithmic techniques for a particular task? 
Chat is not available.