Goto

Collaborating Authors

 Law


Fair Play in the Newsroom: Actor-Based Filtering Gender Discrimination in Text Corpora

arXiv.org Artificial Intelligence

Language corpora are the foundation of most natural language processing research, yet they often reproduce structural inequalities. One such inequality is gender discrimination in how actors are represented, which can distort analyses and perpetuate discriminatory outcomes. This paper introduces a user-centric, actor-level pipeline for detecting and mitigating gender discrimination in large-scale text corpora. By combining discourse-aware analysis with metrics for sentiment, syntactic agency, and quotation styles, our method enables both fine-grained auditing and exclusion-based balancing. Applied to the taz2024full corpus of German newspaper articles (1980-2024), the pipeline yields a more gender-balanced dataset while preserving core dynamics of the source material. Our findings show that structural asymmetries can be reduced through systematic filtering, though subtler biases in sentiment and framing remain. We release the tools and reports to support further research in discourse-based fairness auditing and equitable corpus construction.







Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs

Neural Information Processing Systems

To address these issues, we introduced JailTrickBench to evaluate the impact of various attack settings on LLM performance and provide a baseline for jailbreak attacks, encouraging the adoption of a standardized evaluation framework.


Supplementary Materials: FiV A: Fine-grained Visual Attribute Dataset for T ext-to-Image Diffusion Models

Neural Information Processing Systems

Section A. We then introduce additional details on dataset construction in Section B. Further, we Finally, we discuss the limitations and future work of the project in Section D. Please also find the Details on attribute taxonomy and statistics. We visualize the rough distribution of visual attributes and subjects on the left. We also visualize the attribute alignment accuracy via human validation here. Due to space limitations, only 15 sub-subjects are listed for each major-subject. The result shows that Image 4 exhibits inconsistencies, with the reasons provided.



Toward a Well-Calibrated Discrimination via Survival Outcome-A ware Contrastive Learning

Neural Information Processing Systems

Previous deep learning approaches for survival analysis have primarily relied on ranking losses to improve discrimination performance, which often comes at the expense of calibration performance. To address such an issue, we propose a novel contrastive learning approach specifically designed to enhance discrimination without sacrificing calibration.