Appendix A Details of Modeling
–Neural Information Processing Systems
We retrieve top 10 passages and use them as input to mGEN. Gettysburg College, where he was a member of the Lambda Chi Alpha fraternity. We further subsample 50% of the synthetically generated questions. For our multilingual retriever, we split each article into 100-token chunks (Karpukhin et al., 2020), The original passage text file is 29GB, and the total index size is around 129 GB. Both two datasets are under the MIT licence.
Neural Information Processing Systems
Nov-13-2025, 23:28:22 GMT
- Country:
- Asia > China
- Hong Kong (0.05)
- Europe > Finland
- North America > United States
- California > San Bernardino County
- San Bernardino (0.04)
- New York > Oneida County
- Utica (0.04)
- California > San Bernardino County
- Asia > China
- Industry:
- Leisure & Entertainment (0.93)
- Media > Film (0.46)
- Technology: