LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living
