Goto

Collaborating Authors

 amenity


Grounded Persuasive Language Generation for Automated Marketing

Wu, Jibang, Yang, Chenghao, Mahns, Simon, Wang, Chaoqi, Zhu, Hao, Fang, Fei, Xu, Haifeng

arXiv.org Artificial Intelligence

This paper develops an agentic framework that employs large language models (LLMs) to automate the generation of persuasive and grounded marketing content, using real estate listing descriptions as our focal application domain. Our method is designed to align the generated content with user preferences while highlighting useful factual attributes. This agent consists of three key modules: (1) Grounding Module, mimicking expert human behavior to predict marketable features; (2) Personalization Module, aligning content with user preferences; (3) Marketing Module, ensuring factual accuracy and the inclusion of localized features. We conduct systematic human-subject experiments in the domain of real estate marketing, with a focus group of potential house buyers. The results demonstrate that marketing descriptions generated by our approach are preferred over those written by human experts by a clear margin. Our findings suggest a promising LLM-based agentic framework to automate large-scale targeted marketing while ensuring responsible generation using only facts.


Predicting House Rental Prices in Ghana Using Machine Learning

Adzanoukpe, Philip

arXiv.org Artificial Intelligence

The housing market in Ghana has been facing significant challenges, with the rental sector being particularly affected by issues such as the advance rent system, asymmetrical perceptions between landlords and tenants, and the lack of an institutional framework for regulating the market [2]. These challenges create a highly dynamic and often opaque rental environment, where both tenants and landlords face difficulties in determining fair rental prices. This issue is further exacerbated by the absence of comprehensive and up-to-date data on rental trends, making it challenging for stakeholders to make informed decisions. In recent years, the use of machine learning in real estate has gained traction globally as a means to address such challenges. Machine learning (ML) models can analyse large datasets, uncover hidden patterns, and make accurate predictions, thereby providing valuable insights for various stakeholders in the housing market.


Urban context and delivery performance: Modelling service time for cargo bikes and vans across diverse urban environments

Schrader, Maxwell, Kumar, Navish, Sørig, Esben, Yoon, Soonmyeong, Srivastava, Akash, Xu, Kai, Astefanoaei, Maria, Collignon, Nicolas

arXiv.org Artificial Intelligence

Light goods vehicles (LGV) used extensively in the last mile of delivery are one of the leading polluters in cities. Cargo-bike logistics and Light Electric Vehicles (LEVs) have been put forward as a high impact candidate for replacing LGVs. Studies have estimated over half of urban van deliveries being replaceable by cargo-bikes, due to their faster speeds, shorter parking times and more efficient routes across cities. However, the logistics sector suffers from a lack of publicly available data, particularly pertaining to cargo-bike deliveries, thus limiting the understanding of their potential benefits. Specifically, service time (which includes cruising for parking, and walking to destination) is a major, but often overlooked component of delivery time modelling. The aim of this study is to establish a framework for measuring the performance of delivery vehicles, with an initial focus on modelling service times of vans and cargo-bikes across diverse urban environments. We introduce two datasets that allow for in-depth analysis and modelling of service times of cargo bikes and use existing datasets to reason about differences in delivery performance across vehicle types. We introduce a modelling framework to predict the service times of deliveries based on urban context. We employ Uber's H3 index to divide cities into hexagonal cells and aggregate OpenStreetMap tags for each cell, providing a detailed assessment of urban context. Leveraging this spatial grid, we use GeoVex to represent micro-regions as points in a continuous vector space, which then serve as input for predicting vehicle service times. We show that geospatial embeddings can effectively capture urban contexts and facilitate generalizations to new contexts and cities. Our methodology addresses the challenge of limited comparative data available for different vehicle types within the same urban settings.


Enhancing Incremental Summarization with Structured Representations

Hwang, EunJeong, Zhou, Yichao, Wendt, James Bradley, Gunel, Beliz, Vo, Nguyen, Xie, Jing, Tata, Sandeep

arXiv.org Artificial Intelligence

Large language models (LLMs) often struggle with processing extensive input contexts, which can lead to redundant, inaccurate, or incoherent summaries. Recent methods have used unstructured memory to incrementally process these contexts, but they still suffer from information overload due to the volume of unstructured data handled. In our study, we introduce structured knowledge representations ($GU_{json}$), which significantly improve summarization performance by 40% and 14% across two public datasets. Most notably, we propose the Chain-of-Key strategy ($CoK_{json}$) that dynamically updates or augments these representations with new information, rather than recreating the structured memory for each new source. This method further enhances performance by 7% and 4% on the datasets.


QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL

Sun, Yinggang, Guo, Ziming, Yu, Haining, Liu, Chuanyi, Li, Xiang, Wang, Bingxuan, Yu, Xiangzhan, Zhao, Tiancheng

arXiv.org Artificial Intelligence

Fine-tuning large language models (LLMs) for specific domain tasks has achieved great success in Text-to-SQL tasks. However, these fine-tuned models often face challenges with multi-turn Text-to-SQL tasks caused by ambiguous or unanswerable questions. It is desired to enhance LLMs to handle multiple types of questions in multi-turn Text-to-SQL tasks. To address this, we propose a novel data augmentation method, called QDA-SQL, which generates multiple types of multi-turn Q\&A pairs by using LLMs. In QDA-SQL, we introduce a novel data augmentation method incorporating validation and correction mechanisms to handle complex multi-turn Text-to-SQL tasks. Experimental results demonstrate that QDA-SQL enables fine-tuned models to exhibit higher performance on SQL statement accuracy and enhances their ability to handle complex, unanswerable questions in multi-turn Text-to-SQL tasks. The generation script and test set are released at https://github.com/mcxiaoxiao/QDA-SQL.


Zero-shot Sequential Neuro-symbolic Reasoning for Automatically Generating Architecture Schematic Designs

Kodnongbua, Milin, Curtis, Lawrence H., Schulz, Adriana

arXiv.org Artificial Intelligence

This paper introduces a novel automated system for generating architecture schematic designs aimed at streamlining complex decision-making at the multifamily real estate development project's outset. Leveraging the combined strengths of generative AI (neuro reasoning) and mathematical program solvers (symbolic reasoning), the method addresses both the reliance on expert insights and technical challenges in architectural schematic design. To address the large-scale and interconnected nature of design decisions needed for designing a whole building, we proposed a novel sequential neuro-symbolic reasoning approach, emulating traditional architecture design processes from initial concept to detailed layout. To remove the need to hand-craft a cost function to approximate the desired objectives, we propose a solution that uses neuro reasoning to generate constraints and cost functions that the symbolic solvers can use to solve. We also incorporate feedback loops for each design stage to ensure a tight integration between neuro and symbolic reasoning. Developed using GPT-4 without further training, our method's effectiveness is validated through comparative studies with real-world buildings. Our method can generate various building designs in accordance with the understanding of the neighborhood, showcasing its potential to transform the realm of architectural schematic design.


City Foundation Models for Learning General Purpose Representations from OpenStreetMap

Balsebre, Pasquale, Huang, Weiming, Cong, Gao, Li, Yi

arXiv.org Artificial Intelligence

Pre-trained Foundation Models (PFMs) have ushered in a paradigm-shift in Artificial Intelligence, due to their ability to learn general-purpose representations that can be readily employed in a wide range of downstream tasks. While PFMs have been successfully adopted in various fields such as Natural Language Processing and Computer Vision, their capacity in handling geospatial data and answering urban questions remains limited. This can be attributed to the intrinsic heterogeneity of geospatial data, which encompasses different data types, including points, segments and regions, as well as multiple information modalities, such as a spatial position, visual characteristics and textual annotations. The proliferation of Volunteered Geographic Information initiatives, and the ever-increasing availability of open geospatial data sources, like OpenStreetMap, which is freely accessible globally, unveil a promising opportunity to bridge this gap. In this paper, we present CityFM, a self-supervised framework to train a foundation model within a selected geographical area of interest, such as a city. CityFM relies solely on open data from OSM, and produces multimodal representations of entities of different types, incorporating spatial, visual, and textual information. We analyse the entity representations generated using our foundation models from a qualitative perspective, and conduct quantitative experiments on road, building, and region-level downstream tasks. We compare its results to algorithms tailored specifically for the respective applications. In all the experiments, CityFM achieves performance superior to, or on par with, the baselines.


Age-Friendly Route Planner: Calculating Comfortable Routes for Senior Citizens

Aranguren, Andoni, Osaba, Eneko, Urra-Uriarte, Silvia, Molina-Costa, Patricia

arXiv.org Artificial Intelligence

The application of routing algorithms to real-world situations is a widely studied research topic. Despite this, routing algorithms and applications are usually developed for a general purpose, meaning that certain groups, such as ageing people, are often marginalized due to the broad approach of the designed algorithms. This situation may pose a problem in cities which are suffering a slow but progressive ageing of their populations. With this motivation in mind, this paper focuses on describing our implemented Age-Friendly Route Planner, whose goal is to improve the experience in the city for senior citizens. In order to measure the age-friendliness of a route, several variables have been deemed, such as the number of amenities along the route, the amount of comfortable elements found, or the avoidance of sloppy sections. In this paper, we describe one of the main features of the Age-Friendly Route Planner: the preference-based routes, and we also demonstrate how it can contribute to the creation of adapted friendly routes.


Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap

Staniek, Michael, Schumann, Raphael, Züfle, Maike, Riezler, Stefan

arXiv.org Artificial Intelligence

We present Text-to-OverpassQL, a task designed to facilitate a natural language interface for querying geodata from OpenStreetMap (OSM). The Overpass Query Language (OverpassQL) allows users to formulate complex database queries and is widely adopted in the OSM ecosystem. Generating Overpass queries from natural language input serves multiple use-cases. It enables novice users to utilize OverpassQL without prior knowledge, assists experienced users with crafting advanced queries, and enables tool-augmented large language models to access information stored in the OSM database. In order to assess the performance of current sequence generation models on this task, we propose OverpassNL, a dataset of 8,352 queries with corresponding natural language inputs. We further introduce task specific evaluation metrics and ground the evaluation of the Text-to-OverpassQL task by executing the queries against the OSM database. We establish strong baselines by finetuning sequence-to-sequence models and adapting large language models with in-context examples. The detailed evaluation reveals strengths and weaknesses of the considered learning strategies, laying the foundations for further research into the Text-to-OverpassQL task.


ST-RAP: A Spatio-Temporal Framework for Real Estate Appraisal

Lee, Hojoon, Jeong, Hawon, Lee, Byungkun, Lee, Kyungyup, Choo, Jaegul

arXiv.org Artificial Intelligence

Recent studies have attempted to address this limitation by adopting graph neural networks to model spatial relationships between In this paper, we introduce ST-RAP, a novel Spatio-Temporal framework properties [4, 14, 35]. These models represent spatial relationships for Real estate APpraisal. ST-RAP employs a hierarchical as a graph, with each node denoting a property. For example, in architecture with a heterogeneous graph neural network to encapsulate MugRep [35], nodes are connected based on geographical proximity, temporal dynamics and spatial relationships simultaneously.