calligraphy
Benchmarking Foundation Models on Exceptional Cases: Dataset Creation and Validation
Kang, Suho, Park, Jungyang, Ha, Joonseo, Kim, SoMin, Kim, JinHyeong, Park, Subeen, Song, Kyungwoo
Foundation models (FMs) have achieved significant success across various tasks, leading to research on benchmarks for reasoning abilities. However, there is a lack of studies on FMs performance in exceptional scenarios, which we define as out-of-distribution (OOD) reasoning tasks. This paper is the first to address these cases, developing a novel dataset for evaluation of FMs across multiple modalities, including graphic novels, calligraphy, news articles, and lyrics. It includes tasks for instance classification, character recognition, token prediction, and text generation. The paper also proposes prompt engineering techniques like Chain-of-Thought (CoT) and CoT+Few-Shot to enhance performance. Validation of FMs using various methods revealed improvements. The code repository is accessible at: https://github.com/MLAI-Yonsei/ExceptionalBenchmark
- Media > Music (0.68)
- Leisure & Entertainment (0.68)
- Media > News (0.48)
Moyun: A Diffusion-Based Model for Style-Specific Chinese Calligraphy Generation
Liu, Kaiyuan, Mei, Jiahao, Zhang, Hengyu, Zhang, Yihuai, Wu, Xingjiao, Dong, Daoguo, He, Liang
Although Chinese calligraphy generation has achieved style transfer, generating calligraphy by specifying the calligrapher, font, and character style remains challenging. To address this, we propose a new Chinese calligraphy generation model 'Moyun' , which replaces the Unet in the Diffusion model with Vision Mamba and introduces the TripleLabel control mechanism to achieve controllable calligraphy generation. The model was tested on our large-scale dataset 'Mobao' of over 1.9 million images, and the results demonstrate that 'Moyun' can effectively control the generation process and produce calligraphy in the specified style. Even for calligraphy the calligrapher has not written, 'Moyun' can generate calligraphy that matches the style of the calligrapher.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > China > Shanghai > Shanghai (0.05)
- North America > United States > New York > New York County > New York City (0.04)
CalliffusionV2: Personalized Natural Calligraphy Generation with Flexible Multi-modal Control
Liao, Qisheng, Li, Liang, Fei, Yulang, Xia, Gus
From oracle bone script to seal script, from clerical script to standard In this paper, we introduce CalliffusionV2, a novel system script, the evolution of Chinese characters bears witness to designed to produce natural Chinese calligraphy with flexible the development of Chinese culture. This influence extends multi-modal control. Unlike previous approaches that beyond China, impacting other East Asian countries such as rely solely on image or text inputs and lack fine-grained Korea and Japan, where Chinese calligraphy has also played control, our system leverages both images to guide generations a significant role. Despite its historical significance, in modern at fine-grained levels and natural language texts times, mastering calligraphy requires a significant time to describe the features of generations. CalliffusionV2 excels investment that many people today find difficult to accommodate at creating a broad range of characters and can quickly in their busy lives.
CalliRewrite: Recovering Handwriting Behaviors from Calligraphy Images without Supervision
Luo, Yuxuan, Wu, Zekun, Lian, Zhouhui
Human-like planning skills and dexterous manipulation have long posed challenges in the fields of robotics and artificial intelligence (AI). The task of reinterpreting calligraphy presents a formidable challenge, as it involves the decomposition of strokes and dexterous utensil control. Previous efforts have primarily focused on supervised learning of a single instrument, limiting the performance of robots in the realm of cross-domain text replication. To address these challenges, we propose CalliRewrite: a coarse-to-fine approach for robot arms to discover and recover plausible writing orders from diverse calligraphy images without requiring labeled demonstrations. Our model achieves fine-grained control of various writing utensils. Specifically, an unsupervised image-to-sequence model decomposes a given calligraphy glyph to obtain a coarse stroke sequence. Using an RL algorithm, a simulated brush is fine-tuned to generate stylized trajectories for robotic arm control. Evaluation in simulation and physical robot scenarios reveals that our method successfully replicates unseen fonts and styles while achieving integrity in unknown characters.
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
- Asia > China (0.04)
Archiving Body Movements: Collective Generation of Chinese Calligraphy
Zhou, Aven Le, Ye, Jiayi, Liu, Tianchen, Zhang, Kang
As a communication channel, body movements have been widely explored in behavioral studies and kinesics. Performing and visual arts share the same interests but focus on documenting and representing human body movements, such as for dance notation and visual work creation. This paper investigates body movements in oriental calligraphy and how to apply calligraphy principles to stimulate and archive body movements. Through an artwork (Wushu), the authors experiment with an interactive and generative approach to engage the audience's bodily participation and archive the body movements as a compendium of generated calligraphy. The audience assumes the role of both writers and readers; creating ("writing") and appreciating ("reading") the generated calligraphy becomes a cyclical process within this infinite "Book," which can motivate further attention and discussions concerning Chinese characters and calligraphy.
- North America > United States > Pennsylvania (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Hawaii (0.04)
- (6 more...)
- Research Report (0.90)
- Overview (0.54)
End-to-end Manipulator Calligraphy Planning via Variational Imitation Learning
Xie, Fangping, Meur, Pierre Le, Fernando, Charith
Planning from demonstrations has shown promising results with the advances of deep neural networks. One of the most popular real-world applications is automated handwriting using a robotic manipulator. Classically it is simplified as a two-dimension problem. This representation is suitable for elementary drawings, but it is not sufficient for Japanese calligraphy or complex work of art where the orientation of a pen is part of the user expression. In this study, we focus on automated planning of Japanese calligraphy using a three-dimension representation of the trajectory as well as the rotation of the pen tip, and propose a novel deep imitation learning neural network that learns from expert demonstrations through a combination of images and pose data. The network consists of a combination of variational auto-encoder, bi-directional LSTM, and Multi-Layer Perceptron (MLP). Experiments are conducted in a progressive way, and results demonstrate that the proposed approach is successful in completion of tasks for real-world robots, overcoming the distribution shift problem in imitation learning. The source code and dataset will be public.
- North America > United States > Montana (0.04)
- Asia > Japan (0.04)
- Research Report > New Finding (0.54)
- Instructional Material > Course Syllabus & Notes (0.54)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
End-to-End Rubbing Restoration Using Generative Adversarial Networks
Sun, Gongbo, Zheng, Zijie, Zhang, Ming
Rubbing restorations are significant for preserving world cultural history. In this paper, we propose the RubbingGAN model for restoring incomplete rubbing characters. Specifically, we collect characters from the Zhang Menglong Bei and build up the first rubbing restoration dataset. We design the first generative adversarial network for rubbing restoration. Based on the dataset we collect, we apply the RubbingGAN to learn the Zhang Menglong Bei font style and restore the characters. The results of experiments show that RubbingGAN can repair both slightly and severely incomplete rubbing characters fast and effectively.
Google's AI assistant has 5 New Year's resolutions for you
Google Assistant and the smart speaker Google Home have some opinions on how to be a better person in 2017. Ask "What should my New Year's resolution be?" and the AI assistant will tell you to do things like write a novel or pick up calligraphy. Here are the five answers to the question Google Assistant gave VentureBeat when we asked earlier today. Nothing earth-shattering or controversial about this ability, but it's not a question Alexa or Siri attempt to answer. You could write a novel or keep a journal.
Aesthetic Visual Quality Evaluation of Chinese Handwritings
Sun, Rongju (Peking University) | Lian, Zhouhui (Peking University) | Tang, Yingmin (Peking University) | Xiao, Jianguo (Peking University)
Aesthetic evaluation of Chinese calligraphy is one of the most challenging tasks in Artificial Intelligence. This paper attempts to solve this problem by proposing a number of aesthetic feature representations and feeding them into Artificial Neural Networks. Specifically, 22 global shape features are presented to describe a given handwritten Chinese character from different aspects according to classical calligraphic rules, and a new 10-dimensional feature vector is introduced to represent the component layout information using sparse coding. Moreover, a Chinese Handwriting Aesthetic Evaluation Database (CHAED) is also built by collecting 1000 Chinese handwriting images with diverse aesthetic qualities and inviting 33 subjects to evaluate the aesthetic quality for each calligraphic image. Finally, back propagation neural networks are constructed with the concatenation of the proposed features as input and then trained on our CHAED database for the aesthetic evaluation of Chinese calligraphy. Experimental results demonstrate that the proposed AI system provides a comparable performance with human evaluation. Through our experiments, we also compare the importance of each individual feature and reveal the relationship between our aesthetic features and the aesthetic perceptions of human beings.
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.89)