CalliRewrite:

Abstract

Human-like planning skills and dexterous manipulation have long posed challenges in the felds of robotics and artifcial intelligence (AI). The task of reinterpreting calligraphy presents a formidable challenge, as it involves the decomposition of strokes and dexterous utensil control. Previous efforts have primarily focused on supervised learning of a single instrument, limiting the performance of robots in the realm of cross-domain text replication.

To address these challenges, we propose CalliRewrite: a coarse-to-fne approach for robot arms to discover and recover plausible writing orders from diverse calligraphy images without requiring labeled demonstrations. Our model achieves fne-grained control of various writing utensils. Specifcally, an unsupervised image-to-sequence model decomposes a given calligraphy glyph to obtain a coarse stroke sequence. Using an RL algorithm, a simulated brush is fnetuned to generate stylized trajectories for robotic arm control. Evaluation in simulation and physical robot scenarios reveals that our method successfully replicates unseen fonts and styles while achieving integrity in unknown characters.

Methodology

We focus on unsupervised learning from plain images and discovering dexterous control over various utensils. Our method employs a hierarchical structure encompassing a CNN-encoded LSTM model to deduce stroke-level orders, and a reinforcement learning (RL) pipeline to fine-tune the coarse sequences into tool-aware stylized control, controlling the brush agent with soft-actor-critic (SAC) algorithm.

An overview of the training process of CalliRewrite pipeline. We develop our coarse sequence extraction module based on Mo et al. and propose tailored unsupervised loss functions for human-like glyph decomposition. In the second phase, we formulate the task into a constrained optimization task, leveraging SAC algorithm on our crafted environment to finetune dexterous control sequences. Coarse sequences are set into initial states to curtail ineffective exploration and boost training.

Real-World Experiments

CalliRewrite can inference plausible stroke orders and generate controls across diverse scripts.

Robotic Demonstration

Rewriting on various Chinese scripts: the Oracle (left, up); Seal (right, up); Clerical (left, mid); Regular (right, mid); running (left, down) and cursive (right, down). Each pair consists of an input glyph (left) and a robot-generated result (right).

Rewriting formal and cursive English scripts with fude pen. Each pair includes an input glyph (left) and a robot arm replica (right).

Results on Ancient Egyptian (left) and Tamil (right) characters replicated by Dobot Magician robot arm.

Rewrite on Same Glyphs

Rewriting on Chinese Characters. CalliRewrite controls the fude pen to master the handling of brush strokes in chinese characters, including the starting, ending, and pause strokes.

Rewriting on English Characters. CalliRewrite replicates variations in stroke thickness and controls the beginnings and endings of horizontal strokes with a Chinese calligraphy brush.