kwok
The Chinese AI app sending Hollywood into a panic
A new artificial intelligence (AI) model developed by the Chinese company behind TikTok rocked Hollywood this week - not just because of what it can do, but what it could mean for creative industries. Created by tech giant ByteDance, Seedance 2.0 can generate cinema-quality video, complete with sound effects and dialogue, from just a few written prompts. Many of the clips said to have been made using Seedance, and featuring popular characters like Spider-Man and Deadpool, went viral. What is Seedance - and why the stir? Seedance was launched to little fanfare in June 2025 but it is the second version that came eight months later that has caused a major stir.
Sharing AI tech to make world an inclusive place
At 25, Ms Annabelle Kwok has already made a name for herself. Two years ago, she made waves when she co-founded SmartCow, an artificial intelligence (AI) company that came up with an electronic board that could run various AI software. Last year, Ms Kwok left SmartCow and started NeuralBay, a company that focuses on detection and recognition software related to humans, objects and text, and offers AI-driven solutions for multinational corporations. Her clients include an aviation corporation, an automation industry company and chocolate company Ferrero. Ms Kwok traces her interest in tech to a box of Lego bricks with electric cables called Lego Mindstorms, which her parents, who work in banking, bought for her when she was in primary school.
Robust Semi-Supervised Learning through Label Aggregation
Yan, Yan (University of Technology Sydney) | Xu, Zhongwen (University of Technology Sydney) | Tsang, Ivor W. (University of Technology Sydney) | Long, Guodong (University of Technology Sydney) | Yang, Yi (University of Technology Sydney)
Semi-supervised learning is proposed to exploit both labeled and unlabeled data. However, as the scale of data in real world applications increases significantly, conventional semi-supervised algorithms usually lead to massive computational cost and cannot be applied to large scale datasets. In addition, label noise is usually present in the practical applications due to human annotation, which very likely results in remarkable degeneration of performance in semi-supervised methods. To address these two challenges, in this paper, we propose an efficient RObust Semi-Supervised Ensemble Learning (ROSSEL) method, which generates pseudo-labels for unlabeled data using a set of weak annotators, and combines them to approximate the ground-truth labels to assist semi-supervised learning. We formulate the weighted combination process as a multiple label kernel learning (MLKL) problem which can be solved efficiently. Compared with other semi-supervised learning algorithms, the proposed method has linear time complexity. Extensive experiments on five benchmark datasets demonstrate the superior effectiveness, efficiency and robustness of the proposed algorithm.
Cost-Sensitive Semi-Supervised Support Vector Machine
Li, Yu-Feng (Nanjing University, China) | Kwok, James T. (Hong Kong University of Science and Technology) | Zhou, Zhi-Hua (Nanjing University, China)
In this paper, we study cost-sensitive semi-supervised learning where many of the training examples are unlabeled and different misclassification errors are associated with unequal costs. This scenario occurs in many real-world applications. For example, in some disease diagnosis, the cost of erroneously diagnosing a patient as healthy is much higher than that of diagnosing a healthy person as a patient. Also, the acquisition of labeled data requires medical diagnosis which is expensive, while the collection of unlabeled data such as basic health information is much cheaper. We propose the CS4VM (Cost-Sensitive Semi-Supervised Support Vector Machine) to address this problem. We show that the CS4VM, when given the label means of the unlabeled data, closely approximates the supervised cost-sensitive SVM that has access to the ground-truth labels of all the unlabeled data. This observation leads to an efficient algorithm which first estimates the label means and then trains the CS4VM with the plug-in label means by an efficient SVM solver. Experiments on a broad range of data sets show that the proposed method is capable of reducing the total cost and is computationally efficient.