Research

i-m-Top: Interactive Multi-resolution Tabletop Display

i-m-Top is a rear-projection interactive multi-resolution tabletop system. The aim of the table is for a user to perform regular and long-term works such as reading and writing of documents. The design of the tabletop system well meets the following concerns:

(1) provide movable high-resolution perception of the tabletop,
(2) avoid the shadowing effect by rear projection,
(3) reserve space for the user to place their legs with comfort,
(4) allow simple interactions on the tabletop by bare hands and a pen.


Links
[ACMMM2010 short paper]     [SIG CHI 2009 paper]     [IEEE ITS 2008 paper]     [Video]

Enabling Beyond Surface Interactions

Our vision is to enable a spatially-aware multi-display and multi-touch tabletop system. By knowing the 6D positions of the mobile display devices with respect to the tabletop display, users are able to view additional information with mobile displays. One of our proposed methods shows a programmable infrared (IR) technique that enables mobile devices to interact with display surfaces, while supporting multi-touch and direct on-surface tangible input. Current interactive surfaces are designed to sense 2D interactions such as finger touches. The ability to interact beyond the surface makes it possible to support cooperative multi-display interaction. For example, an architect may examine a 2D blueprint of a building shown on the display while inspecting 3D views of the same structure by moving a mobile display above the display surface.


Links
[UIST 2010 paper]     [Video]

Tubular Interactive Multi-Resolution Display System

We propose i-m-Tube as a tubular interface which provides multiple users interaction with multimedia content by multi-touch and multi-resolution display. With the tubular surface of i-m-Tube, it is suitable for displaying panoramic image content like Chinese scroll painting “Along the River During the Ch'ing Ming Festival” which has been regarded as a national treasure and widely known by its extraordinary width and its various details. The strength of our system is not just suitable for displaying panoramic content, but also possible to create an intuitive and natural context for multi-user interaction with multimedia content displayed on i-m-Tube by touch.


Links
[ACM MM 2009 paper]

Magic Crystal Ball

Magic Ball is a spherical display system, which allows the users to see a virtual object/scene appearing inside a transparent sphere, and to manipulate the displayed content with barehanded interactions. Magic Crystal Ball provides the users to perform touch and hover interactions by their bare hands. The user can wave hands above the ball, followed by computer-generated clouds blowing from bottom of the ball quickly surrounding the displayed content. When the user slide fingers on the ball, the viewing direction of the displayed content is changed accordingly. Magic Crystal Ball takes advantages on the impressions to crystal balls, allowing for the users acting with visual media following their imaginations.


Links
[VRST 2007 paper]

Panorama-Based Interacting with the Physical Environment

In this project, we present an intuitive user interface for interacting with the physical environment. A panorama of the environment is displayed on the handheld device equipped with an orientation sensor to align the panoramic views with the real world. A user can interact with objects within the environment by selecting the corresponding items on the display. Perceiving the same scene on both the handheld and the real world, users can select items quickly and accurately. Furthermore, the panorama is augmented with additional information to enhance better understanding of the environment. The simple yet intuitive selection scheme allows users to exchange information in the physical environment or to control devices such as consumer appliances. A prototype system has been implemented to demonstrate the system in e-home and museum guide applications.


Links
[UbiComp 2005 paper]

e-Fovea: Large-Scale and High-Resolution Monitoring System

e-Fovea, a system that combines both multi-resolution camera input and multi-resolution steerable projector output to support large-scale and high-resolution visual monitoring. e-Fovea utilizes a design similar to the human eyes, which provides peripheral vision with a steerable fovea that is in higher resolution. e-Fovea is implemented using a steerable telephoto camera and a wide-angle camera. The telephoto image is displayed using a projector with a steerable mirror, and overlaid on the wide-angle image that is displayed using a second projector.


Links
[ACM MM 2010 paper]      [Video]

Egocentric View Transition

To monitor multiple views in the control center, generally, users have to choose one of multiple video streams to pay attention to; when switching from one view to another, the flash view change lets users hardly understand the geometry between real cameras. In this project, we propose a novel approach to egocentric view transition, which synthesizes the virtual views during the period of switching cameras, to ease the mental effort for users to understand the events. An important property of our system is that it can be applied to the situations of where the view fields of transition cameras are not close enough or even exclusive. Such situations have never been taken into consideration in the state-of-the-art view transition techniques, to our best knowledge.


Links
[MMM 2011 paper]      [IUI 2009 paper]      [Video]

Target Tracking across Multiple Cameras

We have developed an adaptive learning method for tracking targets across multiple cameras with disjoint filed of views. There are usually two visual cues employed for tracking targets across cameras: spatial-temporal cue and appearance cue. To learn the relationships among cameras, traditional methods used batch-learning procedures or hand-labeled correspondence, which can work well only within a short period of time. In our system, we propose an unsupervised method which learns spatial-temporal relationships, appearance relationships, and the fusion weights adaptively and can be applied to long-term monitoring.


Links
[ICPR 2010 paper]      [CVPR 2008 paper]      [Video]

Face Recognition Based on Facial Trait Code

We propose the Facial Trait Code (FTC) to encode human facial images. A given face can be encoded at some prescribed facial traits to render an n-ary facial trait code with each symbol in its codeword corresponding to the closest Distinctive Trait Patterns (DTP). In order to handle the most rigorous face recognition scenario in which only one facial image per individual is available for enrollment and face variations caused by illumination, expression, pose or misalignment. We propose the Probabilistic Facial Trait Code (PFTC), with a novel encoding scheme and a probabilistic codeword distance measure. We also propose the Pattern-Specific Subspace Learning (PSSL) scheme that encodes and recognizes faces robustly under aforementioned variations.


Links
[ECCV 2010 paper]      [CVPR 2009 paper]