Adam Gaier, Alexander Asteroth, Jean-Baptiste Mouret Subjects: Neural and Evolutionary Computing (cs.NE); Computational Engineering, Finance, and Science (cs.CE); Machine Learning (stat.ML)
The MAP-Elites algorithm produces a set of high-performing solutions that
vary according to features defined by the user. This technique has the
potential to be a powerful tool for design space exploration, but is limited by
the need for numerous evaluations. The Surrogate-Assisted Illumination
algorithm (SAIL), introduced here, integrates approximative models and
intelligent sampling of the objective function to minimize the number of
evaluations required by MAP-Elites.
The ability of SAIL to efficiently produce both accurate models and diverse
high performing solutions is illustrated on a 2D airfoil design problem. The
search space is divided into bins, each holding a design with a different
combination of features. In each bin SAIL produces a better performing solution
than MAP-Elites, and requires several orders of magnitude fewer evaluations.
The CMA-ES algorithm was used to produce an optimal design in each bin: with
the same number of evaluations required by CMA-ES to find a near-optimal
solution in a single bin, SAIL finds solutions of similar quality in every bin.
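The core loop lends itself to a compact sketch. Below is a minimal, hypothetical Python illustration of surrogate-assisted illumination (a toy 2D problem, not the paper's airfoil domain): a Gaussian-process surrogate scores candidates with an optimistic upper-confidence bound, and a MAP-Elites-style archive keeps the best-scoring design per feature bin. In full SAIL the elites would then be evaluated for real and the surrogate retrained; the acquisition coefficient and bin definition here are illustrative assumptions.

```python
# Minimal sketch of surrogate-assisted illumination (hypothetical toy problem,
# not the paper's airfoil setup). A GP surrogate scores candidates and a
# MAP-Elites archive keeps the best-scoring design per feature bin.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

rng = np.random.default_rng(0)
fitness = lambda x: -np.sum((x - 0.5) ** 2)      # true (expensive) objective
feature = lambda x: min(int(x[0] * 10), 9)       # maps a design to 1 of 10 bins

# Seed the surrogate with a few true evaluations
X = rng.random((20, 2))
y = np.array([fitness(x) for x in X])
gp = GaussianProcessRegressor().fit(X, y)

archive = {}                                     # bin -> (design, acquisition)
for _ in range(2000):                            # cheap surrogate illumination
    x = rng.random(2)
    mu, sigma = gp.predict(x.reshape(1, -1), return_std=True)
    ucb = mu[0] + 1.0 * sigma[0]                 # optimistic acquisition (UCB)
    b = feature(x)
    if b not in archive or ucb > archive[b][1]:
        archive[b] = (x, ucb)

# The elites (one per bin) would then be evaluated for real and used to
# retrain the GP, repeating the loop with a growing set of true samples.
print({b: round(v, 3) for b, (x, v) in sorted(archive.items())})
```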
Comments: Accepted in DAC 2017
Subjects:
Neural and Evolutionary Computing (cs.NE)
; Artificial Intelligence (cs.AI)
Synapse crossbars are an elementary structure in Neuromorphic Computing Systems (NCS). However, the limited size of crossbars and heavy routing congestion impede NCS implementations of big neural networks. In this paper, we propose a two-step framework (namely, group scissor) to scale NCS designs to big neural networks. The first step is rank clipping, which integrates low-rank approximation into the training to reduce total crossbar area. The second step is group connection deletion, which structurally prunes connections to reduce routing congestion between crossbars. Tested on the convolutional neural networks LeNet on the MNIST database and ConvNet on the CIFAR-10 database, our experiments show significant reductions of crossbar area and routing area in NCS designs. Without accuracy loss, rank clipping reduces total crossbar area to 13.62% and 51.81% in the NCS designs of LeNet and ConvNet, respectively. Following rank clipping, group connection deletion further reduces the routing area of LeNet and ConvNet to 8.1% and 52.06%, respectively.
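Although the paper's rank clipping is woven into training, the underlying low-rank idea can be sketched with a plain truncated SVD: one large crossbar is factored into two smaller ones. The sizes and rank below are hypothetical, not the paper's settings.

```python
# Illustrative sketch of the low-rank idea behind rank clipping: replace an
# m-by-n weight matrix (one big crossbar) with two rank-r factors (two
# smaller crossbars). Numbers here are hypothetical, not from the paper.
import numpy as np

m, n, r = 256, 128, 16
W = np.random.randn(m, n)

U, s, Vt = np.linalg.svd(W, full_matrices=False)
A = U[:, :r] * s[:r]          # m x r crossbar
B = Vt[:r, :]                 # r x n crossbar

full_area = m * n
clipped_area = m * r + r * n  # area of the two factored crossbars
err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
print(f"area ratio {clipped_area / full_area:.2%}, relative error {err:.3f}")
```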
Comments: 8 pages, 5 figures
Subjects:
Neural and Evolutionary Computing (cs.NE)
A growing number of nature-inspired metaheuristic algorithms are being applied to real-world optimization problems, as they offer advantages over classical numerical optimization methods. This paper proposes a new nature-inspired metaheuristic, called the Whale Swarm Algorithm, for function optimization; it is inspired by whales’ behavior of communicating with each other via ultrasound while hunting. The proposed Whale Swarm Algorithm is compared with several popular metaheuristic algorithms on comprehensive performance metrics. According to the experimental results, the Whale Swarm Algorithm achieves quite competitive performance compared with the other algorithms.
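The abstract does not give the update rule, but one commonly described reading of the ultrasound metaphor is that each whale takes a random step toward its better-and-nearest neighbor, with attraction intensity decaying exponentially with distance. The sketch below follows that assumption; the constants rho0 and eta are illustrative, not the paper's values.

```python
# Hedged sketch of one Whale Swarm Algorithm iteration, assuming the commonly
# described rule: each whale takes a random step toward its better-and-nearest
# neighbor, damped exponentially in distance (ultrasound attenuation).
import numpy as np

rng = np.random.default_rng(1)
f = lambda x: np.sum(x ** 2, axis=-1)            # minimization objective
whales = rng.uniform(-5, 5, size=(30, 2))
rho0, eta = 2.0, 0.005                           # illustrative constants

for _ in range(100):
    fit = f(whales)
    for i in range(len(whales)):
        better = np.where(fit < fit[i])[0]       # whales with better fitness
        if better.size == 0:
            continue                             # current best stays put
        d = np.linalg.norm(whales[better] - whales[i], axis=1)
        j = better[np.argmin(d)]                 # better AND nearest whale
        step = rng.uniform(0, rho0 * np.exp(-eta * d.min()), size=2)
        whales[i] += step * (whales[j] - whales[i])

print("best value:", f(whales).min())
```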
Comments: Under review for CVPR 2017. Project webpage: this https URL
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
; Artificial Intelligence (cs.AI); Learning (cs.LG); Robotics (cs.RO)
We introduce a neural architecture for navigation in novel environments. Our
proposed architecture learns to map from first-person viewpoints and plans a
sequence of actions towards goals in the environment. The Cognitive Mapper and
Planner (CMP) is based on two key ideas: a) a unified joint architecture for
mapping and planning, such that the mapping is driven by the needs of the
planner, and b) a spatial memory with the ability to plan given an incomplete
set of observations about the world. CMP constructs a top-down belief map of
the world and applies a differentiable neural net planner to produce the next
action at each time step. The accumulated belief of the world enables the agent
to track visited regions of the environment. Our experiments demonstrate that
CMP outperforms both reactive strategies and standard memory-based
architectures and performs well in novel environments. Furthermore, we show
that CMP can also achieve semantically specified goals, such as ‘go to a
chair’.
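A much-simplified sketch of a differentiable grid planner of this kind (value iteration via repeated local max-propagation, in the spirit of value iteration networks) is shown below; the actual CMP planner is hierarchical and trained end to end, and the reward map here is a toy assumption.

```python
# Simplified sketch of a differentiable, grid-based planner in the spirit of
# CMP's value-iteration module: repeated local max-propagation of value over
# a top-down map. The real CMP planner is hierarchical and learned.
import numpy as np

H = W = 16
reward = np.full((H, W), -0.01)   # small step cost everywhere
reward[12, 12] = 1.0              # goal cell (hypothetical)
V = np.zeros((H, W))
gamma = 0.95

for _ in range(50):               # value iteration with 4-neighbor moves
    padded = np.pad(V, 1, constant_values=-np.inf)
    neighbors = np.stack([padded[:-2, 1:-1], padded[2:, 1:-1],
                          padded[1:-1, :-2], padded[1:-1, 2:]])
    V = reward + gamma * neighbors.max(axis=0)

# Greedy next action at a state = move toward the best-valued neighbor.
print("value at start cell (0, 0):", round(V[0, 0], 3))
```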
Comments: 10 pages, 10 figures
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
Segmenting human left ventricle (LV) in magnetic resonance imaging (MRI)
images and calculating its volume are important for diagnosing cardiac
diseases. In 2016, Kaggle organized a competition to estimate the volume of LV
from MRI images. The dataset consisted of a large number of cases, but only
provided systole and diastole volumes as labels. We designed a system based on
neural networks to solve this problem. It began with a detector combined with a
neural network classifier for detecting regions of interest (ROIs) containing
LV chambers. Then a deep neural network named hypercolumns fully convolutional
network was used to segment LV in ROIs. The 2D segmentation results were
integrated across different images to estimate the volume. With ground-truth
volume labels, this model was trained end-to-end. To improve the result, an
additional dataset with only segmentation labels was used. The model was trained
alternately on these two datasets with different types of teaching signals. We
also proposed a variance estimation method for the final prediction. Our
algorithm ranked 4th on the test set in this competition.
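The volume-integration step can be sketched simply: sum the segmented LV area on each short-axis slice and multiply by the slice spacing (a Riemann approximation). The spacing values below are hypothetical placeholders, not the competition's metadata.

```python
# Sketch of the volume-integration step: sum the segmented LV area on each
# 2D slice and multiply by slice spacing. Spacing values are hypothetical.
import numpy as np

def lv_volume_ml(masks, pixel_spacing_mm=(1.4, 1.4), slice_gap_mm=8.0):
    """masks: list of binary 2D arrays, one per short-axis slice."""
    pixel_area = pixel_spacing_mm[0] * pixel_spacing_mm[1]   # mm^2 per pixel
    areas = [m.sum() * pixel_area for m in masks]            # mm^2 per slice
    return sum(areas) * slice_gap_mm / 1000.0                # mm^3 -> mL

masks = [np.ones((10, 10), dtype=int) for _ in range(8)]     # toy segmentations
print(f"{lv_volume_ml(masks):.1f} mL")
```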
Comments: 8 Pages, 8 Figures
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
We introduce a novel, accurate and practical system for real-time people
tracking and identification. We used a Kinect V2 sensor for tracking that
generates a body skeleton for up to six people in the view. We perform
identification using both Kinect and passive RFID, by first measuring the
velocity vector of a person’s skeleton and of their RFID tag using the position
of the RFID reader antennas as reference points and then finding the best match
between skeletons and tags. We introduce a method for synchronizing Kinect
data, which is captured regularly, with irregular or missing RFID data
readouts. Our experiments show centimeter-level people tracking resolution with
80% average identification accuracy for up to six people in indoor
environments, which meets the needs of many applications. Our system can
preserve user privacy and work under different lighting conditions.
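The matching step can be viewed as an assignment problem: compare each skeleton's velocity vector with each tag's velocity estimate and pick the best one-to-one matching. The toy sketch below uses cosine similarity and the Hungarian algorithm; the velocities are made up, and the real system derives tag velocities from the RFID antenna geometry.

```python
# Sketch of the skeleton-to-tag matching step: compare velocity estimates and
# solve the best one-to-one assignment. Velocity values here are toy numbers.
import numpy as np
from scipy.optimize import linear_sum_assignment

skel_vel = np.array([[0.5, 0.1], [-0.3, 0.4], [0.0, -0.6]])  # m/s per skeleton
tag_vel = np.array([[-0.25, 0.45], [0.45, 0.05], [0.05, -0.55]])

# Cost = negative cosine similarity between velocity vectors
norm = lambda v: v / (np.linalg.norm(v, axis=1, keepdims=True) + 1e-9)
cost = -norm(skel_vel) @ norm(tag_vel).T

rows, cols = linear_sum_assignment(cost)         # Hungarian algorithm
for s, t in zip(rows, cols):
    print(f"skeleton {s} <-> tag {t}")
```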
An Efficient Decomposition Framework for Discriminative Segmentation with Supermodular Losses
Jiaqian Yu, Matthew B. Blaschko Subjects: Computer Vision and Pattern Recognition (cs.CV)
Several supermodular losses have been shown to improve the perceptual quality
of image segmentation in a discriminative framework such as a structured output
support vector machine (SVM). These loss functions do not necessarily have the
same structure as the one used by the segmentation inference algorithm, and in
general, we may have to resort to generic submodular minimization algorithms
for loss augmented inference. Although these come with polynomial time
guarantees, they are not practical to apply to image scale data. Many
supermodular losses come with strong optimization guarantees, but are not
readily incorporated in a loss augmented graph cuts procedure. This motivates
our strategy of employing the alternating direction method of multipliers
(ADMM) decomposition for loss augmented inference. In doing so, we create a new
API for the structured SVM that separates the maximum a posteriori (MAP)
inference of the model from the loss augmentation during training. In this way,
we gain computational efficiency, making new choices of loss functions
practical for the first time, while simultaneously making the inference
algorithm employed during training closer to the test time procedure. We show
improvement both in accuracy and computational performance on the Microsoft
Research Grabcut database and a brain structure segmentation task, empirically
validating the use of several supermodular loss functions during training, and
the improved computational properties of the proposed ADMM approach over the
Fujishige-Wolfe minimum norm point algorithm.
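The decomposition principle is easy to demonstrate on a toy problem: ADMM splits min f(x) + g(z) subject to x = z into two independent subproblems coordinated by a dual variable, mirroring how the paper separates MAP inference from loss augmentation. The quadratic f and g below are illustrative stand-ins, not structured-SVM terms.

```python
# Toy ADMM decomposition: minimize f(x) + g(z) subject to x = z, where f and
# g are handled by separate solvers. Here f, g are simple quadratics so each
# subproblem has a closed form; the structured-SVM case is analogous.
import numpy as np

a, b, rho = 3.0, -1.0, 1.0           # f(x) = (x-a)^2, g(z) = (z-b)^2
x = z = u = 0.0                      # u is the scaled dual variable
for _ in range(50):
    x = (2 * a + rho * (z - u)) / (2 + rho)   # argmin_x f(x) + rho/2 (x-z+u)^2
    z = (2 * b + rho * (x + u)) / (2 + rho)   # argmin_z g(z) + rho/2 (x-z+u)^2
    u += x - z                                # dual ascent on the constraint
print(f"x = {x:.4f}, z = {z:.4f} (optimum of f+g is at {(a + b) / 2})")
```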
Unsupervised temporal context learning using convolutional neural networks for laparoscopic workflow analysis
Sebastian Bodenstedt (1), Martin Wagner (2), Darko Katić (1), Patrick Mietkowski (2), Benjamin Mayer (2), Hannes Kenngott (2), Beat Müller-Stich (2), Rüdiger Dillmann (1), Stefanie Speidel (1) ((1) Institute for Anthropomatics and Robotics, Karlsruhe Institute of Technology, Karlsruhe, (2) Department of General, Visceral and Transplant Surgery, University of Heidelberg, Heidelberg) Subjects: Computer Vision and Pattern Recognition (cs.CV)
Computer-assisted surgery (CAS) aims to provide the surgeon with the right
type of assistance at the right moment. Such assistance systems are especially
relevant in laparoscopic surgery, where CAS can alleviate some of the drawbacks
that surgeons incur. For many assistance functions, e.g. displaying the
location of a tumor at the appropriate time or suggesting what instruments to
prepare next, analyzing the surgical workflow is a prerequisite. Since
laparoscopic interventions are performed via endoscope, the video signal is an
obvious sensor modality to rely on for workflow analysis.
Image-based workflow analysis tasks in laparoscopy, such as phase
recognition, skill assessment, video indexing or automatic annotation, require
a temporal distinction between video frames. Generally, computer-vision-based
methods that generalize from previously seen data are used. For training such
methods, large amounts of annotated data are necessary. Annotating surgical
data requires expert knowledge, therefore collecting a sufficient amount of
data is difficult, time-consuming and not always feasible.
In this paper, we address this problem by presenting an unsupervised method
for training a convolutional neural network (CNN) to differentiate between
laparoscopic video frames on a temporal basis. We extract video frames at
regular intervals from 324 unlabeled laparoscopic interventions, resulting in a
dataset of approximately 2.2 million images. From this dataset, we extract
image pairs from the same video and train a CNN to determine their temporal
order. To solve this problem, the CNN has to extract features that are relevant
for comprehending laparoscopic workflow.
Furthermore, we demonstrate that such a CNN can be adapted for surgical
workflow segmentation. We performed image-based workflow segmentation on a
publicly available dataset of 7 cholecystectomies and 9 colorectal
interventions.
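The self-supervised labeling described above reduces to a simple pair-construction routine, sketched below with placeholder frame identifiers standing in for real laparoscopic frames.

```python
# Sketch of the self-supervised pair construction: sample two frames from the
# same video and label the pair by their true temporal order. A CNN trained
# on these labels must learn workflow-relevant features to succeed.
import random

def make_pairs(video_frames, n_pairs):
    """video_frames: list of frames in temporal order."""
    pairs = []
    for _ in range(n_pairs):
        i, j = random.sample(range(len(video_frames)), 2)
        label = int(i < j)                      # 1 if first frame comes earlier
        pairs.append((video_frames[i], video_frames[j], label))
    return pairs

frames = [f"frame_{t:04d}" for t in range(1000)]  # placeholder frames
for a, b, y in make_pairs(frames, 3):
    print(a, b, "earlier-first" if y else "later-first")
```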
Comments: 14 pages
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
Underwater cameras are widely used to observe the sea floor. They are usually
included in autonomous underwater vehicles, unmanned underwater vehicles, and
in situ ocean sensor networks. Although they are important sensors for monitoring underwater scenes, current underwater camera sensors suffer from many issues. Because of the way light propagates in water and because of biological activity on the sea floor, acquired underwater images often exhibit scattering and large amounts of noise. Over the last five years, many
methods have been proposed to overcome traditional underwater imaging problems.
This paper aims to review the state-of-the-art techniques in underwater image
processing by highlighting the contributions and challenges presented in over
40 papers. We present an overview of various underwater image processing
approaches, such as underwater image descattering, underwater image color
restoration, and underwater image quality assessments. Finally, we summarize
the future trends and challenges in designing and processing underwater imaging
sensors.
Comments: 19 pages
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
As a result of several successful applications in computer vision and image
processing, sparse representation (SR) has attracted significant attention in
multi-sensor image fusion. Unlike the traditional multiscale transforms (MSTs)
that presume the basis functions, SR learns an over-complete dictionary from a
set of training images for image fusion, and it achieves more stable and
meaningful representations of the source images. By doing so, the SR-based
fusion methods generally outperform the traditional MST-based image fusion
methods in both subjective and objective tests. In addition, they are less
susceptible to mis-registration among the source images, thus facilitating the
practical applications. This survey paper proposes a systematic review of the
SR-based multi-sensor image fusion literature, highlighting the pros and cons
of each category of approaches. Specifically, we start by performing a
theoretical investigation of the entire system from three key algorithmic
aspects, (1) sparse representation models; (2) dictionary learning methods; and
(3) activity levels and fusion rules. Subsequently, we show how the existing
works address these scientific problems and design the appropriate fusion rules
for each application, such as multi-focus image fusion and multi-modality
(e.g., infrared and visible) image fusion. Finally, we carry out some
experiments to evaluate the impact of these three algorithmic components on the
fusion performance when dealing with different applications. This article is
expected to serve as a tutorial and source of reference for researchers
preparing to enter the field or who desire to employ the sparse representation
theory in other fields.
Ryo Takahashi, Takashi Matsubara, Kuniaki Uehara Subjects: Computer Vision and Pattern Recognition (cs.CV)
Convolutional neural networks (CNNs) have demonstrated remarkable results in
image classification tasks for benchmark and practical uses. CNNs with deeper architectures have recently achieved higher performance thanks to their numerous parameters and the resulting high expressive ability. However, CNNs have limited robustness to geometric transformations of objects in images, such as scaling and rotation. This problem is considered to limit the performance improvement of deep CNNs, but there is no established solution.
This study focuses on scale transformation and proposes a novel network
architecture called weight-shared multi-stage network (WSMS-Net), which enables
the existing deep CNNs, such as ResNet and DenseNet, to acquire robustness to
scaling of objects. The WSMS-Net architecture consists of multiple stages of
CNNs and is easily combined with existing deep CNNs. This study demonstrates that existing deep CNNs combined with the proposed WSMS-Net achieve higher accuracy on image classification tasks with only a small increase in the number of parameters.
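A single-filter toy sketch of the weight-sharing idea follows: the same kernel is applied to the input at several scales ("stages") and the per-stage features are merged, so a rescaled object can excite the same filters. This is our simplification, not the full multi-stage ResNet/DenseNet construction.

```python
# Sketch of the weight-sharing idea in WSMS-Net: ONE convolution kernel is
# applied to the input at several scales ("stages"), and the resulting
# feature maps are merged. A single-filter toy, not the full architecture.
import numpy as np
from scipy.ndimage import convolve, zoom

image = np.random.rand(32, 32)
kernel = np.random.randn(3, 3)            # one set of weights, shared

features = []
for scale in (1.0, 0.5, 0.25):            # three stages of the network
    scaled = zoom(image, scale, order=1)  # bilinear downscaling
    fmap = convolve(scaled, kernel)       # same kernel at every stage
    features.append(fmap.mean())          # global pooling per stage

merged = np.concatenate([np.atleast_1d(f) for f in features])
print("per-stage pooled features:", merged.round(3))
```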
Comments: 10 pages, 5 figures
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
State-of-the-art methods for 3D hand pose estimation from depth images
require large amounts of annotated training data. We propose to model the
statistical relationships of 3D hand poses and corresponding depth images using
two deep generative models with a shared latent space. By design, our
architecture allows for learning from unlabeled image data in a semi-supervised
manner. Assuming a one-to-one mapping between a pose and a depth map, any given
point in the shared latent space can be projected into both a hand pose and a
corresponding depth map. Regressing the hand pose can then be done by learning
a discriminator to estimate the posterior of the latent pose given some depth
map. To improve generalization and to better exploit unlabeled depth maps, we
jointly train a generator and a discriminator. At each iteration, the generator
is updated with the back-propagated gradient from the discriminator to
synthesize realistic depth maps of the articulated hand, while the
discriminator benefits from an augmented training set of synthesized and
unlabeled samples. The proposed discriminator network architecture is highly
efficient and runs at 90 FPS on the CPU, with accuracies comparable to or better than the state of the art on 3 publicly available benchmarks.
Comments: 10 pages, 10 figures, submitted to ICIP2017 (extension version)
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
This paper proposes an extension to Generative Adversarial Networks (GANs), named ArtGAN, to synthetically generate more challenging and complex images, such as artwork with abstract characteristics. This is in contrast to most current solutions, which focus on generating natural images such as room interiors, birds, flowers and faces. The key innovation of our work is to allow back-propagation of the loss function w.r.t. the labels (randomly assigned to each generated image) from the discriminator to the generator. With this feedback from the label information, the generator is able to learn faster and achieve better generated image quality. Empirically, we show that the proposed ArtGAN is capable of creating realistic artwork, as well as generating compelling real-world images that globally look natural, with clear shapes, on CIFAR-10.
Reverse Classification Accuracy: Predicting Segmentation Performance in the Absence of Ground Truth
Comments: Accepted article to appear in IEEE Transactions on Medical Imaging 2017
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
When integrating computational tools such as automatic segmentation into
clinical practice, it is of utmost importance to be able to assess the level of
accuracy on new data, and in particular, to detect when an automatic method
fails. However, this is difficult to achieve due to the absence of ground truth.
Segmentation accuracy on clinical data might be different from what is found
through cross-validation because validation data is often used during
incremental method development, which can lead to overfitting and unrealistic
performance expectations. Before deployment, performance is quantified using
different metrics, for which the predicted segmentation is compared to a
reference segmentation, often obtained manually by an expert. But little is
known about the real performance after deployment when a reference is
unavailable. In this paper, we introduce the concept of reverse classification
accuracy (RCA) as a framework for predicting the performance of a segmentation
method on new data. In RCA we take the predicted segmentation from a new image
to train a reverse classifier which is evaluated on a set of reference images
with available ground truth. The hypothesis is that if the predicted
segmentation is of good quality, then the reverse classifier will perform well
on at least some of the reference images. We validate our approach on
multi-organ segmentation with different classifiers and segmentation methods.
Our results indicate that it is indeed possible to predict the quality of
individual segmentations, in the absence of ground truth. Thus, RCA is ideal
for integration into automatic processing pipelines in clinical routine and as
part of large-scale image analysis studies.
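The RCA procedure itself is compact, as the toy sketch below shows on 1D "images": train a reverse classifier on the new image with its predicted segmentation as pseudo ground truth, evaluate it on references with known ground truth, and take the best score as the quality proxy. The decision-tree reverse classifier and synthetic data are our stand-ins for the paper's segmentation models.

```python
# Sketch of Reverse Classification Accuracy on toy 1D "images": train a
# classifier on (new image, predicted segmentation), evaluate it on reference
# images with known ground truth, and use the best Dice as a quality proxy.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def dice(a, b):
    return 2 * np.sum(a & b) / (np.sum(a) + np.sum(b) + 1e-9)

rng = np.random.default_rng(0)
new_img = rng.normal(0, 1, 200) + np.r_[np.zeros(100), 3 * np.ones(100)]
pred_seg = np.r_[np.zeros(100, int), np.ones(100, int)]   # prediction to assess

# Reverse classifier: learn to reproduce the predicted segmentation
rc = DecisionTreeClassifier(max_depth=2).fit(new_img.reshape(-1, 1), pred_seg)

scores = []
for _ in range(5):                                         # reference database
    ref_img = rng.normal(0, 1, 200) + np.r_[np.zeros(100), 3 * np.ones(100)]
    ref_gt = np.r_[np.zeros(100, int), np.ones(100, int)]
    ref_pred = rc.predict(ref_img.reshape(-1, 1))
    scores.append(dice(ref_pred.astype(bool), ref_gt.astype(bool)))

print("RCA quality estimate (max Dice over references):", round(max(scores), 3))
```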
Comments: Submitted for ICIP 2017
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
This paper presents a novel automatic face recognition approach based on
local binary patterns (LBP). The LBP descriptor considers a local neighbourhood of a pixel to compute features, but it is not very robust to image noise, variance, and differing illumination conditions. In this paper, we address these issues and extend the original LBP operator by considering more pixels and different neighbourhoods when computing the feature vector. The proposed method is evaluated on two benchmark corpora, namely the UFI and FERET face datasets. We show experimentally that our approach is very effective: it significantly outperforms several other state-of-the-art methods, particularly in realistic conditions where the above-mentioned issues are pronounced.
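For reference, the baseline 8-neighbor LBP operator that the paper extends can be written in a few lines: each pixel is encoded by thresholding its 3x3 neighborhood against the center value and reading the bits as a byte; histograms of these codes then form the face descriptor.

```python
# The baseline 8-neighbor LBP operator: threshold each pixel's 3x3
# neighborhood against the center value and pack the bits into a byte.
import numpy as np

def lbp_codes(img):
    c = img[1:-1, 1:-1]                       # center pixels
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]   # clockwise neighbors
    code = np.zeros_like(c, dtype=np.uint8)
    for bit, (dy, dx) in enumerate(offsets):
        nb = img[1 + dy:img.shape[0] - 1 + dy, 1 + dx:img.shape[1] - 1 + dx]
        code |= ((nb >= c).astype(np.uint8) << bit)
    return code

img = (np.random.rand(8, 8) * 255).astype(np.uint8)
print(lbp_codes(img))
```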
Amarjot Singh, Nick Kingsbury Subjects: Computer Vision and Pattern Recognition (cs.CV)
This paper introduces a Deep Scattering network that utilizes Dual-Tree
complex wavelets to extract translation invariant representations from an input
signal. The computationally efficient Dual-Tree wavelets decompose the input
signal into densely spaced representations over scales. Translation invariance
is introduced in the representations by applying a non-linearity over a region
followed by averaging. The discriminatory information in the densely spaced,
locally smooth, signal representations aids the learning of the classifier. The
proposed network is shown to outperform Mallat’s ScatterNet in classification accuracy on four datasets with different modalities.
Distributed Mapping with Privacy and Communication Constraints: Lightweight Algorithms and Object-based Models
Comments: preprint for IJRR submission
Subjects:
Robotics (cs.RO)
; Computer Vision and Pattern Recognition (cs.CV)
We consider the following problem: a team of robots is deployed in an unknown
environment and it has to collaboratively build a map of the area without a
reliable infrastructure for communication. The backbone for modern mapping
techniques is pose graph optimization, which estimates the trajectory of the
robots, from which the map can be easily built. The first contribution of this
paper is a set of distributed algorithms for pose graph optimization: rather
than sending all sensor data to a remote sensor fusion server, the robots
exchange very partial and noisy information to reach an agreement on the pose
graph configuration. Our approach can be considered as a distributed
implementation of the two-stage approach of Carlone et al., where we use the
Successive Over-Relaxation (SOR) and the Jacobi Over-Relaxation (JOR) as
workhorses to split the computation among the robots. As a second contribution,
we extend the proposed distributed algorithms to work with object-based map models. The use of object-based models
avoids the exchange of raw sensor measurements (e.g., point clouds) further
reducing the communication burden. Our third contribution is an extensive
experimental evaluation of the proposed techniques, including tests in
realistic Gazebo simulations and field experiments in a military test facility.
Abundant experimental evidence suggests that one of the proposed algorithms
(the Distributed Gauss-Seidel method or DGS) has excellent performance. The DGS
requires minimal information exchange, has an anytime flavor, scales well to
large teams, is robust to noise, and is easy to implement. Our field tests show
that the combined use of our distributed algorithms and object-based models
reduces the communication requirements by several orders of magnitude and
enables distributed mapping with large teams of robots in real-world problems.
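The flavor of these splitting methods can be sketched on a toy linear system: under a Jacobi-style update, each "robot" re-estimates only its own unknown from its neighbors' current values, so one sweep corresponds to one communication round. The 3x3 system below is illustrative, not a linearized pose graph.

```python
# Sketch of the Jacobi-style splitting behind the distributed solvers: each
# robot repeatedly re-estimates its own block from neighbors' current
# estimates, so solving A x = b needs only local exchanges.
import numpy as np

A = np.array([[4., -1., 0.], [-1., 4., -1.], [0., -1., 4.]])  # diag. dominant
b = np.array([3., 2., 3.])
x = np.zeros(3)                       # each entry "owned" by one robot

for _ in range(30):                   # one communication round per sweep
    x_new = x.copy()
    for i in range(3):                # robot i uses only neighbors' values
        x_new[i] = (b[i] - A[i] @ x + A[i, i] * x[i]) / A[i, i]
    x = x_new                         # Jacobi: all robots update in parallel

print("estimate:", x.round(4), "residual:", np.linalg.norm(A @ x - b).round(6))
```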
Comments: 7
Subjects:
Artificial Intelligence (cs.AI)
; Computation and Language (cs.CL)
Natural language sentence matching is a fundamental technology for a variety
of tasks. Previous approaches either match sentences from a single direction or
only apply single granular (word-by-word or sentence-by-sentence) matching. In
this work, we propose a bilateral multi-perspective matching (BiMPM) model
under the “matching-aggregation” framework. Given two sentences (P) and (Q),
our model first encodes them with a BiLSTM encoder. Next, we match the two
encoded sentences in two directions, (P \rightarrow Q) and (P \leftarrow Q). In
each matching direction, each time step of one sentence is matched against all
time-steps of the other sentence from multiple perspectives. Then, another
BiLSTM layer is utilized to aggregate the matching results into a fixed-length
matching vector. Finally, based on the matching vector, the decision is made
through a fully connected layer. We evaluate our model on three tasks:
paraphrase identification, natural language inference and answer sentence
selection. Experimental results on standard benchmark datasets show that our
model achieves the state-of-the-art performance on all tasks.
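The multi-perspective matching operation at the core of the model can be sketched as follows (our reading of the abstract: each perspective reweights the two vectors with its own learned weights before a cosine similarity). Random weights stand in for trained parameters.

```python
# Sketch of the multi-perspective cosine match: each of l "perspectives"
# reweights the two vectors with its own weights before taking a cosine
# similarity, yielding an l-dimensional match vector.
import numpy as np

def multi_perspective_match(v1, v2, W):
    """W: (l, d) trainable weights; returns l cosine similarities."""
    a, b = W * v1, W * v2                      # elementwise reweighting, (l, d)
    num = np.sum(a * b, axis=1)
    den = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1) + 1e-9
    return num / den

d, l = 8, 5
rng = np.random.default_rng(0)
m = multi_perspective_match(rng.normal(size=d), rng.normal(size=d),
                            rng.normal(size=(l, d)))
print("match vector:", m.round(3))
```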
Mieczysław A. Kłopotek, Sławomir T. Wierzchoń, Robert A. Kłopotek, Elżbieta A. Kłopotek Subjects: Artificial Intelligence (cs.AI)
In a previous paper, we simplified the proof of a theorem on personalized random walks that is fundamental to clustering graph nodes, and generalized it to bipartite graphs for the specific case where the probability of a random jump is proportional to the number of links of the "personally preferred" nodes. In this paper, we turn to the more complex issue of graphs in which the random jump follows a uniform distribution.
Mieczysław Kłopotek Subjects: Artificial Intelligence (cs.AI)
This paper investigates the application of consensus clustering and
meta-clustering to the set of all possible partitions of a data set. We show
that when using a “complement” of the Rand Index as a measure of cluster similarity, the total-separation partition, which puts each element in a separate set, is chosen.
Comments: 27 pages, 5 figures, 11 tables
Subjects:
Artificial Intelligence (cs.AI)
A lack of diversity in a genetic algorithm’s population may lead to poor performance of the genetic operators, since there is no equilibrium between exploration and exploitation. In such cases, genetic algorithms converge quickly and prematurely.
In this paper we develop a novel hybrid genetic algorithm which attempts to
obtain a balance between exploration and exploitation. It confronts the
diversity problem using an operator we call greedy diversification. Furthermore, the proposed algorithm applies a competition between parents and children so as to exploit high-quality visited solutions. These operators are complemented by a simple selection mechanism designed to preserve and take advantage of the population diversity.
Additionally, we extend our proposal to the field of memetic algorithms,
obtaining an improved model with outstanding results in practice.
The experimental study shows the validity of the approach, as well as how important it is to take the concepts of exploration and exploitation into account when designing an evolutionary algorithm.
Benedikt Bünz, Matthew Lamm Subjects: Artificial Intelligence (cs.AI)
In this paper we explore whether or not deep neural architectures can learn
to classify Boolean satisfiability (SAT). We devote considerable time to
discussing the theoretical properties of SAT. Then, we define a graph
representation for Boolean formulas in conjunctive normal form, and train
neural classifiers over general graph structures called Graph Neural Networks,
or GNNs, to recognize features of satisfiability. To the best of our knowledge, this has never been tried before. Our preliminary findings are potentially profound: in a weakly-supervised setting, that is, without problem-specific feature engineering, Graph Neural Networks can learn features of satisfiability.
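One plausible graph encoding of a CNF formula (our assumption; the paper defines its own representation) is a clause-literal graph, sketched below: a node per clause and per literal, membership edges, and edges linking complementary literals. A GNN would then message-pass over this structure.

```python
# Sketch of one common graph encoding of a CNF formula (an assumption, not
# necessarily the paper's variant): a bipartite graph with a node per clause
# and per literal, edges for membership, plus edges linking x and not-x.
# Formula: (x1 or not x2) and (x2 or x3)
cnf = [[1, -2], [2, 3]]                      # ints encode literals

nodes, edges = set(), []
for ci, clause in enumerate(cnf):
    c = f"c{ci}"
    nodes.add(c)
    for lit in clause:
        nodes.add(lit)
        edges.append((c, lit))               # clause-membership edge
for var in {abs(l) for cl in cnf for l in cl}:
    if var in nodes and -var in nodes:
        edges.append((var, -var))            # connect complementary literals

print(sorted(nodes, key=str), edges)         # a GNN would message-pass on this
```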
Qi Lei, Jinfeng Yi, Roman Vaculin, Lingfei Wu, Inderjit S. Dhillon Subjects: Artificial Intelligence (cs.AI); Learning (cs.LG)
A considerable number of machine learning algorithms take matrices as their input. As such, they cannot directly analyze time series data, due to its temporal nature, usually unequal lengths, and complex properties. This is a great pity, since many of these algorithms are effective, robust, efficient, and easy to use. In this paper, we bridge this gap by proposing an efficient
representation learning framework that is able to convert a set of time series
with equal or unequal lengths to a matrix format. In particular, we guarantee
that the pairwise similarities between time series are well preserved after the
transformation. Therefore, the learned feature representation is particularly
suitable to the class of learning problems that are sensitive to data
similarities. Given a set of (n) time series, we first construct an (n \times n) partially observed similarity matrix by randomly sampling (O(n \log n)) pairs
of time series and computing their pairwise similarities. We then propose an
extremely efficient algorithm that solves a highly non-convex and NP-hard
problem to learn new features based on the partially observed similarity
matrix. We use the learned features to conduct experiments on both data
classification and clustering tasks. Our extensive experimental results
demonstrate that the proposed framework is both effective and efficient.
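The pipeline can be sketched end to end with a generic solver standing in for the paper's specialized one: sample O(n log n) pairs, fill a partial similarity matrix, and fit low-rank features F so that F F^T matches the observed entries. The toy similarity function and plain gradient descent below are our assumptions.

```python
# Sketch of the pipeline: sample O(n log n) time-series pairs, fill a partial
# similarity matrix, then fit low-rank features F (n x r) to the observed
# entries. Gradient descent stands in for the paper's specialized solver.
import numpy as np

rng = np.random.default_rng(0)
n, r = 60, 4
series = [rng.normal(size=50).cumsum() for _ in range(n)]
sim = lambda a, b: np.exp(-np.mean((a - b) ** 2) / 100.0)   # toy similarity

n_pairs = int(n * np.log(n) * 4)
obs = {}
for _ in range(n_pairs):                       # O(n log n) sampled entries
    i, j = rng.integers(0, n, 2)
    obs[(i, j)] = obs[(j, i)] = sim(series[i], series[j])

F = 0.1 * rng.normal(size=(n, r))              # learned feature matrix
for _ in range(500):                           # minimize sum (F_i.F_j - s_ij)^2
    grad = np.zeros_like(F)
    for (i, j), s in obs.items():
        err = F[i] @ F[j] - s
        grad[i] += err * F[j]
    F -= 0.01 * grad
loss = np.mean([(F[i] @ F[j] - s) ** 2 for (i, j), s in obs.items()])
print(f"mean squared fit error on observed pairs: {loss:.4f}")
```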
Karan Goel, Shreya Rajpal, Mausam Subjects: Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Multiagent Systems (cs.MA)
Managing micro-tasks on crowdsourcing marketplaces involves balancing
conflicting objectives — the quality of work, total cost incurred and time to
completion. Previous agents have focused on cost-quality or cost-time tradeoffs, limiting their real-world applicability. As a step toward managing all three objectives jointly, we present Octopus, the first AI agent that does so in tandem. Octopus is based on a computationally tractable, multi-agent formulation consisting of three components: one that sets the price per ballot to adjust the rate of task completion, another that optimizes each task for quality, and a third that performs task selection. We demonstrate that Octopus outperforms existing state-of-the-art approaches in both simulation and experiments with real data. We also
deploy Octopus on Amazon Mechanical Turk to establish its ability to manage
tasks in a real-world, dynamic setting.
Comments: Report version of the AI Journal article "Best-first fixed-depth minimax algorithms" (1996)
Subjects:
Artificial Intelligence (cs.AI)
This paper has three main contributions to our understanding of fixed-depth
minimax search: (A) A new formulation for Stockman’s SSS* algorithm, based on
Alpha-Beta, is presented. It solves all the perceived drawbacks of SSS*,
finally transforming it into a practical algorithm. In effect, we show that
SSS* = Alpha-Beta + transposition tables. The crucial step is the realization
that transposition tables contain so-called solution trees, structures that are
used in best-first search algorithms like SSS*. Having created a practical
version, we present performance measurements with tournament game-playing
programs for three different minimax games, yielding results that contradict a
number of publications. (B) Based on the insights gained in our attempts at
understanding SSS*, we present a framework that facilitates the construction of
several best-first fixed-depth game-tree search algorithms, known and new. The
framework is based on depth-first null-window Alpha-Beta search, enhanced with
storage to allow for the refining of previous search results. It focuses
attention on the essential differences between algorithms. (C) We present a new
instance of the framework, MTD(f). It is well-suited for use with iterative
deepening, and performs better than algorithms that are currently used in most
state-of-the-art game-playing programs. We provide experimental evidence to
explain why MTD(f) performs better than the other fixed-depth minimax
algorithms.
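The MTD(f) driver described in (C) is short enough to sketch in full: a sequence of null-window Alpha-Beta calls that converges on the minimax value. For brevity the transposition table, which makes the re-searches cheap, is omitted here; MTD(f) remains correct without it, just slower. The toy game tree is hypothetical.

```python
# Sketch of the MTD(f) driver: repeated null-window Alpha-Beta searches home
# in on the minimax value. Memory (transposition table) omitted for brevity.
tree = {"root": ["a", "b"], "a": ["a1", "a2"], "b": ["b1", "b2"]}
leaf = {"a1": 3, "a2": 12, "b1": 8, "b2": 2}

def alphabeta(node, alpha, beta, maximizing):
    if node in leaf:
        return leaf[node]
    if maximizing:
        v = -10**9
        for child in tree[node]:
            v = max(v, alphabeta(child, alpha, beta, False))
            alpha = max(alpha, v)
            if alpha >= beta:
                break                          # beta cutoff
        return v
    v = 10**9
    for child in tree[node]:
        v = min(v, alphabeta(child, alpha, beta, True))
        beta = min(beta, v)
        if alpha >= beta:
            break                              # alpha cutoff
    return v

def mtdf(root, f):
    g, lower, upper = f, -10**9, 10**9
    while lower < upper:
        beta = max(g, lower + 1)               # null window (beta-1, beta)
        g = alphabeta(root, beta - 1, beta, True)
        if g < beta:
            upper = g                          # search failed low
        else:
            lower = g                          # search failed high
    return g

print("minimax value:", mtdf("root", f=0))     # f is the first guess
```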
Comments: Accepted to conference track at ICLR 2017
Subjects:
Computation and Language (cs.CL)
; Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Bilingual word vectors are usually trained “online”. Mikolov et al. showed that they can also be found “offline”, whereby two pre-trained embeddings are
aligned with a linear transformation, using dictionaries compiled from expert
knowledge. In this work, we prove that the linear transformation between two
spaces should be orthogonal. This transformation can be obtained using the
singular value decomposition. We introduce a novel “inverted softmax” for
identifying translation pairs, with which we improve the precision @1 of
Mikolov’s original mapping from 34% to 43%, when translating a test set
composed of both common and rare English words into Italian. Orthogonal
transformations are more robust to noise, enabling us to learn the
transformation without expert bilingual signal by constructing a
“pseudo-dictionary” from the identical character strings which appear in both
languages, achieving 40% precision on the same test set. Finally, we extend our
method to retrieve the true translations of English sentences from a corpus of
200k Italian sentences with a precision @1 of 68%.
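The paper's core construction, the orthogonal map obtained via SVD (orthogonal Procrustes), can be sketched in a few lines; toy random embeddings with a hidden rotation stand in for real English/Italian vectors, and the inverted softmax used for retrieval is not shown.

```python
# Sketch of the orthogonal map between two embedding spaces: the optimal
# orthogonal W comes from the SVD of the dictionary correlation matrix
# (orthogonal Procrustes). Toy embeddings stand in for real word vectors.
import numpy as np

rng = np.random.default_rng(0)
d, n_pairs = 50, 300
X = rng.normal(size=(n_pairs, d))              # source-language vectors
R = np.linalg.qr(rng.normal(size=(d, d)))[0]   # hidden "true" rotation
Y = X @ R.T + 0.01 * rng.normal(size=(n_pairs, d))  # target-language vectors

U, _, Vt = np.linalg.svd(Y.T @ X)              # Procrustes: W = U V^T
W = U @ Vt                                     # orthogonal by construction

err = np.linalg.norm(X @ W.T - Y) / np.linalg.norm(Y)
print(f"orthogonality check: {np.allclose(W @ W.T, np.eye(d))}, "
      f"relative alignment error: {err:.4f}")
```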
Stefano Nichele, Magnus S. Gundersen Subjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI)
The Reservoir Computing (RC) paradigm utilizes a dynamical system, i.e., a
reservoir, and a linear classifier, i.e., a read-out layer, to process data
from sequential classification tasks. In this paper the usage of Cellular
Automata (CA) as a reservoir is investigated. The use of CA in RC has been
showing promising results. In this paper, selected state-of-the-art experiments
are reproduced. It is shown that some CA-rules perform better than others, and
the reservoir performance is improved by increasing the size of the CA
reservoir itself. In addition, the usage of parallel loosely coupled
CA-reservoirs, where each reservoir has a different CA-rule, is investigated.
The experiments performed on quasi-uniform CA reservoir provide valuable
insights in CA reservoir design. The results herein show that some rules do not
work well together, while other combinations work remarkably well. This
suggests that non-uniform CA could represent a powerful tool for novel CA
reservoir implementations.
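A minimal CA reservoir can be sketched as follows: the input seeds a binary state, a fixed elementary rule is iterated, and the concatenated states over time form the feature vector a linear readout would consume. Rule 90 and the step count are illustrative assumptions, since the paper compares many rules.

```python
# Sketch of a cellular automaton reservoir: iterate a fixed elementary rule
# (rule 90 here, an illustrative choice) on a binary state seeded by the
# input; the concatenated states become features for a linear readout.
import numpy as np

def eca_step(state, rule=90):
    """One step of an elementary CA with periodic boundaries."""
    left, right = np.roll(state, 1), np.roll(state, -1)
    idx = 4 * left + 2 * state + right           # 3-bit neighborhood code
    table = (rule >> np.arange(8)) & 1           # rule's lookup table
    return table[idx]

def reservoir_features(bits, steps=8):
    state = np.array(bits, dtype=np.int64)
    states = [state]
    for _ in range(steps):
        state = eca_step(state)
        states.append(state)
    return np.concatenate(states)                # readout sees all time steps

x = reservoir_features([0, 1, 0, 0, 1, 1, 0, 1])
print(x.reshape(9, -1))                          # rows = CA evolution in time
```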
Patrick Glauner, Angelo Migliosi, Jorge Meira, Eric Aislan Antonelo, Petko Valtchev, Radu State, Franck Bettinger Subjects: Learning (cs.LG); Artificial Intelligence (cs.AI)
Non-technical losses (NTL) occur during the distribution of electricity in
power grids and include, but are not limited to, electricity theft and faulty
meters. In emerging countries, they may range up to 40% of the total
electricity distributed. In order to detect NTLs, machine learning methods are
used that learn irregular consumption patterns from customer data and
inspection results. The Big Data paradigm followed in modern machine learning
reflects the desire of deriving better conclusions from simply analyzing more
data, without the necessity of looking at theory and models. However, the
sample of inspected customers may be biased, i.e. it does not represent the
population of all customers. As a consequence, machine learning models trained
on these inspection results are biased as well and therefore lead to unreliable
predictions of whether customers cause NTL or not. In machine learning, this
issue is called covariate shift and has not been addressed in the literature on
NTL detection yet. In this work, we present a novel framework for quantifying
and visualizing covariate shift. We apply it to a commercial data set from
Brazil that consists of 3.6M customers and 820K inspection results. We show
that some features have a stronger covariate shift than others, making
predictions less reliable. In particular, previous inspections focused on certain neighborhoods or customer classes and were not sufficiently spread among the population of customers. This framework is about to be
deployed in a commercial product for NTL detection.
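One common way to quantify covariate shift, offered here as a hedged stand-in for the paper's framework, is to train a classifier to separate the inspected sample from the general population feature by feature: an AUC near 0.5 indicates little shift, higher values indicate stronger shift. The data below is synthetic.

```python
# A common covariate-shift measure (our stand-in, not necessarily the paper's
# exact method): train a discriminator between the inspected sample and the
# population on one feature; AUC near 0.5 means little shift.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
population = rng.normal(0.0, 1.0, size=(5000, 1))   # e.g., a location feature
inspected = rng.normal(0.8, 1.0, size=(800, 1))     # biased toward some areas

X = np.vstack([population, inspected])
y = np.r_[np.zeros(len(population)), np.ones(len(inspected))]

clf = LogisticRegression().fit(X, y)
auc = roc_auc_score(y, clf.predict_proba(X)[:, 1])
print(f"shift score (AUC): {auc:.3f}  # 0.5 = no detectable covariate shift")
```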
Comments: To appear in EACL 2017 (short papers)
Subjects:
Computation and Language (cs.CL)
We explore the problem of translating speech to text in low-resource
scenarios where neither automatic speech recognition (ASR) nor machine
translation (MT) are available, but we have training data in the form of audio
paired with text translations. We present the first system for this problem
applied to a realistic multi-speaker dataset, the CALLHOME Spanish-English
speech translation corpus. Our approach uses unsupervised term discovery (UTD)
to cluster repeated patterns in the audio, creating a pseudotext, which we pair
with translations to create a parallel text and train a simple bag-of-words MT
model. We identify the challenges faced by the system, finding that the
difficulty of cross-speaker UTD results in low recall, but that our system is
still able to correctly translate some content words in test data.
Daniele Bonadiman, Antonio Uva, Alessandro Moschitti Subjects: Computation and Language (cs.CL)
In this paper, we develop a deep neural network (DNN) that learns to simultaneously solve the three tasks of the cQA challenge proposed in SemEval-2016 Task 3, i.e., question-comment similarity, question-question
similarity and new question-comment similarity. The latter is the main task,
which can exploit the previous two for achieving better results. Our DNN is
trained jointly on all the three cQA tasks and learns to encode questions and
comments into a single vector representation shared across the multiple tasks.
The results on the official challenge test set show that our approach produces
higher accuracy and faster convergence rates than the individual neural
networks. Additionally, our method, which does not use any manual feature
engineering, approaches the state of the art established with methods that make
heavy use of it.
Comments: 6 pages, 1 figure, Thirtieth AAAI Conference on Artificial Intelligence (2016)
Subjects:
Computation and Language (cs.CL)
Agglutinative languages such as Turkish, Finnish and Hungarian require
morphological disambiguation before further processing due to the complex
morphology of words. A morphological disambiguator is used to select the
correct morphological analysis of a word. Morphological disambiguation is
important because it generally is one of the first steps of natural language
processing and its performance affects subsequent analyses. In this paper, we
propose a system that uses deep learning techniques for morphological
disambiguation. Many of the state-of-the-art results in computer vision, speech
recognition and natural language processing have been obtained through deep
learning models. However, applying deep learning techniques to morphologically
rich languages is not well studied. In this work, while we focus on Turkish
morphological disambiguation we also present results for French and German in
order to show that the proposed architecture achieves high accuracy with no language-specific feature engineering or additional resources. In the experiments, we achieve 84.12%, 88.35% and 93.78% morphological disambiguation accuracy among the ambiguous words for Turkish, German and French, respectively.
Akiko Eriguchi, Yoshimasa Tsuruoka, Kyunghyun Cho Subjects: Computation and Language (cs.CL)
There has been relatively little attention to incorporating linguistic priors into neural machine translation, and much of the previous work was further constrained to considering linguistic priors on the source side. In this paper, we propose a hybrid model, called NMT+RG, that learns to parse and translate by incorporating a recurrent neural network grammar into attention-based neural machine translation. Our approach encourages the neural machine translation model to incorporate linguistic priors during training, and lets it translate on
its own afterward. Extensive experiments with four language pairs show the
effectiveness of the proposed NMT+RG.
Ehsan Sherkat, Evangelos Milios Subjects: Computation and Language (cs.CL)
Using deep learning for various machine learning tasks such as image classification and word embedding has recently gained much attention. Its
appealing performance reported across specific Natural Language Processing
(NLP) tasks in comparison with other approaches is the reason for its
popularity. Word embedding is the task of mapping words or phrases to a low
dimensional numerical vector. In this paper, we use deep learning to embed
Wikipedia Concepts and Entities. The English version of Wikipedia contains more
than five million pages, which suggests its capacity to cover many English Entities, Phrases, and Concepts. Each Wikipedia page is considered as a
concept. Some concepts correspond to entities, such as a person’s name, an
organization or a place. Contrary to word embedding, Wikipedia Concept Embedding is not ambiguous: there are different vectors for concepts with a similar surface form but different meanings. We propose several approaches and evaluate their performance on Concept Analogy and Concept Similarity tasks. The results show that the proposed approaches achieve performance comparable to, and in some cases higher than, the state-of-the-art methods.
Comments: 2016 IEEE Workshop on Spoken Language Technology
Subjects:
Computation and Language (cs.CL)
; Learning (cs.LG)
Recently, machine learning methods have provided a broad spectrum of original and efficient algorithms based on Deep Neural Networks (DNN) to automatically predict an outcome from a sequence of inputs. Recurrent hidden cells, as in Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) networks, allow these DNN-based models to manage long-term dependencies. Nevertheless, these RNNs process a single input stream in one (LSTM) or two (Bidirectional LSTM) directions, while most of the information available nowadays comes from multiple streams or multimedia documents and requires RNNs to process it synchronously during training. This paper presents an original LSTM-based
architecture, named Parallel LSTM (PLSTM), that carries out multiple parallel
synchronized input sequences in order to predict a common output. The proposed
PLSTM method could be used for parallel sequence classification purposes. The
PLSTM approach is evaluated on an automatic telecast genre sequences
classification task and compared with different state-of-the-art architectures.
Results show that the proposed PLSTM method outperforms the baseline n-gram
models as well as the state-of-the-art LSTM approach.
Walid Shalaby, Wlodek Zadrozny Subjects: Computation and Language (cs.CL)
Explicit concept space models have proven efficacy for text representation in
many natural language and text mining applications. The idea is to embed
textual structures into a semantic space of concepts which captures the main
topics of these structures. This so-called bag-of-concepts representation suffers from data sparsity, causing low similarity scores between similar texts due to low concept overlap. In this paper, we propose two neural embedding
models in order to learn continuous concept vectors. Once learned, we propose
an efficient vector aggregation method to generate fully dense bag-of-concepts
representations. Empirical results on a benchmark dataset for measuring entity
semantic relatedness show superior performance over other concept embedding
models. In addition, by utilizing our efficient aggregation method, we demonstrate the effectiveness of the densified vector representation over typical sparse representations for dataless classification, where we achieve the same or better accuracy with far fewer dimensions.
Comments: This a draft version of the paper. We welcome any comments you may have regarding the content and presentation
Subjects:
Computation and Language (cs.CL)
Many language technology applications would benefit from the ability to
represent negation and its scope on top of widely-used linguistic resources. In
this paper, we investigate the possibility of obtaining a first-order logic
representation with negation scope marked using Universal Dependencies. To do
so, we enhance UDepLambda, a framework that converts dependency graphs to
logical forms. The resulting UDepLambda(\lnot) is able to handle phenomena related to scope by means of a higher-order type theory, relevant not only to negation but also to universal quantification and other complex semantic phenomena. The initial conversion we did for English is promising, in that one can represent the scope of negation even in the presence of more complex phenomena such as universal quantifiers.
Comments: 2 pages, 1 figure, 1 table, IEEE Letter
Subjects:
Distributed, Parallel, and Cluster Computing (cs.DC)
The advent of High Performance Computing (HPC) has provided the computational
capacity required for power system operators (SOs) to obtain timely solutions to highly complex applications such as Unit Commitment (UC). The UC
problem, which attempts to schedule the least-cost combination of generating
units to meet the load, is increasing in complexity and problem size due to
deployments of renewable resources and smart grid technologies. The current approach to solving the UC problem relies on in-house HPC infrastructure, which experiences issues at scale and demands high maintenance and capital expenditures. On the other hand, cloud computing is an ideal substitute due to
its powerful computational capacity, rapid scalability, and high
cost-effectiveness. In this work, the benefits and challenges of outsourcing
the UC application to the cloud are explored. A quantitative analysis of the
computational performance gain is explored for a large-scale UC problem solved
on the cloud and compared to traditional in-house HPC infrastructure. The
results show substantial reduction in solve time when outsourced to the cloud.
Iqra Altaf Gillani, Pooja Vyavahare, Amitabha Bagchi Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Networking and Internet Architecture (cs.NI)
We study the in-network computation of arbitrary functions whose computation
schema is a complete binary tree, i.e., we assume that the network wants to
compute a function of (K) operands, each of which is available at a distinct
node in the network, and rather than simply collecting the (K) operands at a
single sink node and computing the function, we want to compute the function
during the process of moving the data towards the sink. Such settings have been
studied in the literature, but largely only for symmetric functions, e.g., average, parity, etc., which have the specific property that the output is invariant to permutations of the operands. To the best of our knowledge, we
present the first decentralised algorithms for arbitrary functions. We propose
two algorithms, Fixed Metropolis-Compute and Flexible Metropolis-Compute, for
this problem, both of which use random walks on the network as their basic
primitive. Assuming that time is slotted, we provide upper bounds on the time taken to compute the function, characterising this time in terms of the fundamental parameters of the random walk on a graph: the hitting time in the case of Fixed
Metropolis-Compute, and the mixing time in the case of Flexible
Metropolis-Compute. Assuming a stochastic model for the generation of streams
of data at each source, we also provide lower and upper bounds on the rate at which Fixed Metropolis-Compute is able to compute the stream of associated function values.
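The abstract names Metropolis-style random walks as the basic primitive without giving formulas; the standard construction, which we assume here, picks transition probabilities from local degree information only, so that the walk's stationary distribution is uniform over nodes.

```python
# The standard Metropolis random walk on a graph (our assumption for the
# algorithms' primitive): P(u,v) = 1/max(deg(u), deg(v)) for each edge, with
# leftover probability kept as a self-loop; the stationary distribution is
# then uniform over nodes, using only local degree information.
adj = {0: [1, 2], 1: [0, 2, 3], 2: [0, 1], 3: [1]}   # toy network

def metropolis_probs(adj, u):
    deg = {v: len(nb) for v, nb in adj.items()}
    probs = {v: 1.0 / max(deg[u], deg[v]) for v in adj[u]}
    probs[u] = 1.0 - sum(probs.values())             # lazy self-loop mass
    return probs

for u in adj:
    print(u, metropolis_probs(adj, u))
```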
Comments: 6 pages, 3 figures, 1 table
Journal-ref: Proc. Euromicro Symposium on Digital Systems Design –
Architectures, Methods and Tools, September 4-6 2001, Warsaw, Poland, pp.
152-157
Subjects:
Distributed, Parallel, and Cluster Computing (cs.DC)
The aim of this paper is to demonstrate how the COSMA environment can be used
for system modeling. This environment is a set of tools based on the Concurrent State Machines paradigm and is developed at the Institute of Computer Science at the Warsaw University of Technology. Our demonstration example is a distributed brake control system dedicated to railway transport. The paper briefly introduces COSMA and then shows how the example model can be validated by our temporal logic analyzer.
Barun Gorain, Andrzej Pelc Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
Leader election is a basic symmetry breaking problem in distributed
computing. All nodes of a network have to agree on a single node, called the
leader. If the nodes of the network have distinct labels, then agreeing on a
single node means that all nodes have to output the label of the elected
leader.
If the nodes are anonymous, the task of leader election is formulated as
follows: every node of the network must output a simple path starting at it,
which is coded as a sequence of port numbers, such that all these paths end at
a common node, the leader. In this paper, we study deterministic leader
election in anonymous trees.
Our goal is to establish tradeoffs between the allocated time (\tau) and the amount of information that has to be given a priori to the nodes of a network to enable leader election in time (\tau). Following the framework of algorithms with advice, this information is provided to all nodes at the start by an oracle knowing the entire tree, in the form of binary strings assigned to the nodes. There are two possible variants of formulating this advice assignment: either the strings provided to all nodes are identical, or strings assigned to different nodes may differ, i.e., advice can be customized. As opposed to previous papers on leader election with advice, in this paper we consider the latter option. The maximum length of all assigned binary strings is called the size of advice.
For a given time (\tau) allocated to leader election, we give upper and lower bounds on the minimum size of advice sufficient to perform leader election in time (\tau). All our bounds except one pair are tight up to multiplicative
constants, and in this one exceptional case, the gap between the upper and the
lower bound is very small.
Matthias Fischer, Daniel Jung, Friedhelm Meyer auf der Heide Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Robotics (cs.RO)
We consider a swarm of (n) autonomous mobile robots, distributed on a
2-dimensional grid. A basic task for such a swarm is the gathering process: all
robots have to gather at one (not predefined) place. The work in this paper is
motivated by the following insight: On one side, for swarms of robots
distributed in the 2-dimensional Euclidean space, several gathering algorithms
are known for extremely simple robots that are oblivious, have bounded viewing
radius, no compass, and no “flags” to communicate a status to others. On the
other side, in the case of the 2-dimensional grid, the only known gathering algorithms for robots with bounded viewing radius and no compass need to memorize a constant number of rounds and need flags.
In this paper we contribute the, to the best of our knowledge, first
gathering algorithm on the grid, which works for anonymous, oblivious robots
with bounded viewing range, without any further means of communication and
without any memory. We prove its correctness and an (O(n^2)) time bound. This
time bound matches those of the best known algorithms for the Euclidean plane
mentioned above.
Comments: 6 pages
Subjects:
Distributed, Parallel, and Cluster Computing (cs.DC)
Mass customization refers to providing uniquely designed products and services
to every customer through high process integration and flexibility. It has
been used as a competitive approach by many companies. Adequate resource
implementation in mass customization, particularly in terms of resource usage,
is therefore important for meeting customers' requirements for responsiveness
and deadlines while offering high scalability. We illustrate an architecture
for solving the resource allocation issue in a mass-customized flexible
manufacturing system by putting in place advance reservation systems and
scheduling algorithms for effective resource usage.
Comments: 5 pages
Subjects:
Learning (cs.LG)
; Distributed, Parallel, and Cluster Computing (cs.DC)
In this work, we propose to train a deep neural network by distributed
optimization over a graph. Two nonlinear functions are considered: the
rectified linear unit (ReLU) and a linear unit with both lower and upper
cutoffs (DCutLU). The problem reformulation over a graph is realized by
explicitly representing ReLU or DCutLU using a set of slack variables. We then
apply the alternating direction method of multipliers (ADMM) to update the
weights of the network layerwise by solving subproblems of the reformulated
problem. Empirical results suggest that, with proper parameter selection, the
ADMM-based method converges considerably faster than the gradient descent
method.
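The reformulation is easiest to see in a scaled-down form. Below is a minimal
numpy sketch, assuming a one-hidden-layer ReLU network on toy data:
pre-activations and activations become slack variables, weights are updated
layerwise by least squares, and the ReLU constraint is handled by an
elementwise closed-form slack update. It uses quadratic penalties only (dual
variables omitted), so it is a simplified alternating scheme in the spirit of
the paper, not the authors' exact ADMM method.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy regression problem: X is (d, N), y is (m, N).
d, h, m, N = 5, 8, 1, 200
X = rng.normal(size=(d, N))
y = np.sin(X.sum(axis=0, keepdims=True))

W1 = 0.1 * rng.normal(size=(h, d))
W2 = 0.1 * rng.normal(size=(m, h))
z = W1 @ X                    # slack variables: pre-activations
a = np.maximum(z, 0)          # slack variables: activations
rho = 1.0                     # penalty weight (illustrative)

def z_update(a, w):
    """Elementwise argmin_z (a - relu(z))^2 + (z - w)^2, in closed form."""
    z_pos = np.maximum((a + w) / 2.0, 0.0)   # candidate on the branch z >= 0
    z_neg = np.minimum(w, 0.0)               # candidate on the branch z < 0
    f_pos = (a - z_pos) ** 2 + (z_pos - w) ** 2
    f_neg = a ** 2 + (z_neg - w) ** 2
    return np.where(f_pos <= f_neg, z_pos, z_neg)

for it in range(50):
    W1 = z @ np.linalg.pinv(X)               # layer-1 least-squares subproblem
    W2 = y @ np.linalg.pinv(a)               # layer-2 least-squares subproblem
    # activation subproblem: (W2^T W2 + rho I) a = W2^T y + rho relu(z)
    a = np.linalg.solve(W2.T @ W2 + rho * np.eye(h),
                        W2.T @ y + rho * np.maximum(z, 0))
    z = z_update(a, W1 @ X)                  # ReLU handled via the slack variable

err = np.linalg.norm(W2 @ np.maximum(W1 @ X, 0) - y) / np.linalg.norm(y)
print("relative fit error:", err)
```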
Next-Step Conditioned Deep Convolutional Neural Networks Improve Protein Secondary Structure Prediction
Comments: 11 pages, 3 figures, 4 tables, submitted to ISMB/ECCB 2017
Subjects:
Learning (cs.LG)
; Biomolecules (q-bio.BM)
Recently developed deep learning techniques have significantly improved the
accuracy of various speech and image recognition systems. In this paper we show
how to adapt some of these techniques to create a novel chained convolutional
architecture with next-step conditioning for improving performance on protein
sequence prediction problems. We explore its value by demonstrating its ability
to improve performance on eight-class secondary structure prediction. We first
establish a state-of-the-art baseline by adapting recent advances in
convolutional neural networks which were developed for vision tasks. This model
achieves 70.0% per amino acid accuracy on the CB513 benchmark dataset without
use of standard performance-boosting techniques such as ensembling or multitask
learning. We then improve upon this state-of-the-art result using a novel
chained prediction approach which frames the secondary structure prediction as
a next-step prediction problem. This sequential model achieves 70.3% Q8
accuracy on CB513 with a single model; an ensemble of these models produces
71.4% Q8 accuracy on the same test set, improving upon the previous overall
state of the art for the eight-class secondary structure problem. Our models
are implemented using TensorFlow, an open-source machine learning software
library available at TensorFlow.org; we aim to release the code for these
experiments as part of the TensorFlow repository.
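The chained next-step conditioning can be illustrated independently of the
convolutional architecture. The sketch below, with a placeholder dataset and a
logistic-regression model standing in for the paper's network, shows only the
framing: teacher forcing with the true previous label at training time,
sequential decoding with the predicted previous label at test time.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Toy sequence task: per-position features x_t and labels y_t (standing in
# for amino-acid features and 8-class secondary structure labels).
T, d, n_classes = 500, 10, 8
X = rng.normal(size=(T, d))
y = (X[:, 0] > 0).astype(int) * 4 + (X[:, 1] > 0).astype(int)

def with_prev_label(x_t, prev_label):
    onehot = np.zeros(n_classes)
    onehot[prev_label] = 1.0
    return np.concatenate([x_t, onehot])

# Training with teacher forcing: condition on the *true* previous label.
X_train = np.array([with_prev_label(X[t], y[t - 1] if t > 0 else 0)
                    for t in range(T)])
clf = LogisticRegression(max_iter=1000).fit(X_train, y)

# Sequential decoding: condition on the *predicted* previous label.
preds, prev = [], 0
for t in range(T):
    p = int(clf.predict(with_prev_label(X[t], prev)[None, :])[0])
    preds.append(p)
    prev = p
print("accuracy:", np.mean(np.array(preds) == y))
```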
Comments: 29 pages
Subjects:
Learning (cs.LG)
; Optimization and Control (math.OC); Probability (math.PR); Machine Learning (stat.ML)
Stochastic Gradient Langevin Dynamics (SGLD) is a popular variant of
Stochastic Gradient Descent, where properly scaled isotropic Gaussian noise is
added to an unbiased estimate of the gradient at each iteration. This modest
change allows SGLD to escape local minima and suffices to guarantee asymptotic
convergence to global minimizers for sufficiently regular non-convex objectives
(Gelfand and Mitter, 1991).
The present work provides a nonasymptotic analysis in the context of
non-convex learning problems: SGLD requires (\tilde{O}(\varepsilon^{-4}))
iterations to sample (\tilde{O}(\varepsilon))-approximate minimizers of both
empirical and population risk, where (\tilde{O}(\cdot)) hides polynomial
dependence on a temperature parameter, the model dimension, and a certain
spectral gap parameter.
As in the asymptotic setting, our analysis relates the discrete-time SGLD
Markov chain to a continuous-time diffusion process. A new tool that drives the
results is the use of weighted transportation cost inequalities to quantify the
rate of convergence of SGLD to a stationary distribution in the Euclidean
(2)-Wasserstein distance.
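The SGLD update itself is a one-line change to SGD. Below is a minimal numpy
sketch on a two-well scalar objective, with an illustrative inverse
temperature beta; the objective and constants are not the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)

def grad_estimate(theta):
    # Unbiased noisy gradient of the non-convex objective
    # f(theta) = (theta^2 - 1)^2, which has global minima at +/- 1.
    return 4 * theta * (theta ** 2 - 1) + rng.normal(scale=0.5)

theta, eta, beta = 2.0, 1e-3, 10.0   # init, step size, inverse temperature
samples = []
for k in range(50_000):
    # SGLD: an SGD step plus properly scaled isotropic Gaussian noise.
    theta += -eta * grad_estimate(theta) + np.sqrt(2 * eta / beta) * rng.normal()
    samples.append(theta)

# Late iterates approximately sample the Gibbs measure exp(-beta * f).
print("mean of last iterates:", np.mean(samples[-10_000:]))
```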
Patrick Glauner , Angelo Migliosi , Jorge Meira , Eric Aislan Antonelo , Petko Valtchev , Radu State , Franck Bettinger Subjects : Learning (cs.LG) ; Artificial Intelligence (cs.AI)
Non-technical losses (NTL) occur during the distribution of electricity in
power grids and include, but are not limited to, electricity theft and faulty
meters. In emerging countries, they may range up to 40% of the total
electricity distributed. In order to detect NTLs, machine learning methods are
used that learn irregular consumption patterns from customer data and
inspection results. The Big Data paradigm followed in modern machine learning
reflects the desire to derive better conclusions by simply analyzing more
data, without the necessity of looking at theory and models. However, the
sample of inspected customers may be biased, i.e. it does not represent the
population of all customers. As a consequence, machine learning models trained
on these inspection results are biased as well and therefore lead to unreliable
predictions of whether customers cause NTL or not. In machine learning, this
issue is called covariate shift and has not been addressed in the literature on
NTL detection yet. In this work, we present a novel framework for quantifying
and visualizing covariate shift. We apply it to a commercial data set from
Brazil that consists of 3.6M customers and 820K inspection results. We show
that some features have a stronger covariate shift than others, making
predictions less reliable. In particular, previous inspections were focused on
certain neighborhoods or customer classes and were not sufficiently spread
among the population of customers. This framework is about to be
deployed in a commercial product for NTL detection.
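The abstract does not spell out the framework's internals; one standard way to
quantify per-feature covariate shift between an inspected sample and the full
population is a two-sample test, sketched here on synthetic data with a
deliberately biased selection rule.

```python
import numpy as np
from scipy.stats import ks_2samp

rng = np.random.default_rng(0)

# Synthetic stand-ins: features of all customers vs. the inspected subset.
population = rng.normal(size=(100_000, 3))
inspected = population[population[:, 0] > 0.5][:5_000]  # selection biased on feature 0

for j in range(population.shape[1]):
    stat, pval = ks_2samp(population[:, j], inspected[:, j])
    print(f"feature {j}: KS statistic = {stat:.3f} (larger = stronger shift)")
```

On this toy data, feature 0 shows by far the largest statistic, mirroring the
paper's observation that some features carry a much stronger shift than others.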
Comments: 11 pages, 20 figures
Subjects:
Learning (cs.LG)
; Data Structures and Algorithms (cs.DS)
Kernel regression is an essential and ubiquitous tool for non-parametric data
analysis, particularly popular among time series and spatial data. However, the
central operation which is performed many times, evaluating a kernel on the
data set, takes linear time. This is impractical for modern large data sets.
In this paper we describe coresets for kernel regression: compressed data
sets which can be used as proxy for the original data and have provably bounded
worst case error. The size of the coresets is independent of the raw number of
data points; rather, it depends only on the error guarantee and, in some cases,
the size of the domain and the amount of smoothing. We evaluate our methods on very
large time series and spatial data, and demonstrate that they incur negligible
error, can be constructed extremely efficiently, and allow for great
computational gains.
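The coreset construction is the paper's contribution and is not reproduced
here; the sketch below only shows the computation it accelerates,
Nadaraya-Watson kernel regression, with a uniform random subsample standing in
for the coreset.

```python
import numpy as np

rng = np.random.default_rng(0)

def kernel_regress(xq, x, y, h=0.2):
    """Nadaraya-Watson estimate at queries xq, Gaussian kernel of bandwidth h."""
    w = np.exp(-0.5 * ((xq[:, None] - x[None, :]) / h) ** 2)
    return (w @ y) / w.sum(axis=1)

n = 20_000
x = rng.uniform(0, 10, size=n)
y = np.sin(x) + rng.normal(scale=0.3, size=n)
xq = np.linspace(0, 10, 200)

full = kernel_regress(xq, x, y)                # evaluates the kernel n times per query
idx = rng.choice(n, size=500, replace=False)   # uniform subsample as a proxy "coreset"
small = kernel_regress(xq, x[idx], y[idx])     # 40x fewer kernel evaluations
print("max abs deviation:", np.abs(full - small).max())
```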
You Lin , Ming Yang , Can Wan , Jianhui Wang , Yonghua Song Subjects : Learning (cs.LG) ; Applications (stat.AP)
Short-term probabilistic wind power forecasting can provide critical
quantified uncertainty information of wind generation for power system
operation and control. Owing to the complicated characteristics of wind power
prediction error, it is difficult to develop a universal forecasting model
that dominates all alternative models. Therefore, a novel multi-model
combination (MMC) approach for short-term probabilistic wind generation
forecasting is proposed in this paper to exploit the advantages of different
forecasting models. The proposed approach can combine different forecasting
models that provide different kinds of probability density functions to
improve the probabilistic forecast accuracy. Three probabilistic forecasting
models based on the sparse Bayesian learning, kernel density estimation and
beta distribution fitting are used to form the combined model. The parameters
of the MMC model are solved based on Bayesian framework. Numerical tests
illustrate the effectiveness of the proposed MMC approach.
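The combination step can be sketched generically: evaluate each component
predictive density and mix them with weights. Here the three components are
rough stand-ins for the sparse-Bayesian, KDE, and beta-fitting models, and the
weights are fixed rather than learned in a Bayesian framework as in the paper.

```python
import numpy as np
from scipy.stats import beta, gaussian_kde, norm

rng = np.random.default_rng(0)

# Synthetic normalized wind-power history in (0, 1).
hist = np.clip(rng.beta(2, 5, size=2_000), 1e-3, 1 - 1e-3)

# Component predictive densities (stand-ins for the SBL, KDE, and beta models).
kde = gaussian_kde(hist)
a_hat, b_hat, _, _ = beta.fit(hist, floc=0, fscale=1)
components = [
    lambda p: kde(p),
    lambda p: beta.pdf(p, a_hat, b_hat),
    lambda p: norm.pdf(p, hist.mean(), hist.std()),
]
weights = np.array([0.5, 0.3, 0.2])   # fixed here; learned via Bayes in the paper

def combined_pdf(p):
    return sum(w * f(p) for w, f in zip(weights, components))

grid = np.linspace(0.005, 0.995, 199)
mass = (combined_pdf(grid) * (grid[1] - grid[0])).sum()
print("combined density mass on (0,1):", round(float(mass), 3))
```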
Comments: Accepted by AISTATS 2017
Subjects:
Learning (cs.LG)
; Data Structures and Algorithms (cs.DS)
In the Best-(k)-Arm problem, we are given (n) stochastic bandit arms, each
associated with an unknown reward distribution. We are required to identify the
(k) arms with the largest means by taking as few samples as possible. In this
paper, we make progress towards a complete characterization of the
instance-wise sample complexity bounds for the Best-(k)-Arm problem. On the
lower bound side, we obtain a novel complexity term to measure the sample
complexity that every Best-(k)-Arm instance requires. This is derived by an
interesting and nontrivial reduction from the Best-(1)-Arm problem. We also
provide an elimination-based algorithm that matches the instance-wise lower
bound within doubly-logarithmic factors. The sample complexity of our algorithm
strictly dominates the state of the art for Best-(k)-Arm (modulo constant
factors).
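The paper's algorithm and complexity term are more refined than the following,
but the generic elimination template it builds on looks like this: sample all
surviving arms, accept arms that are clearly in the top (k), reject arms that
clearly are not, and shrink the confidence radius each round. The confidence
schedule below is illustrative, not the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)

means = np.array([0.9, 0.8, 0.7, 0.5, 0.3, 0.2])   # unknown to the algorithm
n, k = len(means), 2

active = list(range(n))
accepted = []
mu = np.zeros(n)     # running empirical means
cnt = np.zeros(n)    # pull counts
r = 1
while active and len(accepted) < k:
    t = 2 ** r                                     # per-round budget doubles
    for i in active:
        x = rng.normal(means[i], 1.0, size=t)
        mu[i] = (mu[i] * cnt[i] + x.sum()) / (cnt[i] + t)
        cnt[i] += t
    eps = np.sqrt(np.log(8 * n * r ** 2) / t)      # illustrative confidence radius
    rem = k - len(accepted)                        # top arms still needed
    order = sorted(active, key=lambda i: mu[i], reverse=True)
    kth = mu[order[rem - 1]]
    k1th = mu[order[rem]] if rem < len(active) else -np.inf
    for i in order:
        if rem > 0 and mu[i] - eps > k1th + eps:   # clearly within the top k
            accepted.append(i)
            active.remove(i)
            rem -= 1
        elif mu[i] + eps < kth - eps:              # clearly outside the top k
            active.remove(i)
    r += 1

print("accepted arms:", sorted(accepted))
```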
Comments: First version
Subjects:
Learning (cs.LG)
Incremental learning with concept drift has often been tackled by ensemble
methods, where models built in the past can be re-trained to attain new models
for the current data. Two design questions need to be addressed in developing
ensemble methods for incremental learning with concept drift, i.e., which
historical (i.e., previously trained) models should be preserved and how to
utilize them. A novel ensemble learning method, namely Diversity and Transfer
based Ensemble Learning (DTEL), is proposed in this paper. Given newly arrived
data, DTEL uses each preserved historical model as an initial model and further
trains it with the new data via transfer learning. Furthermore, DTEL preserves
a diverse set of historical models, rather than a set of historical models that
are merely accurate in terms of classification accuracy. Empirical studies on
15 synthetic data streams and 4 real-world data streams (all with concept
drifts) demonstrate that DTEL can handle concept drift more effectively than 4
other state-of-the-art methods.
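DTEL itself uses decision trees; the sketch below approximates its transfer
step with warm-started SGD classifiers (copy each preserved model, continue
training it on the new chunk) and reduces diversity-based preservation to a
recency placeholder, so it shows only the overall loop structure, not the
paper's method.

```python
import copy
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)

def make_chunk(angle, n=300):
    """Binary-labeled chunk whose decision boundary rotates as the concept drifts."""
    X = rng.normal(size=(n, 2))
    w = np.array([np.cos(angle), np.sin(angle)])
    return X, (X @ w > 0).astype(int)

preserved = []     # historical models; the paper keeps a *diverse* set of them
max_models = 5

for step in range(10):
    X, y = make_chunk(angle=0.3 * step)
    candidates = []
    for old in preserved:
        new = copy.deepcopy(old)    # transfer: start from a historical model
        new.partial_fit(X, y)       # ... and continue training on the new data
        candidates.append(new)
    candidates.append(SGDClassifier(loss="log_loss").fit(X, y))  # from scratch
    best = max(candidates, key=lambda mdl: mdl.score(X, y))
    print(f"chunk {step}: accuracy {best.score(X, y):.2f}")
    preserved = (preserved + [best])[-max_models:]  # recency stands in for diversity pruning
```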
Comments: 9 pages
Subjects:
Learning (cs.LG)
; Machine Learning (stat.ML)
A generative model based on training deep architectures is proposed. The
model consists of K networks that are trained together to learn the underlying
distribution of a given data set. The process starts with dividing the input
data into K clusters and feeding each of them into a separate network. After
a few iterations of training the networks separately, we use an EM-like
algorithm to
train the networks together and update the clusters of the data. We call this
model Mixture of Networks. The proposed model is a platform that can be used
for any deep structure and be trained by any conventional objective function
for distribution modeling. As the components of the model are neural networks,
it has high capability in characterizing complicated data distributions as well
as clustering data. We apply the algorithm on MNIST hand-written digits and
Yale face datasets. We also demonstrate the clustering ability of the model
using some real-world and toy examples.
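Below is a hedged sketch of the EM-like loop, with per-cluster PCA models
standing in for the K networks: assign each point to the component that
reconstructs it best, then refit the components. The data, rank, and
initialization are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two synthetic clusters lying near different 1-dimensional subspaces of R^5.
def cluster(n=200):
    basis = rng.normal(size=(1, 5))
    return rng.normal(size=(n, 1)) @ basis + rng.normal(scale=0.05, size=(n, 5))

X = np.vstack([cluster(), cluster()])
K, r = 2, 1
assign = np.arange(len(X)) % K          # balanced initial "clustering"

def fit_component(Xc):
    mu = Xc.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc - mu, full_matrices=False)
    return mu, Vt[:r]                   # rank-r PCA model for this component

def recon_error(X, model):
    mu, V = model
    z = (X - mu) @ V.T
    return np.linalg.norm(X - mu - z @ V, axis=-1)

for it in range(20):
    # M-step analogue: refit each component on its assigned points
    # (a production version should guard against empty components).
    models = [fit_component(X[assign == k]) for k in range(K)]
    # E-step analogue: hard-assign each point to the best-reconstructing component.
    errs = np.stack([recon_error(X, mdl) for mdl in models])
    assign = errs.argmin(axis=0)

print("cluster sizes:", np.bincount(assign, minlength=K))
```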
Comments: Under review for CVPR 2017. Project webpage: this https URL
Subjects:
Computer Vision and Pattern Recognition (cs.CV)
; Artificial Intelligence (cs.AI); Learning (cs.LG); Robotics (cs.RO)
We introduce a neural architecture for navigation in novel environments. Our
proposed architecture learns to map from first-person viewpoints and plans a
sequence of actions towards goals in the environment. The Cognitive Mapper and
Planner (CMP) is based on two key ideas: a) a unified joint architecture for
mapping and planning, such that the mapping is driven by the needs of the
planner, and b) a spatial memory with the ability to plan given an incomplete
set of observations about the world. CMP constructs a top-down belief map of
the world and applies a differentiable neural net planner to produce the next
action at each time step. The accumulated belief of the world enables the agent
to track visited regions of the environment. Our experiments demonstrate that
CMP outperforms both reactive strategies and standard memory-based
architectures and performs well in novel environments. Furthermore, we show
that CMP can also achieve semantically specified goals, such as ‘go to a
chair’.
Hong Yu , Zheng-Hua Tan , Zhanyu Ma , Jun Guo Subjects : Sound (cs.SD) ; Cryptography and Security (cs.CR); Learning (cs.LG)
With the development of speech synthesis techniques, automatic speaker
verification systems face the serious challenge of spoofing attack. In order to
improve the reliability of speaker verification systems, we develop a new
filter bank based cepstral feature, deep neural network filter bank cepstral
coefficients (DNN-FBCC), to distinguish between natural and spoofed speech. The
deep neural network filter bank is automatically generated by training a filter
bank neural network (FBNN) using natural and synthetic speech. By adding
restrictions on the training rules, the learned weight matrix of FBNN is
band-limited and sorted by frequency, similar to the normal filter bank. Unlike
the manually designed filter bank, the learned filter bank has different filter
shapes in different channels, which can capture the differences between natural
and synthetic speech more effectively. The experimental results on the ASVspoof
2015 database show that the Gaussian mixture model maximum-likelihood
(GMM-ML) classifier trained by the new feature performs better than the
state-of-the-art linear frequency cepstral coefficients (LFCC) based
classifier, especially on detecting unknown attacks.
Qi Lei , Jinfeng Yi , Roman Vaculin , Lingfei Wu , Inderjit S. Dhillon Subjects : Artificial Intelligence (cs.AI) ; Learning (cs.LG)
A considerable amount of machine learning algorithms take matrices as their
inputs. As such, they cannot directly analyze time series data due to its
temporal nature, usually unequal lengths, and complex properties. This is a
great pity since many of these algorithms are effective, robust, efficient, and
easy to use. In this paper, we bridge this gap by proposing an efficient
representation learning framework that is able to convert a set of time series
with equal or unequal lengths to a matrix format. In particular, we guarantee
that the pairwise similarities between time series are well preserved after the
transformation. Therefore, the learned feature representation is particularly
suitable to the class of learning problems that are sensitive to data
similarities. Given a set of (n) time series, we first construct an (n \times n)
partially observed similarity matrix by randomly sampling (O(n \log n)) pairs
of time series and computing their pairwise similarities. We then propose an
extremely efficient algorithm that solves a highly non-convex and NP-hard
problem to learn new features based on the partially observed similarity
matrix. We use the learned features to conduct experiments on both data
classification and clustering tasks. Our extensive experimental results
demonstrate that the proposed framework is both effective and efficient.
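The paper's solver handles a hard non-convex problem; as a structural sketch,
one can factorize the partially observed similarity matrix by plain SGD so
that inner products of the learned features match the sampled similarities.
The similarity function below is a cheap stand-in, not the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)

# n "time series" of unequal lengths; the similarity is a stand-in for a
# genuine time-series similarity such as DTW.
n = 100
series = [rng.normal(loc=rng.uniform(-2, 2), size=rng.integers(30, 80))
          for _ in range(n)]
def sim(i, j):
    return float(np.exp(-(series[i].mean() - series[j].mean()) ** 2))

# Randomly sample O(n log n) entries of the n x n similarity matrix.
m = int(4 * n * np.log(n))
obs = [(i, j, sim(i, j))
       for i, j in zip(rng.integers(n, size=m), rng.integers(n, size=m))]

# Learn features F (n x r) such that F_i . F_j ~ s_ij on the observed entries.
r, lr = 8, 0.05
F = 0.1 * rng.normal(size=(n, r))
for epoch in range(200):
    for i, j, s in obs:
        e = float(F[i] @ F[j] - s)
        F[i], F[j] = F[i] - lr * e * F[j], F[j] - lr * e * F[i]

print("true vs recovered similarity:", sim(0, 1), float(F[0] @ F[1]))
```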
Sandy H. Huang , David Held , Pieter Abbeel , Anca D. Dragan Subjects : Robotics (cs.RO) ; Learning (cs.LG)
Our ultimate goal is to efficiently enable end-users to correctly anticipate
a robot’s behavior in novel situations. This behavior is often a direct result
of the robot’s underlying objective function. Our insight is that end-users
need to have an accurate mental model of this objective function in order to
understand and predict what the robot will do. While people naturally develop
such a mental model over time through observing the robot act, this
familiarization process may be lengthy. Our approach reduces this time by
having the robot model how people infer objectives from observed behavior, and
then selecting those behaviors that are maximally informative. The problem of
computing a posterior over objectives from observed behavior is known as
Inverse Reinforcement Learning (IRL), and has been applied to robots learning
human objectives. We consider the problem where the roles of human and robot
are swapped. Our main contribution is to recognize that unlike robots, humans
will not be \emph{exact} in their IRL inference. We thus introduce two factors
to define candidate approximate-inference models for human learning in this
setting, and analyze them in a user study in the autonomous driving domain. We
show that certain approximate-inference models lead to the robot generating
example behaviors that better enable users to anticipate what the robot will do
in test situations. Our results also suggest, however, that additional research
is needed in modeling how humans extrapolate from examples of robot behavior.
Comments: This is the appendix to the paper “A Collective, Probabilistic Approach to Schema Mapping” accepted to ICDE 2017
Subjects:
Databases (cs.DB)
; Learning (cs.LG)
In this appendix we provide additional supplementary material to “A
Collective, Probabilistic Approach to Schema Mapping.” We include an additional
extended example, supplementary experiment details, and proof for the
complexity result stated in the main paper.
Comments: 2016 IEEE Workshop on Spoken Language Technology
Subjects:
Computation and Language (cs.CL)
; Learning (cs.LG)
Recently, machine learning methods have provided a broad spectrum of original
and efficient algorithms based on Deep Neural Networks (DNN) to automatically
predict an outcome with respect to a sequence of inputs. Recurrent hidden
cells, as used in Recurrent Neural Networks (RNN) and Long Short-Term Memory
(LSTM) networks, allow these DNN-based models to manage long-term dependencies.
Nevertheless, these RNNs process a single input stream in one (LSTM) or two
(Bidirectional LSTM) directions. However, most of the information available
nowadays comes from multiple streams or multimedia documents, and requires
RNNs to process this information synchronously during training. This paper
presents an original LSTM-based
architecture, named Parallel LSTM (PLSTM), that carries out multiple parallel
synchronized input sequences in order to predict a common output. The proposed
PLSTM method could be used for parallel sequence classification purposes. The
PLSTM approach is evaluated on an automatic telecast genre sequences
classification task and compared with different state-of-the-art architectures.
Results show that the proposed PLSTM method outperforms the baseline n-gram
models as well as the state-of-the-art LSTM approach.
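Below is a minimal Keras sketch of the parallel-streams idea, assuming three
synchronized input streams, one LSTM per stream, and a shared softmax after
concatenation; the sizes and fusion layer are illustrative, not the paper's
exact PLSTM configuration.

```python
import numpy as np
from tensorflow.keras import layers, Model

T, dims, n_classes = 20, [16, 8, 4], 5   # 3 synchronized streams

inputs, encoded = [], []
for d in dims:
    x = layers.Input(shape=(T, d))       # one parallel input stream
    inputs.append(x)
    encoded.append(layers.LSTM(32)(x))   # one LSTM per stream

merged = layers.concatenate(encoded)     # fuse the parallel encodings
out = layers.Dense(n_classes, activation="softmax")(merged)
model = Model(inputs=inputs, outputs=out)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")

# Smoke test on random data.
X = [np.random.randn(64, T, d).astype("float32") for d in dims]
y = np.random.randint(n_classes, size=64)
model.fit(X, y, epochs=1, verbose=0)
print(model.predict([x[:2] for x in X], verbose=0).shape)  # (2, n_classes)
```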
Comments: International Conference on Learning Representations (ICLR) 2017
Subjects:
Machine Learning (stat.ML)
; Learning (cs.LG)
We study reinforcement learning of chatbots with recurrent neural network
architectures when the rewards are noisy and expensive to obtain. For instance,
a chatbot used in automated customer service support can be scored by quality
assurance agents, but this process can be expensive, time consuming and noisy.
Previous reinforcement learning work for natural language processing uses
on-policy updates and/or is designed for on-line learning settings. We
demonstrate empirically that such strategies are not appropriate for this
setting and develop an off-policy batch policy gradient method (BPG). We
demonstrate the efficacy of our method via a series of synthetic experiments
and an Amazon Mechanical Turk experiment on a restaurant recommendations
dataset.
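The core ingredient of any off-policy batch policy gradient is an
importance-weighted gradient over logged data. Below is a
contextual-bandit-style numpy sketch; the paper's estimator for sequential
dialogue is more involved, and the clipping constant here is an arbitrary
choice for variance control.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

d, A = 4, 3
theta = np.zeros((d, A))                 # target policy parameters

# Logged batch: contexts, actions from a uniform behavior policy, noisy rewards.
X = rng.normal(size=(2_000, d))
acts = rng.integers(A, size=len(X))
rewards = (acts == (X[:, 0] > 0).astype(int)) + rng.normal(scale=0.3, size=len(X))
behavior_prob = np.full(len(X), 1.0 / A)

for step in range(200):
    pi = softmax(X @ theta)                           # target policy probabilities
    w = pi[np.arange(len(X)), acts] / behavior_prob   # importance weights
    w = np.minimum(w, 5.0)                            # clip for variance control
    # grad log pi(a|x) = x (1{a} - pi)^T; accumulate the weighted REINFORCE gradient.
    ind = np.eye(A)[acts]
    g = X.T @ ((ind - pi) * (w * rewards)[:, None]) / len(X)
    theta += 0.5 * g

greedy = (X[:, 0] > 0).astype(int)
print("avg target-policy prob of rewarded action:",
      softmax(X @ theta)[np.arange(len(X)), greedy].mean())
```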
Syed Hassan Raza Naqvi , Andrea Matera , Lorenzo Combi , Umberto Spagnolini Subjects : Information Theory (cs.IT)
Centralized Radio Access Network (C-RAN) architecture is the only viable
solution to handle the complex interference scenario generated by massive
antennas and small cells deployment as required by next generation (5G) mobile
networks. In conventional C-RAN, the fronthaul links used to exchange the
signal between Base Band Units (BBUs) and Remote Antenna Units (RAUs) are based
on digital baseband (BB) signals over optical fibers due to the huge bandwidth
required. In this paper we evaluate the transport capability of a copper-based,
all-analog fronthaul architecture called Radio over Copper (RoC), which
leverages the pre-existing LAN cables already deployed in buildings and
enterprises. In particular, the main contribution of the paper is to evaluate
the number of independent BB signals for a multiple-antenna system that can be
transported over multi-pair Cat-5/6/7 cables under a predefined fronthauling
transparency condition in terms of maximum BB signal degradation. The MIMO-RoC
proves to be a complementary solution to optical fiber for the last 200m toward
the RAUs, mostly to reuse the existing LAN cables and to power-supply the RAUs
over the same cable.
Comments: Extended version of a paper submitted to ISIT 2017
Subjects:
Information Theory (cs.IT)
In this paper, we study the impact of locality on the decoding of binary
cyclic codes under two approaches, namely ordered statistics decoding (OSD) and
trellis decoding. Given a binary cyclic code having locality or availability,
we suitably modify the OSD to obtain gains in terms of the signal-to-noise
ratio (SNR), for a given reliability and essentially the same level of decoder
complexity. With regard to trellis decoding, we show that careful introduction
of locality results in the creation of cyclic subcodes having lower maximum
state complexity. We also present a simple upper-bounding technique on the
state complexity profile, based on the zeros of the code. Finally, it is shown
how the decoding speed can be significantly increased in the presence of
locality, in the moderate-to-high SNR regime, by making use of a quick-look
decoder that often returns the ML codeword.
Comments: 22 pages, 1 figure, submitted to ISIT 2017
Subjects:
Information Theory (cs.IT)
In this work, we consider a complete covert communication system, which
includes the source-model of a stealthy secret key generation (SSKG) as the
first phase. The generated key will be used for the covert communication in the
second phase of the current round and also in the first phase of the next
round. We investigate the stealthy SK rate performance of the first phase. The
derived results show that the SK capacity lower and upper bounds of the
source-model SKG are not affected by the additional stealth constraint. This
result implies that we can attain the SSKG capacity for free when the sequences
observed by the three terminals Alice ((X^n)), Bob ((Y^n)) and Willie ((Z^n))
follow a Markov chain relationship, i.e., (X^n-Y^n-Z^n). We then prove that the
sufficient condition to attain both the SK capacity and the SSK capacity can
be relaxed from physical to stochastic degradedness. In order to
underline the practical relevance, we also derive a sufficient condition to
attain the degradedness by the usual stochastic order for Maurer’s fast fading
Gaussian (satellite) model for the source of common randomness.
On Muting Mobile Terminals for Uplink Interference Mitigation in HetNets — System-Level Analysis via Stochastic Geometry
Comments: 16 pages, 12 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Subjects:
Information Theory (cs.IT)
We investigate the performance of a scheduling algorithm where the Mobile
Terminals (MTs) may be turned off if they cause a level of interference greater
than a given threshold. This approach, which is referred to as Interference
Aware Muting (IAM), may be regarded as an interference-aware scheme aimed at
reducing the level of interference. We analyze its performance with the
aid of stochastic geometry and compare it against other interference-unaware
and interference-aware schemes, where the level of interference is kept under
control in the power control scheme itself rather than in the scheduling
process. IAM is studied in terms of average transmit power, mean and variance
of the interference, coverage probability, Spectral Efficiency (SE), and Binary
Rate (BR), which accounts for the amount of resources allocated to the typical
MT. Simplified expressions of SE and BR for adaptive modulation and coding
schemes are proposed, which better characterize practical communication
systems. Our system-level analysis unveils that IAM increases the BR and
reduces the mean and variance of the interference. It is proved that an
operating regime exists, where the performance of IAM is independent of the
cell association criterion, which simplifies the joint design of uplink and
downlink transmissions.
Comments: 14 pages, 8 figures, submitted to IEEE Transactions on Wireless Communications, Nov. 2016
Subjects:
Information Theory (cs.IT)
The Internet of Things paradigm envisages the presence of many
battery-powered sensors, and this entails the design of energy-aware protocols.
Source coding techniques make it possible to save energy by compressing the
packets sent over the network, but at the cost of poorer accuracy in the
representation of the data. This paper addresses the problem of designing
efficient policies to jointly perform processing and transmission tasks. In
particular, we aim at defining an optimal scheduling strategy with the twofold
ultimate goal of extending the network lifetime and guaranteeing a low overall
distortion of the transmitted data. We propose a Time Division Multiple Access
(TDMA)-based access scheme that optimally allocates resources to heterogeneous
nodes. We use realistic rate-distortion curves to quantify the impact of
compression on the data quality and propose a complete energy model that
includes the energy spent for processing and transmitting the data, as well as
the circuitry costs. Both full knowledge and statistical knowledge of the
wireless channels are considered, and optimal policies are derived for both
cases. The overall problem is structured in a modular fashion and solved
through convex and alternate programming techniques. Finally, we thoroughly
evaluate the proposed algorithms and the influence of the design variables on
the system performance adopting parameters of real sensors.
Lina Mohjazi , Sami Muhaidat , Mehrdad Dianati , Mahmoud Al-Qutayri Subjects : Information Theory (cs.IT)
Simultaneous wireless information and power transfer (SWIPT) relay networks
represent a paradigm shift in the development of wireless networks, enabling
simultaneous radio frequency (RF) energy harvesting (EH) and information
processing. Different from conventional SWIPT relaying schemes, which typically
assume the availability of perfect channel state information (CSI), here we
consider the application of noncoherent modulation in order to avoid the need
of instantaneous CSI estimation/tracking and minimise the energy consumption.
We propose a unified and comprehensive analytical framework for the analysis of
time switching (TS) and power splitting (PS) receiver architectures with the
amplify-and-forward (AF) protocol. In particular, we adopt a moments-based
approach to derive novel expressions for the outage probability, achievable
throughput, and average symbol error rate (ASER) of the considered SWIPT
system. We quantify the impact of several system parameters, involving relay
location, energy conversion efficiency, and TS and PS ratio assumptions,
imposed on the EH relay terminal. Our results reveal that the throughput
performance of the TS protocol is superior to that of the PS protocol at lower
receive signal-to-noise (SNR) values, which is in contrast to the
point-to-point SWIPT systems. An extensive Monte Carlo simulation study is
presented to corroborate the proposed analysis.
Andre Wibisono , Varun Jog , Po-Ling Loh Subjects : Information Theory (cs.IT) ; Statistics Theory (math.ST)
We study the relationship between information- and estimation-theoretic
quantities in time-evolving systems. We focus on the Fokker-Planck channel
defined by a general stochastic differential equation, and show that the time
derivatives of entropy, KL divergence, and mutual information are characterized
by estimation-theoretic quantities involving an appropriate generalization of
the Fisher information. Our results vastly extend De Bruijn’s identity and the
classical I-MMSE relation.
Comments: 32 pages, 2 figures
Subjects:
Information Theory (cs.IT)
In some applications, the variance of measurement noise depends on the signal
that we aim to measure. For instance, additive Gaussian signal-dependent noise
(AGSDN) channel models are used in molecular and optical communication. Herein
we provide lower and upper bounds on the capacity of additive signal dependent
noise (ASDN) channels. We also provide sufficient conditions under which the
capacity becomes infinity.
Bhavya Kailkhura , Lakshmi Narasimhan Theagarajan , Pramod K. Varshney Subjects : Information Theory (cs.IT)
In this letter, we extend the well-known index coding problem to exploit the
structure in the source-data to improve system throughput. In many
applications, the data to be transmitted may lie (or can be well approximated)
in a low-dimensional subspace. We exploit this low-dimensional structure of the
data using an algebraic framework to solve the index coding problem (referred
to as subspace-aware index coding) as opposed to the traditional index coding
problem which is subspace-unaware. Also, we propose an efficient algorithm
based on the alternating minimization approach to obtain near optimal index
codes for both subspace-aware and -unaware cases. Our simulations indicate that
under certain conditions, a significant throughput gain (about 90%) can be
achieved by subspace-aware index codes over conventional subspace-unaware index
codes.
Comments: 13 pages, 9 figures, 4 tables
Subjects:
Information Theory (cs.IT)
We determine lower and upper bounds on the capacity of bandlimited optical
intensity channels (BLOIC) with white Gaussian noise. Three types of input
power constraints are considered: 1) only an average power constraint, 2) only
a peak power constraint, and 3) an average and a peak power constraint.
Capacity lower bounds are derived by a two-step process including 1) for each
type of constraint, designing admissible pulse amplitude modulated input
waveform ensembles, and 2) lower bounding the maximum achievable information
rates of the designed input ensembles. Capacity upper bounds are derived by
exercising constraint relaxations and utilizing known results on discrete-time
optical intensity channels. We obtain degrees-of-freedom-optimal (DOF-optimal)
lower bounds which have the same pre-log factor as the upper bounds, thereby
characterizing the high SNR capacity of BLOIC to within a finite gap. We
further derive intersymbol-interference-free (ISI-free) signaling based lower
bounds, which perform well for all practical SNR values. In particular, the
ISI-free signaling based lower bounds outperform the DOF-optimal lower bound
when the SNR is below 10 dB.
Sense-and-Predict: Opportunistic MAC Based on Spatial Interference Correlation for Cognitive Radio Networks
Jeemin Kim , Seung-Woo Ko , Han Cha , Seong-Lyun Kim Subjects : Information Theory (cs.IT)
Opportunity detection at secondary transmitters (TXs) is a key technique
enabling cognitive radio (CR) networks. Such detection however cannot guarantee
reliable communication at secondary receivers (RXs), especially when their
association distance is long. To cope with the issue, this paper proposes a
novel MAC called sense-and-predict (SaP), where each secondary TX decides
whether to access or not based on the prediction of the interference level at
RX. Firstly, we provide the spatial interference correlation in a probabilistic
form using stochastic geometry, and utilize it to maximize the area spectral
efficiency (ASE) for secondary networks while guaranteeing the service quality
of primary networks. Through simulations and testbed experiments using USRP,
SaP is shown to always achieve ASE improvement compared with the conventional
TX based sensing.
Classical-Quantum Arbitrarily Varying Wiretap Channel: Secret Message Transmission under Jamming Attacks
Holger Boche , Minglai Cai , Christian Deppe , Janis Nötzel Subjects : Information Theory (cs.IT)
We analyze arbitrarily varying classical-quantum wiretap channels. These
channels are subject to two attacks at the same time: one passive
(eavesdropping), and one active (jamming). We improve on previous works by
introducing a reduced class of allowed codes that fulfills a more stringent
secrecy requirement than earlier definitions. In addition, we prove that
non-symmetrizability of the legal link is sufficient for equality of the
deterministic and the common randomness assisted secrecy capacities. Finally,
we focus on analytic properties of both secrecy capacities: we completely
characterize their discontinuity points and their super-activation properties.
Comments: 13 pages, 5 figures, 2 tables; submitted to a journal
Subjects:
Information Theory (cs.IT)
; Hardware Architecture (cs.AR)
Massive multiuser (MU) multiple-input multiple-output (MIMO) will be a core
technology in fifth-generation (5G) wireless systems as it offers significant
improvements in spectral efficiency compared to existing multi-antenna
technologies. The presence of hundreds of antenna elements at the base station
(BS), however, results in excessively high hardware costs and power
consumption, and requires high interconnect throughput between the
baseband-processing unit and the radio unit. Massive MU-MIMO that uses
low-resolution analog-to-digital and digital-to-analog converters (DACs) has
the potential to address all these issues. In this paper, we focus on downlink
precoding for massive MU-MIMO systems with 1-bit DACs at the BS. The objective
is to design precoders that simultaneously mitigate multi-user interference
(MUI) and quantization artifacts. We propose two nonlinear 1-bit precoding
algorithms and corresponding very-large scale integration (VLSI) designs. Our
algorithms rely on biconvex relaxation, which enables the design of efficient
1-bit precoding algorithms that achieve superior error-rate performance
compared to that of linear precoding algorithms followed by quantization. To
showcase the efficacy of our algorithms, we design VLSI architectures that
enable efficient 1-bit precoding for massive MU-MIMO systems in which hundreds
of antennas serve tens of user equipments. We present corresponding
field-programmable gate array (FPGA) implementations to demonstrate that 1-bit
precoding enables reliable and high-rate downlink data transmission in
practical systems.
Dmitry Batenkov , Yaniv Romano , Michael Elad Subjects : Information Theory (cs.IT) ; Machine Learning (stat.ML)
The traditional sparse modeling approach, when applied to inverse problems
with large data such as images, essentially assumes a sparse model for small
overlapping data patches. While producing state-of-the-art results, this
methodology is suboptimal, as it does not attempt to model the entire global
signal in any meaningful way – a nontrivial task by itself. In this paper we
propose a way to bridge this theoretical gap by constructing a global model
from the bottom up. Given local sparsity assumptions in a dictionary, we show
that the global signal representation must satisfy a constrained
underdetermined system of linear equations, which can be solved efficiently by
modern optimization methods such as Alternating Direction Method of Multipliers
(ADMM). We investigate conditions for unique and stable recovery, and provide
numerical evidence corroborating the theory.
The Connectivity of Millimeter-Wave Networks in Urban Environments Modeled Using Random Lattices
Comments: 28 pages, 10 figures, submitted to IEEE Trans. on Wireless Communications
Subjects:
Information Theory (cs.IT)
Millimeter-wave (mmWave) communication opens up tens of giga-hertz (GHz)
spectrum in the mmWave band for use by next-generation wireless systems,
thereby solving the problem of spectrum scarcity. Maintaining connectivity
stands out as a key design challenge for mmWave networks deployed in urban
regions due to the blockage effect characterizing mmWave propagation.
Specifically, mmWave signals can be blocked by buildings and other large urban
objects. In this paper, we set out to investigate the blockage effect on the
connectivity of mmWave networks in a Manhattan-type urban region modeled using
a random regular lattice while base stations (BSs) are Poisson distributed in
the plane. In particular, we analyze the connectivity probability that a
typical user is within the transmission range of a BS and connected by a
line-of-sight. Using random lattice and stochastic geometry theories, different
lower bounds on the connectivity probability are derived as functions of the
buildings’ size and probability of a lattice cell being occupied by a building
as well as BS density and transmission range. The asymptotic connectivity
probability is also derived for cases of dense buildings. Last, the results are
extended to heterogeneous networks. Our study yields closed-form relations
between the parameters of the building process and the BS process, providing
useful guidelines for practical mmWave network deployment.
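The model lends itself to a small Monte Carlo check. The sketch below, with
arbitrary parameter values, occupies lattice cells independently with
probability p, drops a Poisson number of uniformly placed BSs, and declares
the central user connected if some in-range BS has an unblocked segment to it.

```python
import numpy as np

rng = np.random.default_rng(0)

def los_clear(p_user, p_bs, occupied, cell, steps=100):
    """True if the user-BS segment crosses no occupied lattice cell."""
    ts = np.linspace(0.0, 1.0, steps)
    pts = p_user[None, :] + ts[:, None] * (p_bs - p_user)[None, :]
    cells = np.floor(pts / cell).astype(int)
    return not any(occupied.get((i, j), False) for i, j in map(tuple, cells))

def trial(region=500.0, cell=10.0, p_occ=0.3, bs_density=1e-3, reach=150.0):
    n_cells = int(region / cell)
    occupied = {(i, j): rng.random() < p_occ
                for i in range(n_cells) for j in range(n_cells)}
    user = np.array([region / 2, region / 2])
    occupied[tuple(np.floor(user / cell).astype(int))] = False  # user outside buildings
    n_bs = rng.poisson(bs_density * region ** 2)                # Poisson number of BSs
    bs = rng.uniform(0, region, size=(n_bs, 2))
    d = np.linalg.norm(bs - user, axis=1)
    return any(los_clear(user, b, occupied, cell) for b in bs[d < reach])

print("connectivity probability ~", np.mean([trial() for _ in range(100)]))
```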
Comments: This work has been accepted in IEEE TWC
Subjects:
Information Theory (cs.IT)
This paper jointly optimizes the precoding matrices and the set of active
remote radio heads (RRHs) to minimize the network power consumption (NPC) for a
user-centric cloud radio access network (C-RAN), where both the RRHs and users
have multiple antennas and each user is served by its nearby RRHs. Both users’
rate requirements and per-RRH power constraints are considered. Due to these
conflicting constraints, this optimization problem may be infeasible. In this
paper, we propose to solve this problem in two stages. In Stage I, a
low-complexity user selection algorithm is proposed to find the largest subset
of feasible users. In Stage II, a low-complexity algorithm is proposed to solve
the optimization problem with the users selected from Stage I. Specifically,
the re-weighted (l_1)-norm minimization method is used to transform the
original problem with non-smooth objective function into a series of weighted
power minimization (WPM) problems, each of which can be solved by the weighted
minimum mean square error (WMMSE) method. The solution obtained by the WMMSE
method is proved to satisfy the Karush-Kuhn-Tucker (KKT) conditions of the WPM
problem. Moreover, a low-complexity algorithm based on Newton’s method and the
gradient descent method is developed to update the precoder matrices in each
iteration of the WMMSE method. Simulation results demonstrate the rapid
convergence of the proposed algorithms and the benefits of equipping multiple
antennas at the user side. Moreover, the proposed algorithm is shown to achieve
near-optimal performance in terms of NPC.
Globally convergent Jacobi-type algorithms for simultaneous orthogonal symmetric tensor diagonalization
Comments: 19 pages, 5 figures
Subjects:
Numerical Analysis (math.NA)
; Information Theory (cs.IT); Optimization and Control (math.OC)
In this paper, we consider a family of Jacobi-type algorithms for the
simultaneous orthogonal diagonalization problem of symmetric tensors. For the
Jacobi-based algorithm of [SIAM J. Matrix Anal. Appl., 2(34):651–672, 2013],
we prove its global convergence for simultaneous orthogonal diagonalization of
symmetric matrices and 3rd-order tensors. We also propose a new Jacobi-based
algorithm in the general setting and prove its global convergence for a wide
range of tensor problems (including Tucker approximation).
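For the matrix case, one Jacobi-type sweep is easy to write down. Below is a
numpy sketch using the classical Cardoso-Souloumiac rotation angle, a
reconstruction of the standard algorithm rather than the authors' code,
verified on commuting symmetric matrices that are exactly jointly
diagonalizable.

```python
import numpy as np

rng = np.random.default_rng(0)

def joint_diagonalize(mats, sweeps=20):
    """Jacobi sweeps: one Givens rotation per index pair, applied to all matrices."""
    n = mats[0].shape[0]
    Q = np.eye(n)
    A = [M.copy() for M in mats]
    for _ in range(sweeps):
        for p in range(n - 1):
            for q in range(p + 1, n):
                # Optimal angle from the principal eigenvector of a 2x2 Gram matrix.
                h = np.array([[M[p, p] - M[q, q], 2 * M[p, q]] for M in A])
                G = h.T @ h
                _, V = np.linalg.eigh(G)
                x, y = V[:, -1]
                if x < 0:
                    x, y = -x, -y
                c = np.sqrt((x + 1) / 2)   # cos of the rotation angle
                s = y / (2 * c)            # sin of the rotation angle
                J = np.eye(n)
                J[p, p] = J[q, q] = c
                J[p, q], J[q, p] = -s, s
                A = [J.T @ M @ J for M in A]
                Q = Q @ J
    return Q, A

# Commuting symmetric matrices (shared eigenbasis): exactly jointly diagonalizable.
U, _ = np.linalg.qr(rng.normal(size=(5, 5)))
mats = [U @ np.diag(rng.normal(size=5)) @ U.T for _ in range(3)]
Q, A = joint_diagonalize(mats)
off = sum(np.abs(M - np.diag(np.diag(M))).sum() for M in A)
print("residual off-diagonal mass:", off)
```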