转载

arXiv Paper Daily: Fri, 3 Feb 2017

Neural and Evolutionary Computing

Dominance Move: A Measure of Comparing Solution Sets in Multiobjective Optimization

Miqing Li , Xin Yao

Comments: 23 pages, 10 figures

Subjects

Neural and Evolutionary Computing (cs.NE)

; Optimization and Control (math.OC)

One of the most common approaches for multiobjective optimization is to

generate a solution set that well approximates the whole Pareto-optimal

frontier to facilitate the later decision-making process. However, how to

evaluate and compare the quality of different solution sets remains

challenging. Existing measures typically require additional problem knowledge

and information, such as a reference point or a substituted set of the

Pareto-optimal frontier. In this paper, we propose a quality measure, called

dominance move (DoM), to compare solution sets generated by multiobjective

optimizers. Given two solution sets, DoM measures the minimum sum of move

distances for one set to weakly Pareto dominate the other set. DoM can be seen

as a natural reflection of the difference between two solutions, capturing all

aspects of solution sets’ quality, being compliant with Pareto dominance, and

does not need any additional problem knowledge and parameters. We present an

exact method to calculate the DoM in the biobjective case. We show the

necessary condition of constructing the optimal partition for a solution set’s

minimum move, and accordingly propose an efficient algorithm to recursively

calculate the DoM. Finally, DoM is evaluated on several groups of artificial

and real test cases as well as by a comparison with two well-established

quality measures.

Scaling Properties of Human Brain Functional Networks

Riccardo Zucca , Xerxes D. Arsiwalla , Hoang Le , Mikail Rubinov , Paul Verschure

Comments: International Conference on Artificial Neural Networks – ICANN 2016

Journal-ref: Artificial Neural Networks and Machine Learning, Lecture Notes in

Computer Science, vol 9886, 2016

Subjects

Neurons and Cognition (q-bio.NC)

; Disordered Systems and Neural Networks (cond-mat.dis-nn); Neural and Evolutionary Computing (cs.NE); Data Analysis, Statistics and Probability (physics.data-an)

We investigate scaling properties of human brain functional networks in the

resting-state. Analyzing network degree distributions, we statistically test

whether their tails scale as power-law or not. Initial studies, based on

least-squares fitting, were shown to be inadequate for precise estimation of

power-law distributions. Subsequently, methods based on maximum-likelihood

estimators have been proposed and applied to address this question.

Nevertheless, no clear consensus has emerged, mainly because results have shown

substantial variability depending on the data-set used or its resolution. In

this study, we work with high-resolution data (10K nodes) from the Human

Connectome Project and take into account network weights. We test for the

power-law, exponential, log-normal and generalized Pareto distributions. Our

results show that the statistics generally do not support a power-law, but

instead these degree distributions tend towards the thin-tail limit of the

generalized Pareto model. This may have implications for the number of hubs in

human brain functional networks.

Learning Criticality in an Embodied Boltzmann Machine

Miguel Aguilera , Manuel G. Bedia Subjects : Adaptation and Self-Organizing Systems (nlin.AO) ; Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC)

Many biological and cognitive systems do not operate deep into one or other

regime of activity. Instead, they exploit critical surfaces poised at

transitions in their parameter space. The pervasiveness of criticality in

natural systems suggests that there may be general principles inducing this

behaviour. However, there is a lack of conceptual models explaining how

embodied agents propel themselves towards these critical points. In this paper,

we present a learning model driving an embodied Boltzmann Machine towards

critical behaviour by maximizing the heat capacity of the network. We test and

corroborate the model implementing an embodied agent in the mountain car

benchmark, controlled by a Boltzmann Machine that adjust its weights according

to the model. We find that the neural controller reaches a point of

criticality, which coincides with a transition point of the behaviour of the

agent between two regimes of behaviour, maximizing the synergistic information

between its sensors and the hidden and motor neurons. Finally, we discuss the

potential of our learning model to study the contribution of criticality to the

behaviour of embodied living systems in scenarios not necessarily constrained

by biological restrictions of the examples of criticality we find in nature.

Information-theoretic interpretation of tuning curves for multiple motion directions

Wentao Huang , Xin Huang , Kechen Zhang

Comments: The 51st Annual Conference on Information Sciences and Systems (CISS), 2017

Subjects

Information Theory (cs.IT)

; Neural and Evolutionary Computing (cs.NE); Neurons and Cognition (q-bio.NC); Quantitative Methods (q-bio.QM)

We have developed an efficient information-maximization method for computing

the optimal shapes of tuning curves of sensory neurons by optimizing the

parameters of the underlying feedforward network model. When applied to the

problem of population coding of visual motion with multiple directions, our

method yields several types of tuning curves with both symmetric and asymmetric

shapes that resemble what have been found in the visual cortex. Our result

suggests that the diversity or heterogeneity of tuning curve shapes as observed

in neurophysiological experiment might actually constitute an optimal

population representation of visual motions with multiple components.

On orthogonality and learning recurrent networks with long term dependencies

Eugene Vorontsov , Chiheb Trabelsi , Samuel Kadoury , Chris Pal Subjects : Learning (cs.LG) ; Neural and Evolutionary Computing (cs.NE)

It is well known that it is challenging to train deep neural networks and

recurrent neural networks for tasks that exhibit long term dependencies. The

vanishing or exploding gradient problem is a well known issue associated with

these challenges. One approach to addressing vanishing and exploding gradients

is to use either soft or hard constraints on weight matrices so as to encourage

or enforce orthogonality. Orthogonal matrices preserve gradient norm during

backpropagation and can therefore be a desirable property; however, we find

that hard constraints on orthogonality can negatively affect the speed of

convergence and model performance. This paper explores the issues of

optimization convergence, speed and gradient stability using a variety of

different methods for encouraging or enforcing orthogonality. In particular we

propose a weight matrix factorization and parameterization strategy through

which we can bound matrix norms and therein control the degree of expansivity

induced during backpropagation.

Computer Vision and Pattern Recognition

Pixel Recursive Super Resolution

Ryan Dahl , Mohammad Norouzi , Jonathon Shlens Subjects : Computer Vision and Pattern Recognition (cs.CV) ; Learning (cs.LG)

We present a pixel recursive super resolution model that synthesizes

realistic details into images while enhancing their resolution. A low

resolution image may correspond to multiple plausible high resolution images,

thus modeling the super resolution process with a pixel independent conditional

model often results in averaging different details–hence blurry edges. By

contrast, our model is able to represent a multimodal conditional distribution

by properly modeling the statistical dependencies among the high resolution

image pixels, conditioned on a low resolution input. We employ a PixelCNN

architecture to define a strong prior over natural images and jointly optimize

this prior with a deep conditioning convolutional network. Human evaluations

indicate that samples from our proposed model look more photo realistic than a

strong L2 regression baseline.

Maritime situational awareness using adaptive multi-sensor management under hazy conditions

D. K. Prasad , C. K. Prasath , D. Rajan , L. Rachmawati , E. Rajabally , C. Quek

Comments: 11 pages, 2 figures, MTEC 2017

Subjects

Computer Vision and Pattern Recognition (cs.CV)

This paper presents a multi-sensor architecture with an adaptive multi-sensor

management system suitable for control and navigation of autonomous maritime

vessels in hazy and poor-visibility conditions. This architecture resides in

the autonomous maritime vessels. It augments the data from on-board imaging

sensors and weather sensors with the AIS data and weather data from sensors on

other vessels and the on-shore vessel traffic surveillance system. The combined

data is analyzed using computational intelligence and data analytics to

determine suitable course of action while utilizing historically learnt

knowledge and performing live learning from the current situation. Such

framework is expected to be useful in diverse weather conditions and shall be a

useful architecture to provide autonomy to maritime vessels.

Handwritten Recognition Using SVM, KNN and Neural Network

Norhidayu Abdul Hamid , Nilam Nur Amir Sjarif

Comments: 11 pages ; 22 Figures

Subjects

Computer Vision and Pattern Recognition (cs.CV)

Handwritten recognition (HWR) is the ability of a computer to receive and

interpret intelligible handwritten input from source such as paper documents,

photographs, touch-screens and other devices. In this paper we will using three

(3) classification t o re cognize the handwritten which is SVM, KNN and Neural

Network.

Learning a time-dependent master saliency map from eye-tracking data in videos

Antoine Coutrot , Nathalie Guyader Subjects : Computer Vision and Pattern Recognition (cs.CV)

To predict the most salient regions of complex natural scenes, saliency

models commonly compute several feature maps (contrast, orientation, motion…)

and linearly combine them into a master saliency map. Since feature maps have

different spatial distribution and amplitude dynamic ranges, determining their

contributions to overall saliency remains an open problem. Most

state-of-the-art models do not take time into account and give feature maps

constant weights across the stimulus duration. However, visual exploration is a

highly dynamic process shaped by many time-dependent factors. For instance,

some systematic viewing patterns such as the center bias are known to

dramatically vary across the time course of the exploration. In this paper, we

use maximum likelihood and shrinkage methods to dynamically and jointly learn

feature map and systematic viewing pattern weights directly from eye-tracking

data recorded on videos. We show that these weights systematically vary as a

function of time, and heavily depend upon the semantic visual category of the

videos being processed. Our fusion method allows taking these variations into

account, and outperforms other state-of-the-art fusion schemes using constant

weights over time. The code, videos and eye-tracking data we used for this

study are available online:

this http URL

Side Information in Robust Principle Component Analysis: Algorithms and Applications

Niannan Xue , Yannis Panagakis , Stefanos Zafeiriou

Comments: Submitted to CVPR

Subjects

Computer Vision and Pattern Recognition (cs.CV)

Robust rank minimisation aims at recovering a low-rank subspace from grossly

corrupted high-dimensional (often visual) data and is a cornerstone in many

machine learning and computer vision applications. The most prominent method

for this task is the Robust Principal Component Analysis (PCA). It recovers a

low-rank matrix from sparse corruptions of unknown magnitude and support by

Principal Component Pursuit (PCP), which is a convex approximation to the

otherwise NP-hard rank minimisation problem. Even though PCP has been shown to

be very successful in solving many rank minimisation problems, there are cases

where degenerate or suboptimal solutions are obtained. This can be attributed

to the fact that domain-dependent prior knowledge is not taken into account by

PCP. In this paper, we address the problem of PCP when prior information is

available. To this end, we propose algorithms for solving the PCP problem with

the aid of prior information on the low-rank structure of the data. The

versatility of the proposed methods is demonstrated by applying them to four

applications, namely background substraction, facial image denoising, face and

facial expression recognition. Experimental results on synthetic and five real

world datasets indicate the robustness and effectiveness of the proposed

methods on these application domains, largely outperforming previous approaches

that incorporate side information within Robust PCA.

A Fast and Compact Salient Score Regression Network Based on Fully Convolutional Network

Xuanyang Xi , Yongkang Luo , Fengfu Li , Peng Wang , Hong Qiao Subjects : Computer Vision and Pattern Recognition (cs.CV)

Visual saliency detection aims at identifying the most visually distinctive

parts in an image, and serves as a pre-processing step for a variety of

computer vision and image processing tasks. To this end, the saliency detection

procedure must be as fast and compact as possible and optimally processes input

images in a real time manner. However, contemporary detection methods always

take hundreds of milliseconds to pursue feeble improvements on the detection

precession. In this paper, we tackle this problem by proposing a fast and

compact salient score regression network which employs deep convolutional

neural networks (CNN) to estimate the saliency of objects in images. It

operates (including training and testing) in an end-to-end manner

(image-to-image prediction) and also directly produces whole saliency maps from

original images without any pre-processings and post-processings. Comparing

with contemporary CNN-based saliency detection methods, the proposed method

extremely simplifies the detection procedure and further promotes the

representation ability of CNN for the saliency detection. Our method is

evaluated on six public datasets, and experimental results show that the

precision can be comparable to the published state-of-the-art methods while the

speed gets a significant improvement (35 FPS, processing in real time).

Automating Image Analysis by Annotating Landmarks with Deep Neural Networks

Mikhail Breslav , Tyson L. Hedrick , Stan Sclaroff , Margrit Betke

Comments: 30 pages

Subjects

Computer Vision and Pattern Recognition (cs.CV)

Image and video analysis is often a crucial step in the study of animal

behavior and kinematics. Often these analyses require that the position of one

or more animal landmarks are annotated (marked) in numerous images. The process

of annotating landmarks can require a significant amount of time and tedious

labor, which motivates the need for algorithms that can automatically annotate

landmarks. In the community of scientists that use image and video analysis to

study the 3D flight of animals, there has been a trend of developing more

automated approaches for annotating landmarks, yet they fall short of being

generally applicable. Inspired by the success of Deep Neural Networks (DNNs) on

many problems in the field of computer vision, we investigate how suitable DNNs

are for accurate and automatic annotation of landmarks in video datasets

representative of those collected by scientists studying animals.

Our work shows, through extensive experimentation on videos of hawkmoths,

that DNNs are suitable for automatic and accurate landmark localization. In

particular, we show that one of our proposed DNNs is more accurate than the

current best algorithm for automatic localization of landmarks on hawkmoth

videos. Moreover, we demonstrate how these annotations can be used to

quantitatively analyze the 3D flight of a hawkmoth. To facilitate the use of

DNNs by scientists from many different fields, we provide a self contained

explanation of what DNNs are, how they work, and how to apply them to other

datasets using the freely available library Caffe and supplemental code that we

provide.

Deep Learning the Indus Script

Satish Palaniappan , Ronojoy Adhikari

Comments: 17 pages, 10 figures, 7 supporting figures (2 pages)

Subjects

Computer Vision and Pattern Recognition (cs.CV)

; Computation and Language (cs.CL); Learning (cs.LG)

Standardized corpora of undeciphered scripts, a necessary starting point for

computational epigraphy, requires laborious human effort for their preparation

from raw archaeological records. Automating this process through machine

learning algorithms can be of significant aid to epigraphical research. Here,

we take the first steps in this direction and present a deep learning pipeline

that takes as input images of the undeciphered Indus script, as found in

archaeological artifacts, and returns as output a string of graphemes, suitable

for inclusion in a standard corpus. The image is first decomposed into regions

using Selective Search and these regions are classified as containing textual

and/or graphical information using a convolutional neural network. Regions

classified as potentially containing text are hierarchically merged and trimmed

to remove non-textual information. The remaining textual part of the image is

segmented using standard image processing techniques to isolate individual

graphemes. This set is finally passed to a second convolutional neural network

to classify the graphemes, based on a standard corpus. The classifier can

identify the presence or absence of the most frequent Indus grapheme, the “jar”

sign, with an accuracy of 92%. Our results demonstrate the great potential of

deep learning approaches in computational epigraphy and, more generally, in the

digital humanities.

Segmentation of optic disc, fovea and retinal vasculature using a single convolutional neural network

Jen Hong Tan , U. Rajendra Acharya , Sulatha V. Bhandary , Kuang Chua Chua , Sobha Sivaprasad Subjects : Computer Vision and Pattern Recognition (cs.CV) ; Learning (cs.LG)

We have developed and trained a convolutional neural network to automatically

and simultaneously segment optic disc, fovea and blood vessels. Fundus images

were normalised before segmentation was performed to enforce consistency in

background lighting and contrast. For every effective point in the fundus

image, our algorithm extracted three channels of input from the neighbourhood

of the point and forward the response across the 7 layer network. In average,

our segmentation achieved an accuracy of 92.68 percent on the testing set from

Drive database.

Solving Uncalibrated Photometric Stereo Using Fewer Images by Jointly Optimizing Low-rank Matrix Completion and Integrability

Soumyadip Sengupta , Hao Zhou , Walter Forkel , Ronen Basri , Tom Goldstein , David W. Jacobs Subjects : Computer Vision and Pattern Recognition (cs.CV)

We introduce a new, integrated approach to uncalibrated photometric stereo.

We perform 3D reconstruction of Lambertian objects using multiple images

produced by unknown, directional light sources. We show how to formulate a

single optimization that includes rank and integrability constraints, allowing

also for missing data. We then solve this optimization using the Alternate

Direction Method of Multipliers (ADMM). We conduct extensive experimental

evaluation on real and synthetic data sets. Our integrated approach is

particularly valuable when performing photometric stereo using as few as 4-6

images, since the integrability constraint is capable of improving estimation

of the linear subspace of possible solutions. We show good improvements over

prior work in these cases.

Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications Using HyperMapper

Luigi Nardi , Bruno Bodin , Sajad Saeedi , Emanuele Vespa , Andrew J. Davison , Paul H. J. Kelly

Comments: 10 pages, Keywords: design space exploration, machine learning, computer vision, SLAM, embedded systems, GPU, crowd-sourcing

Subjects

Computer Vision and Pattern Recognition (cs.CV)

; Distributed, Parallel, and Cluster Computing (cs.DC); Learning (cs.LG); Performance (cs.PF)

In this paper we investigate an emerging application, 3D scene understanding,

likely to be significant in the mobile space in the near future. The goal of

this exploration is to reduce execution time while meeting our quality of

result objectives. In previous work we showed for the first time that it is

possible to map this application to power constrained embedded systems,

highlighting that decision choices made at the algorithmic design-level have

the most impact.

As the algorithmic design space is too large to be exhaustively evaluated, we

use a previously introduced multi-objective Random Forest Active Learning

prediction framework dubbed HyperMapper, to find good algorithmic designs. We

show that HyperMapper generalizes on a recent cutting edge 3D scene

understanding algorithm and on a modern GPU-based computer architecture.

HyperMapper is able to beat an expert human hand-tuning the algorithmic

parameters of the class of Computer Vision applications taken under

consideration in this paper automatically. In addition, we use crowd-sourcing

using a 3D scene understanding Android app to show that the Pareto front

obtained on an embedded system can be used to accelerate the same application

on all the 83 smart-phones and tablets crowd-sourced with speedups ranging from

2 to over 12.

Learning to Compose with Professional Photographs on the Web

Yi-Ling Chen , Jan Klopp , Min Sun , Shao-Yi Chien , Kwan-Liu Ma Subjects : Computer Vision and Pattern Recognition (cs.CV)

Photo composition is an important factor affecting the aesthetics in

photography. However, it is a highly challenging task to model the aesthetic

properties of good compositions due to the lack of globally applicable rules to

the wide variety of photographic styles. Inspired by the thinking process of

photo taking, we treat the photo composition problem as a view finding process

which successively examines pairs of views and determines the aesthetic

preference. Without devising complex hand-crafted features, the ranking model

is built upon a deep convolutional neural network through joint representation

learning from raw pixels. Exploiting rich professional photographs on the web

as data source, we devise a nearly unsupervised approach to generate unlimited

high quality image pairs for training the network. The resulting ranking model

is generic and without any heuristics. The experimental results show that the

proposed view finding network achieves state-of-the-art performance with simple

sliding window search strategy on two image cropping datasets.

HashNet: Deep Learning to Hash by Continuation

Zhangjie Cao , Mingsheng Long , Jianmin Wang , Philip S. Yu Subjects : Learning (cs.LG) ; Computer Vision and Pattern Recognition (cs.CV)

Learning to hash has been widely applied to approximate nearest neighbor

search for large-scale multimedia retrieval, due to its computation efficiency

and retrieval quality. Deep learning to hash, which improves retrieval quality

by end-to-end representation learning and hash encoding, has received

increasing attention recently. Subject to the vanishing gradient difficulty in

the optimization with binary activations, existing deep learning to hash

methods need to first learn continuous representations and then generate binary

hash codes in a separated binarization step, which suffer from substantial loss

of retrieval quality. This paper presents HashNet, a novel deep architecture

for deep learning to hash by continuation method, which learns exactly binary

hash codes from imbalanced similarity data where the number of similar pairs is

much smaller than the number of dissimilar pairs. The key idea is to attack the

vanishing gradient problem in optimizing deep networks with non-smooth binary

activations by continuation method, in which we begin from learning an easier

network with smoothed activation function and let it evolve during the

training, until it eventually goes back to being the original, difficult to

optimize, deep network with the sign activation function. Comprehensive

empirical evidence shows that HashNet can generate exactly binary hash codes

and yield state-of-the-art multimedia retrieval performance on standard

benchmarks.

Artificial Intelligence

Two forms of minimality in ASPIC+

Zimi Li , Andrea Cohen , Simon Parsons Subjects : Artificial Intelligence (cs.AI) ; Logic in Computer Science (cs.LO)

Many systems of structured argumentation explicitly require that the facts

and rules that make up the argument for a conclusion be the minimal set

required to derive the conclusion. ASPIC+ does not place such a requirement on

arguments, instead requiring that every rule and fact that are part of an

argument be used in its construction. Thus ASPIC+ arguments are minimal in the

sense that removing any element of the argument would lead to a structure that

is not an argument. In this brief note we discuss these two types of minimality

and show how the first kind of minimality can, if desired, be recovered in

ASPIC+.

Procedural Content Generation via Machine Learning (PCGML)

Adam Summerville , Sam Snodgrass , Matthew Guzdial , Christoffer Holmgård , Amy K. Hoover , Aaron Isaksen , Andy Nealen , Julian Togelius Subjects : Artificial Intelligence (cs.AI)

This survey explores Procedural Content Generation via Machine Learning

(PCGML), defined as the generation of game content using machine learning

models trained on existing content. As the importance of PCG for game

development increases, researchers explore new avenues for generating

high-quality content with or without human involvement; this paper addresses

the relatively new paradigm of using machine learning (in contrast with

search-based, solver-based, and constructive methods). We focus on what is most

often considered functional game content such as platformer levels, game maps,

interactive fiction stories, and cards in collectible card games, as opposed to

cosmetic content such as sprites and sound effects. In addition to using PCG

for autonomous generation, co-creativity, mixed-initiative design, and

compression, PCGML is suited for repair, critique, and content analysis because

of its focus on modeling existing content. We discuss various data sources and

representations that affect the resulting generated content. Multiple PCGML

methods are covered, including neural networks, long short-term memory (LSTM)

networks, autoencoders, and deep convolutional networks; Markov models,

(n)-grams, and multi-dimensional Markov chains; clustering; and matrix

factorization. Finally, we discuss open problems in the application of PCGML,

including learning from small datasets, lack of training data, multi-layered

learning, style-transfer, parameter tuning, and PCG as a game mechanic.

Multilingual and Cross-lingual Timeline Extraction

Egoitz Laparra , Rodrigo Agerri , Itziar Aldabe , German Rigau

Comments: 20 pages, 7 tables, 7 figures; submitted to Knowledge Based Systems (Elsevier), January, 2017

Subjects

Computation and Language (cs.CL)

; Artificial Intelligence (cs.AI)

In this paper we present an approach to extract ordered timelines of events,

their participants, locations and times from a set of multilingual and

cross-lingual data sources. Based on the assumption that event-related

information can be recovered from different documents written in different

languages, we extend the Cross-document Event Ordering task presented at

SemEval 2015 by specifying two new tasks for, respectively, Multilingual and

Cross-lingual Timeline Extraction. We then develop three deterministic

algorithms for timeline extraction based on two main ideas. First, we address

implicit temporal relations at document level since explicit time-anchors are

too scarce to build a wide coverage timeline extraction system. Second, we

leverage several multilingual resources to obtain a single, inter-operable,

semantic representation of events across documents and across languages. The

result is a highly competitive system that strongly outperforms the current

state-of-the-art. Nonetheless, further analysis of the results reveals that

linking the event mentions with their target entities and time-anchors remains

a difficult challenge. The systems, resources and scorers are freely available

to facilitate its use and guarantee the reproducibility of results.

Information Retrieval

Semantic URL Analytics to Support Efficient Annotation of Large Scale Web Archives

Tarcisio Souza , Elena Demidova , Thomas Risse , Helge Holzmann , Gerhard Gossen , Julian Szymanski Subjects : Information Retrieval (cs.IR)

Long-term Web archives comprise Web documents gathered over longer time

periods and can easily reach hundreds of terabytes in size. Semantic

annotations such as named entities can facilitate intelligent access to the Web

archive data. However, the annotation of the entire archive content on this

scale is often infeasible. The most efficient way to access the documents

within Web archives is provided through their URLs, which are typically stored

in dedicated index files.The URLs of the archived Web documents can contain

semantic information and can offer an efficient way to obtain initial semantic

annotations for the archived documents. In this paper, we analyse the

applicability of semantic analysis techniques such as named entity extraction

to the URLs in a Web archive. We evaluate the precision of the named entity

extraction from the URLs in the Popular German Web dataset and analyse the

proportion of the archived URLs from 1,444 popular domains in the time interval

from 2000 to 2012 to which these techniques are applicable. Our results

demonstrate that named entity recognition can be successfully applied to a

large number of URLs in our Web archive and provide a good starting point to

efficiently annotate large scale collections of Web documents.

Computation and Language

Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey

Lorenzo Ferrone , Fabio Massimo Zanzotto

Comments: 25 pages

Subjects

Computation and Language (cs.CL)

Natural language and symbols are intimately correlated. Recent advances in

machine learning (ML) and in natural language processing (NLP) seem to

contradict the above intuition: symbols are fading away, erased by vectors or

tensors called distributed and distributional representations. However, there

is a strict link between distributed/distributional representations and

symbols, being the first an approximation of the second. A clearer

understanding of the strict link between distributed/distributional

representations and symbols will certainly lead to radically new deep learning

networks. In this paper we make a survey that aims to draw the link between

symbolic representations and distributed/distributional representations. This

is the right time to revitalize the area of interpreting how symbols are

represented inside neural networks.

Analysing Temporal Evolution of Interlingual Wikipedia Article Pairs

Simon Gottschalk , Elena Demidova

Comments: Published in the SIGIR ’16 Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

Subjects

Computation and Language (cs.CL)

Wikipedia articles representing an entity or a topic in different language

editions evolve independently within the scope of the language-specific user

communities. This can lead to different points of views reflected in the

articles, as well as complementary and inconsistent information. An analysis of

how the information is propagated across the Wikipedia language editions can

provide important insights in the article evolution along the temporal and

cultural dimensions and support quality control. To facilitate such analysis,

we present MultiWiki – a novel web-based user interface that provides an

overview of the similarities and differences across the article pairs

originating from different language editions on a timeline. MultiWiki enables

users to observe the changes in the interlingual article similarity over time

and to perform a detailed visual comparison of the article snapshots at a

particular time point.