Research
Information storage and management in networks: Suppose that information is encoded with a code of length $n$ over a finite alphabet $F$.
Coordinates of the codeword are stored at the vertices of a graph $G(V,E)$ with $|V|=n.$ This setting models distributed storage systems in which
servers are represented by the vertices, and the edges of the graph represent communication constraints in the system. The problems considered here include protocols for the recovery of erased data at the vertices, bounding the communication complexity of recovery, data recovery in the presence of corrupted nodes, dynamic data maintenance in a random environment, and other similar questions.
(a) If a server (vertex) becomes unavailable, the encoding supports recovery of the lost data, and we aim at constructing a code that allows recovery with low communication complexity. This problem can be studied from the perspective of
locally recoverable codes,
aiming to minimize the number of vertices contacted for data restoration,
or of regenerating codes where we optimize the total number of symbols passed along the edges (each hop contributes a unit to the complexity of recovery).
These problems can be analyzed both on deterministic graphs and on random graphs from standard ensembles such as ${\mathcal G}_{n,p},$ where we aim to find the threshold probability for vertex recovery. Interesting versions of these problems arise when some functional nodes become corrupted and provide incorrect information about their contents.
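As a toy illustration of the locality requirement (a hypothetical brute-force check, not one of the constructions alluded to above), the following sketch computes, for a small binary code, the smallest number of other coordinates that always suffices to restore a given coordinate:

```python
from itertools import combinations, product

# A toy binary code of length 4: the linear span of 1100 and 0011
# (a hypothetical example chosen only to illustrate the locality computation).
gens = [(1, 1, 0, 0), (0, 0, 1, 1)]
n = len(gens[0])
code = {
    tuple(sum(c * g[j] for c, g in zip(coeffs, gens)) % 2 for j in range(n))
    for coeffs in product((0, 1), repeat=len(gens))
}

def locality(i):
    """Smallest r such that coordinate i is recoverable from some fixed set of r other coordinates."""
    for r in range(1, n):
        for S in combinations([j for j in range(n) if j != i], r):
            seen = {}
            # i is recoverable from S if no two codewords agree on S but differ in position i
            if all(seen.setdefault(tuple(c[j] for j in S), c[i]) == c[i] for c in code):
                return r
    return None  # two codewords differ only in position i; recovery is impossible

print([locality(i) for i in range(n)])  # -> [1, 1, 1, 1]: each bit is a copy of its partner
```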
(b) Suppose now that the graph $G$ is infinite, for instance, $G={\mathbb Z},$ and we write bi-infinite sequences on the vertices with the
constraint that any $k$-tuple of consecutive symbols is a function of its $l$-neighborhood in the graph. The first question is to find the growth
rate of this set of sequences, and it can be addressed via a connection with constrained systems.
There are multiple possible extensions of this problem, such as a probabilistic relaxation of the recovery
constraint. The analysis relies on methods from constrained systems, symbolic dynamics, and entropy theory. These problems become even more interesting if instead of ${\mathbb Z}$ we store information in higher dimensions, e.g., $G={\mathbb Z}^2,$ linking this subject with Markov random fields.
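As a minimal sketch of the transfer-matrix method from constrained systems (shown here for the classical binary $(1,\infty)$ run-length constraint, used purely as an illustration rather than the specific recovery constraint above), the growth rate of the set of admissible sequences is the logarithm of the Perron eigenvalue of the adjacency matrix of a presentation graph:

```python
import numpy as np

# Binary sequences on Z with no two consecutive 1s: states record the last symbol
# written, and the allowed transitions form the adjacency matrix below.
A = np.array([[1, 1],   # after a 0 we may write either 0 or 1
              [1, 0]])  # after a 1 we may only write 0
lam = max(np.linalg.eigvals(A).real)               # Perron eigenvalue (the golden ratio)
print("growth rate (bits/symbol):", np.log2(lam))  # ~ 0.6942
```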
(c) Another interesting class of problems, formulated here, starts with placing bits on the vertices of a finite graph (one bit per vertex) with the condition that every vertex satisfies a parity check that involves all of its neighbors. The question is to find the size of the largest code that satisfies this condition. It turns out that this question makes the most sense if the graph has no triangles. Surprisingly, for some families of triangle-free graphs it is possible to construct codes of rate asymptotically approaching one; see here and here.
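Since the parity checks are linear, the valid labelings form a binary linear code, and their number can be computed from the GF(2) rank of the adjacency matrix. A small sketch, under the assumption that the check at a vertex is the XOR of the bits on its (open) neighborhood, illustrated on the triangle-free graph $C_5$:

```python
import numpy as np

def gf2_rank(M):
    """Rank of a 0/1 matrix over GF(2) by Gaussian elimination."""
    M = M.copy() % 2
    rank = 0
    for c in range(M.shape[1]):
        pivot = next((r for r in range(rank, M.shape[0]) if M[r, c]), None)
        if pivot is None:
            continue
        M[[rank, pivot]] = M[[pivot, rank]]          # move the pivot row up
        for r in range(M.shape[0]):
            if r != rank and M[r, c]:
                M[r] ^= M[rank]                      # eliminate the rest of the column
        rank += 1
    return rank

# Triangle-free example: the 5-cycle C5.  The check at vertex v is assumed to be
# the XOR of the bits at its two neighbors; valid labelings = GF(2) kernel of A.
n = 5
A = np.zeros((n, n), dtype=np.uint8)
for v in range(n):
    A[v, (v + 1) % n] = A[v, (v - 1) % n] = 1

dim = n - gf2_rank(A)
print("number of valid labelings:", 2 ** dim)        # -> 2 (all-zeros and all-ones)
```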
Quantum codes, in particular, qubit codes and their transversal logical gates; permutation-invariant codes for correcting deletions and amplitude damping noise; absorption/emission codes for protection against photon loss/gain noise.
Codes and uniform distributions: Here one is interested in characterizing binary codes and codes in other
finite metric spaces that approximate the uniform distribution on the space. Applications of such codes could include derandomizing algorithms,
approximation theory, probability of decoding error, image processing, and concept learning (uniform laws of large numbers and VC dimension).
These problems are also connected with constructing energy-minimizing configurations in metric spaces.
(a) Smoothing of codes: We say that a (radial) function $r:\{0,1\}^n\to {\mathbb R}_0^+$ is a noise kernel if it acts on functions on $\{0,1\}^n$ by convolution: $(r\ast f)(x)=\sum_{z\in \{0,1\}^n} r(z)f(x-z).$ In particular, if $f(x)=\frac1{|C|}{\bf 1}_{C}(x)$ is the normalized indicator function of the code $C$, then under the action of $r$ the code becomes more "uniformly distributed" in terms of the distance to the uniform pmf $P(x)=2^{-n}, x\in\{0,1\}^n$.
One popular choice of the noise operator is the Bernoulli noise, defined as
$$
(T_{\beta_\delta}C)(x):=\frac 1{|C|} \sum_{y\in\{0,1\}^n} {\bf 1}_{C}(x+y)\delta^{|y|}(1-\delta)^{n-|y|}
$$
for $\delta\in(0,1).$ It is clear that a small code $C$ will resist smoothing unless $\delta$ is close to $1/2$, while a large code is more amenable to being made uniform. Thus, for a given noise level $\delta$ there is a threshold rate, called the smoothing capacity, below which asymptotically perfect smoothing is not possible.
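A minimal numerical sketch of the operator $T_{\beta_\delta}$ above (the block length, the noise level, and the randomly drawn code are arbitrary choices made only for illustration), measuring how far the smoothed distribution is from uniform in total variation:

```python
import itertools
import random

n, delta = 10, 0.11
space = list(itertools.product((0, 1), repeat=n))

# An arbitrary code of 32 points, chosen at random purely for illustration.
random.seed(1)
code = random.sample(space, 32)

def smoothed_mass(x):
    """(T_{beta_delta} C)(x): mass at x after passing the code through Bernoulli(delta) noise."""
    total = 0.0
    for c in code:
        d = sum(a != b for a, b in zip(x, c))        # Hamming distance d(x, c) = |x + c|
        total += delta ** d * (1 - delta) ** (n - d)
    return total / len(code)

# Total variation distance between the smoothed pmf and the uniform pmf 2^{-n}.
tv = 0.5 * sum(abs(smoothed_mass(x) - 2 ** (-n)) for x in space)
print("TV distance to uniform:", tv)
```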
Why would one want to make a code uniform? For instance, to make our transmissions indistinguishable from one another to an eavesdropper. This gives rise to interesting connections to thresholds for transmission over a wiretap channel [1],
[2].
This connection and many other related results for smoothing are discussed in detail in my recent paper, and they also connect the smoothing question to a more general problem in information theory known as channel resolvability.
The smoothing problem can also be stated for spherical codes as well as for lattices, and it has interesting connections with finding moments of the number of code points in a ball and with list decoding
of codes and lattices.
(b) Another version of problems of this kind seeks to minimize quadratic (spherical) discrepancy of binary codes. Let $C_N=\{z_1,\dots,z_N\}\subset\{0,1\}^n$ be a binary code, and define
$$
D^{L_2}(C_N)=\sum_{t=0}^n \sum_{x\in \{0,1\}^n}\Big(\frac 1N \sum_{j=1}^N {\bf 1}_{B(x,t)}(z_j)-\frac1{2^n}|B(x,t)|\Big)^2.
$$
Since the inner sum is proportional to the variance of the number of codewords in a ball with a random center, the discrepancy can indeed be viewed as a measure of the code's uniformity. It turns out that in the binary case this quantity can be evaluated explicitly from the distance distribution of the code. Some properties and bounds for $D^{L_2}(C_N)$ that depend only on $n$ and $N$ are obtained here and here, and they are also connected with a classical theorem known as
Stolarsky's invariance principle.
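A direct brute-force evaluation of $D^{L_2}(C_N)$ from the definition above, for a toy code (the even-weight code of length 5, an arbitrary choice made only for illustration):

```python
import itertools
from math import comb

def l2_discrepancy(code, n):
    """Quadratic discrepancy D^{L2}(C_N), computed directly from the definition."""
    N = len(code)
    space = list(itertools.product((0, 1), repeat=n))
    total = 0.0
    for t in range(n + 1):
        vol = sum(comb(n, s) for s in range(t + 1)) / 2 ** n   # |B(x,t)| / 2^n
        for x in space:
            # fraction of codewords falling in the ball B(x, t)
            frac = sum(sum(a != b for a, b in zip(x, z)) <= t for z in code) / N
            total += (frac - vol) ** 2
    return total

# Toy code: all even-weight words of length 5.
n = 5
code = [c for c in itertools.product((0, 1), repeat=n) if sum(c) % 2 == 0]
print(l2_discrepancy(code, n))
```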
Other problems:
- Algebraic constructions of codes for storage
[1],
[2]
- Codes with few distances [1], [2]
- Private distribution estimation
[1]
Continual support of the US National Science Foundation is gratefully acknowledged.