The remarkable Hilbert space H^2 (Part I – definition and interpolation theory)

by Orr Shalit

This series of posts is based on the colloquium talk that I was supposed to give on November 20, at our department. As fate had it, that week studies were cancelled.

Several people in our department thought it would be a nice idea if, alongside the usual colloquium talks in which invited speakers highlight their recent achievements, we also had some talks by department members that serve as an exposition of the fields they work in. So my talk was supposed to be an exposition of the setting in which much of my research takes place.

The topic of the “talk” is the Hilbert space H^2_d. There will be three parts to this series:

  1. Definition and interpolation theory
  2. Multivariate operator theory and model theory
  3. Current research problems

1. Introduction

What is H^2?

H^2 is a Hilbert space. One may ask: what could be remarkable about a Hilbert space? A Hilbert space is a Hilbert space, and they are all isomorphic, are they not? 

This is a real question: a member of my department actually asked it a week before this talk, after the abstract was published. I have two answers to this question.

The first answer is that H^2 is not just a Hilbert space, it is a Hilbert function space, so it has a much richer structure than a mere Hilbert space. The function theory that arises in the context of the space H^2 connects in a very fruitful way with the Hilbert space structure. More on this soon.

The second answer is that whenever we pick a particular construction of a Hilbert space to work in, we are choosing a representation for some object of interest. In other words, the particular Hilbert space we choose to work with comes along with a set of natural operators. As an example, consider a countable group G, and consider \ell^2(G). This is the same Hilbert space as \ell^2(\mathbb{N}), but \ell^2(G) invites us to represent G on it in a very natural way, while \ell^2(\mathbb{N}) makes no such invitation. The operator theory that naturally arises in the context of the space H^2 is what makes this Hilbert space special. More on this later.

2. The function space H^2

In fact, there is a sequence of spaces H^2_d that interest us. Let d be a positive integer or \infty. Let B_d denote the open unit ball in \mathbb{C}^d (where \mathbb{C}^d is understood to be \ell^2 when d=\infty). I ask of you, for this talk, not to get bogged down with questions about the d = \infty case; in fact, if you are not an operator theorist you may as well take d=1, where things are interesting enough. We define H^2_d to be the space of holomorphic functions f : B_d \rightarrow \mathbb{C} with Taylor series

f(z) = \sum_{n \in \mathbb{N}^d} a_n z^n

whose coefficients satisfy

(*)   \|f\|^2 = \sum_n \frac{n!}{|n|!}|a_n|^2 < \infty.

Here we are using the standard multi–index notation: for n = (n_1, \ldots, n_d) \in \mathbb{N}^d we put n! = n_1! n_2 ! \cdots n_d! and |n| = n_1 + \ldots + n_d (and of course \mathbb{N} = \{0,1,2,\ldots\}; also, \mathbb{N}^\infty is the direct sum, not the product, so every multi–index has only finitely many nonzero entries).
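As a small illustration of the multi–index notation and of the weights in (*), here is a minimal Python sketch that evaluates \|f\|^2 for a polynomial f. Representing f as a dictionary from multi–indices to coefficients, and the function names themselves, are assumptions made only for this example.

```python
from math import factorial, prod  # math.prod requires Python 3.8+


def monomial_weight(n):
    """Return n!/|n|! for a multi-index n = (n_1, ..., n_d)."""
    n_factorial = prod(factorial(k) for k in n)   # n! = n_1! n_2! ... n_d!
    return n_factorial / factorial(sum(n))        # |n| = n_1 + ... + n_d


def h2d_norm_squared(coeffs):
    """Evaluate (*) for a polynomial given as {multi-index: coefficient}."""
    return sum(monomial_weight(n) * abs(a) ** 2 for n, a in coeffs.items())


# d = 2: the mixed monomial z_1 z_2 has a smaller norm than the pure power z_1^2.
print(h2d_norm_squared({(1, 1): 1}))   # weight 1!1!/2! = 0.5
print(h2d_norm_squared({(2, 0): 1}))   # weight 2!0!/2! = 1.0
```

For d = 1 every weight equals 1, so (*) reduces to the usual \ell^2 norm of the Taylor coefficients.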

Equation (*) defines a Hilbert space norm on H^2_d and it is very easy to figure out what the inner product has to be.

So H^2_d can be very naturally identified with a weighted \ell^2–space, but we really want to think about it as a space of functions. These are not equivalence classes of functions, as we have when we look at the space L^2[0,1]; these are honest–to–God analytic functions that have well defined values at every point. The crucial fact is that point evaluation is a bounded functional on H^2_d. The easiest way to show this is to exhibit, for every w \in B_d, an element k_w \in H^2_d such that for all f \in H^2_d,

\langle f, k_w \rangle = f(w) .

A simple computation (using the orthogonality of the monomials) shows that the unique function that satisfies this is

k_w(z) = \frac{1}{1 - \langle z, w \rangle} .

Let us carry out the computation in the case d = 1. Let f(z) = \sum a_n z^n, and k_w(z) = \frac{1}{1-z \overline{w}} = \sum (z\overline{w})^n. Then

\langle f, k_w \rangle = \langle \sum a_n z^n, \sum \overline{w}^n z^n \rangle = \sum a_n w^n = f(w).
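The same computation goes through for d > 1, where the weights n!/|n|! in the inner product cancel against the Taylor coefficients of k_w. By the multinomial theorem, \frac{1}{1 - \langle z, w \rangle} = \sum_n \frac{|n|!}{n!} \overline{w}^n z^n. The following Python sketch checks both facts numerically for d = 2; the test polynomial, the sample points and all function names are arbitrary choices made for this illustration.

```python
from itertools import product
from math import factorial, prod  # math.prod requires Python 3.8+


def monomial_weight(n):
    """Squared norm of z^n in H^2_d, namely n!/|n|!."""
    return prod(factorial(k) for k in n) / factorial(sum(n))


def monomial(z, n):
    """Evaluate z^n = z_1^{n_1} ... z_d^{n_d}."""
    return prod(complex(zi) ** ni for zi, ni in zip(z, n))


def kernel_coeff(w, n):
    """Taylor coefficient of k_w at z^n: (|n|!/n!) * conj(w)^n."""
    return monomial(w, n).conjugate() / monomial_weight(n)


def inner_with_kernel(coeffs, w):
    """<f, k_w> for a polynomial f given as {multi-index: coefficient}."""
    return sum(monomial_weight(n) * a * kernel_coeff(w, n).conjugate()
               for n, a in coeffs.items())


def evaluate(coeffs, w):
    """Evaluate the polynomial f at the point w."""
    return sum(a * monomial(w, n) for n, a in coeffs.items())


def truncated_kernel(z, w, max_degree):
    """Partial sum of the Taylor series of k_w(z) up to total degree max_degree."""
    return sum(kernel_coeff(w, n) * monomial(z, n)
               for n in product(range(max_degree + 1), repeat=len(z))
               if sum(n) <= max_degree)


f = {(0, 0): 1, (1, 0): 2 - 1j, (1, 1): 0.5j, (0, 3): -1}
w = (0.1 + 0.2j, -0.4)   # a point of B_2
z = (0.3, 0.2j)          # another point of B_2

# Reproducing property: both numbers agree up to rounding.
print(inner_with_kernel(f, w), evaluate(f, w))

# Closed form of the kernel versus its truncated Taylor series.
inner_zw = sum(complex(zi) * complex(wi).conjugate() for zi, wi in zip(z, w))
print(truncated_kernel(z, w, 40), 1 / (1 - inner_zw))
```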

The fact that point evaluation is a bounded linear functional is the starting point of an intimate relationship between the Hilbert space structure and the operator theory of H^2_d, on the one hand, and the function theory of H^2_d, on the other hand. It is remarkable that both sides have benefited from this relationship.

I personally find the more interesting (or surprising) side of this story to be that operator theory has applications to complex function theory. I will tell you about my favorite example.

3. Nevanlinna–Pick interpolation

Let z_1, \ldots, z_k be points in the unit disc D, and let w_1, \ldots, w_k be complex numbers. One can always find an analytic function f :D \rightarrow \mathbb{C} that interpolates this data, meaning that f(z_i) = w_i for i=1, \ldots, k. This is easy to do with polynomials. However, for some applications such as control theory (and also for the glory of humankind) it is desirable to find an optimal solution to this interpolation problem. For example, one would like to find an analytic function f :D \rightarrow \mathbb{C} that interpolates the data and has the smallest possible sup norm

\|f\|_\infty = \sup_{z \in D}|f(z)| .

It is not hard to see that we can determine the minimal norm of an interpolating function if we know how to solve the following problem. Denote by H^\infty = H^\infty(D) the Banach algebra of bounded analytic functions on the disc with the sup norm.

Nevanlinna–Pick interpolation problem: Given z_1, \ldots, z_k \in D and w_1, \ldots, w_k \in D, does there exist f \in H^\infty, with \|f\|_\infty \leq 1, that satisfies f(z_i) = w_i for i = 1, \ldots, k?

G. Pick (1916) and R. Nevanlinna (1919) independently solved this problem. They provided the following very satisfying solution.

Theorem 1: The Nevanlinna–Pick problem has a solution if and only if the matrix

\left(\frac{1-w_i \overline{w}_j}{1-z_i \overline{z}_j} \right)_{i,j=1}^k

is positive semi-definite.

This is a very satisfying solution because, given the data, you can actually form this matrix and check whether or not it is positive semi-definite.
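To make "form this matrix and check" concrete, here is a minimal Python sketch that builds the matrix of Theorem 1 and tests whether it has no negative eigenvalues (positive semi-definiteness). The test used here, symmetric elimination with diagonal pivoting and an ad hoc rounding tolerance, is just one possible choice and not part of the theorem; the function names are likewise assumptions of this example.

```python
def pick_matrix(zs, ws):
    """Build the matrix ((1 - w_i conj(w_j)) / (1 - z_i conj(z_j)))."""
    zs = [complex(z) for z in zs]
    ws = [complex(w) for w in ws]
    return [[(1 - wi * wj.conjugate()) / (1 - zi * zj.conjugate())
             for zj, wj in zip(zs, ws)]
            for zi, wi in zip(zs, ws)]


def is_positive_semidefinite(matrix, tol=1e-10):
    """Test a Hermitian matrix for positive semi-definiteness.

    Uses symmetric elimination with diagonal pivoting (a pivoted Cholesky
    sweep); tol is an ad hoc tolerance for rounding errors.
    """
    a = [[complex(x) for x in row] for row in matrix]
    size = len(a)
    for step in range(size):
        # Pick the largest remaining diagonal entry as pivot.
        pivot = max(range(step, size), key=lambda i: a[i][i].real)
        d = a[pivot][pivot].real
        if d <= tol:
            # No positive pivot is left: the matrix is positive semi-definite
            # only if the whole remaining block is numerically zero.
            return all(abs(a[i][j]) <= tol
                       for i in range(step, size)
                       for j in range(step, size))
        # Symmetric swap of rows and columns step <-> pivot.
        a[step], a[pivot] = a[pivot], a[step]
        for row in a:
            row[step], row[pivot] = row[pivot], row[step]
        # Replace the trailing block by its Schur complement.
        for i in range(step + 1, size):
            for j in range(step + 1, size):
                a[i][j] -= a[i][step] * a[step][j] / d
    return True


# f(z) = z interpolates this data and has sup norm 1, so the problem is solvable.
print(is_positive_semidefinite(pick_matrix([0, 0.5], [0, 0.5])))
# By the Schwarz lemma, f(0) = 0 forces |f(0.5)| <= 0.5, so this data is not solvable.
print(is_positive_semidefinite(pick_matrix([0, 0.5], [0, 0.9])))
```

The first call should report that the matrix passes the test and the second that it fails, in agreement with the Schwarz lemma.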

In 1967 D. Sarason introduced a new approach to this problem, which could simultaneously treat Nevanlinna–Pick interpolation problems as well as other interpolation problems of interest. His approach used operator theory on H^2 (the case d=1) in an essential way, and among other things, it gave the following result (first proved by Sz.-Nagy–Korányi, also by operator theoretic techniques).

Theorem 2: Given z_1, \ldots, z_k \in D and W_1, \ldots, W_k \in M_n(\mathbb{C}), there exists a bounded analytic matrix valued function f, with \|f\|_\infty \leq 1, that satisfies f(z_i) = W_i for i = 1, \ldots, k, if and only if the nk \times nk matrix

\left(\frac{1-W_i W_j^*}{1-z_i \overline{z}_j} \right)_{i,j=1}^k

is positive semi-definite.

So what do these beautiful theorems have to do with our space? It seems as if the problem is in the wrong space: we just introduced the Hilbert space H^2, but in the NP problem we are looking for a function in H^\infty. It turns out that H^\infty is very closely related to H^2. H^\infty is equal to the so–called multiplier algebra of H^2, that is,

H^\infty = \{f : D \rightarrow \mathbb{C} : \forall h \in H^2 . fh \in H^2 \} .

Moreover, the space of bounded analytic n \times n matrix valued functions is the multiplier algebra of the space of vector valued functions H^2 \otimes \mathbb{C}^n. This simple connection allows us to harness all the power of operator theory to the function theoretic NP problem.

4. Reproducing kernel Hilbert spaces and complete NP kernels

Our discussion fits in a larger framework.

Definition 3: Let X be a set. A reproducing kernel Hilbert space (RKHS for short, also called a Hilbert function space) is a Hilbert space H that consists of functions X \rightarrow \mathbb{C}, in which point evaluation at any point x \in X is a bounded functional on H.

Since point evaluation is bounded, we have, for any x \in X, a unique k_x \in H such that

(*)  \forall f \in H . f(x) = \langle f, k_x \rangle .

So k_x is itself a function on X. The functions k_x are called kernel functions. Denote k(x,y) = k_y(x) = \langle k_y, k_x \rangle. The function k: X \times X \rightarrow \mathbb{C} satisfies that for every x_1, \ldots, x_n \in X, the matrix

\big(k(x_i,x_j) \big)_{i,j=1}^n

is positive semi-definite. Such a function is said to be a positive definite kernel. It is also referred to as a reproducing kernel, because the kernel functions “reproduce” the functions in H by (*). It is a fact (known as Aronszajn’s Theorem) that every positive definite kernel is the reproducing kernel of an RKHS. If k is a positive definite kernel, one sometimes denotes by H(k) the RKHS that has k as its reproducing kernel.
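The positivity of these matrices comes from the identity \sum_{i,j} \overline{c_i} c_j k(x_i, x_j) = \| \sum_j c_j k_{x_j} \|^2 \geq 0, valid for all complex numbers c_1, \ldots, c_n. Here is a small Python sketch that checks this numerically for the kernel k(x,y) = \frac{1}{1 - x\overline{y}} of H^2 on the disc; the random points and coefficients and the fixed seed are arbitrary choices made for this illustration.

```python
import cmath
import random


def szego_kernel(x, y):
    """k(x, y) = 1 / (1 - x conj(y)) for x, y in the open unit disc."""
    return 1 / (1 - x * y.conjugate())


def quadratic_form(points, coeffs, kernel):
    """Return sum_{i,j} conj(c_i) c_j k(x_i, x_j)."""
    return sum(ci.conjugate() * cj * kernel(xi, xj)
               for xi, ci in zip(points, coeffs)
               for xj, cj in zip(points, coeffs))


rng = random.Random(0)  # fixed seed so that the run is repeatable
points = [cmath.rect(rng.uniform(0, 0.9), rng.uniform(0, 2 * cmath.pi))
          for _ in range(5)]
coeffs = [complex(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(5)]

q = quadratic_form(points, coeffs, szego_kernel)
# The value should be real and nonnegative, up to rounding.
print(q)
```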

Every RKHS H has a multiplier algebra, defined by

Mult(H) = \{ f: X \rightarrow \mathbb{C} : \forall h \in H . f h \in H\} .

The multiplier algebra has a natural norm:

\|f\|_{Mult(H)} = \sup \{\|fh\|_H : h \in H, \|h\| \leq 1\} .

The matrix valued NP problem makes sense in any multiplier algebra:

Matrix valued NP interpolation problem: Given x_1, \ldots, x_n \in X and W_1, \ldots, W_n \in M_N(\mathbb{C}), does there exist f \in Mult(H) \otimes M_N(\mathbb{C}), with \|f\| \leq 1, that satisfies f(x_i) = W_i for i = 1, \ldots, n?

To clarify, Mult(H) \otimes M_N(\mathbb{C}) can be simply considered as the algebra of N \times N matrices with entries in Mult(H). This algebra acts naturally on the Hilbert space H \oplus \ldots \oplus H (N times), and the norm is the operator norm.

Many RKHS are known, and many have been studied. In some of them there is a nice solution to the NP interpolation problem, in some of them there is a solution but it is not nice, and in some of them nobody knows a characterization of when the problem is solvable. The most favorable case is the following one:

Definition 4: A kernel k is said to be a complete Nevanlinna–Pick kernel (or, for short, a complete NP kernel) if for all N, the matrix valued NP interpolation problem for x_1, \ldots, x_n \in X and W_1, \ldots, W_n \in M_N(\mathbb{C}) has a solution in Mult(H(k)) \otimes M_N(\mathbb{C}) of norm less than or equal to 1 if and only if the matrix

\left( (1-W_i W_j^* ) k(x_i, x_j) \right)_{i,j=1}^n

is positive semi-definite. In this case, H(k) is said to be a complete NP space. A multiplier algebra of a complete NP space is said to be a complete NP algebra.

I hope nobody will confuse “complete NP” with “NP complete”.

Remark: Sometimes one uses the terminology “complete Pick” instead of “complete Nevanlinna–Pick”.

Theorem 2 can be restated by saying that the kernel k(z,w) = \frac{1}{1-z\overline{w}} (known as the Szegő kernel) is a complete NP kernel. Now, this kernel is the kernel for H^2_1. It is a fact (proved by Arias–Popescu, Davidson–Pitts, and Agler–McCarthy) that the kernel of H^2_d is a complete NP kernel for all d.

Theorem 5: For all d, H^2_d is a complete NP space. 

Thus, the NP interpolation problem in Mult(H^2_d) has a very nice solution.

5. Universality of H^2_d

Are there any other complete NP spaces besides H^2_d? Yes, there are. The Sobolev space W^{1,2}([0,1]) as well as the Dirichlet space are complete NP spaces, for example. These spaces look very different from H^2_d; especially the Sobolev space, which is not even a space of analytic functions. However, the following remarkable theorem of Agler and McCarthy shows that H^2_d is the universal complete NP space.

Let us say that a kernel k is irreducible if for all distinct x,y \in X, k_x and k_y are linearly independent, but not orthogonal.

Theorem 6: Let k be an irreducible kernel on the set X, and suppose that H = H(k) is separable. Then k has the complete NP property if and only if there exist d \in \{1, 2, \ldots, \infty\}, an injection b: X \rightarrow B_d and a nowhere vanishing function c on X such that

k(x,y) = \frac{c(x) \overline{c(y)}}{1 - \langle b(x), b(y) \rangle} .

The theorem has the following consequence:

Corollary 7: Let M be a complete NP multiplier algebra. Then there is a d and an analytic variety V \subseteq B_d such that 

M \cong Mult(H^2_d)\big|_V = \{f\big|_V : f \in Mult(H^2_d)\} .

The symbol \cong stands for completely isometrically isomorphic.
