Skip to content

Alexander Van Werde

Menu
  • About Me
  • Blog
  • Events
  • Research
Menu

Research

Preprints

Exact cospectrality probabilities for uniform random matrices. arXiv preprint (2026) [pdf] [arXiv] [supplement]
▷Abstract

We study the conjugation action of orthogonal matrices on symmetric random matrices. Given a fixed orthogonal matrix over an algebraic number field and a random matrix with entries sufficiently uniform in the ring of integers, we wonder what the probability is that the conjugate is again integral. Our main result establishes an exact formula for this probability in terms of the Smith ideals associated to the orthogonal matrix.

As an illustrative application, we establish exact formulas for the expected number of rational orthogonal matrices that preserve the integrality of a random matrix for every fixed denominator in dimensions two and three. Notably, the dependence on the denominator turns out to be non-monotone due to number-theoretic fluctuations. We also prove bounds on the probability of rational cospectrality with bounded but arbitrarily large denominator.

A sufficient condition for generalized spectral characterization of graphs with loops. arXiv preprint (2025). [pdf] [arXiv]
▷Abstract

Sufficient conditions for a simple graph to be characterized up to isomorphism given its spectrum and the spectrum of its complement graph are known due to Wang and Xu. This note establishes a related sufficient condition in the presence of loops: if the walk matrix has square-free determinant, then the graph is characterized by its generalized spectrum. The proof includes a general result about symmetric integral matrices.

Worst-case mixing estimates for Brownian motion with semipermeable barriers. With Jaron Sanders. arXiv preprint (2025). [pdf] [arXiv]
▷Abstract

We study the mixing properties of a Brownian motion whose movements are hindered by semipermeable barriers. Our setting assumes that the process takes values in a smooth planar domain and that the barriers are one-dimensional closed curves. We establish an upper bound on the mixing time and a lower bound on the stationary distribution in terms of geometric parameters. These worst-case bounds decay at an exponential rate as the domain grows large, and we give examples that show that exponential decay is necessary in our worst-case setting.

Recovering semipermeable barriers from reflected Brownian motion. With Jaron Sanders. arXiv preprint (2024). [pdf] [arXiv] [GitHub]
▷Abstract

We study the recovery of one-dimensional semipermeable barriers for a stochastic process in a planar domain. The considered process acts like Brownian motion when away from the barriers and is reflected upon contact until a sufficient but random amount of interaction has occurred, determined by the permeability, after which it passes through. Given a sequence of samples, we wonder when one can determine the location and shape of the barriers.

This paper identifies several different recovery regimes, determined by the available observation period and the time between samples, with qualitatively different behavior. The observation period T dictates if the full barriers or only certain pieces can be recovered, and the sampling rate significantly influences the convergence rate as T→∞. This rate turns out polynomial for fixed-frequency data, but exponentially fast in a high-frequency regime.

Further, the environment’s impact on the difficulty of the problem is quantified using interpretable parameters in the recovery guarantees, and is found to also be regime-dependent. For instance, the curvature of the barriers affects the convergence rate for fixed-frequency data, but becomes irrelevant when T→∞ with high-frequency data.

The results are accompanied by explicit algorithms, and we conclude by illustrating the application to real-life data.

Sharp concentration for sums of matrices with Markovian dependence through universality. With Jaron Sanders. arXiv preprint (2023). [pdf] [arXiv] [YouTube]
▷Abstract

We prove that a sum of random matrices generated by a ψ-mixing Markov chain has similar spectral properties to a Gaussian matrix with the same mean and covariance structure. This nonasymptotic universality principle enables sharp concentration inequalities when combined with recent advances in the Gaussian literature. We illustrate the theory with examples, showing how it enables polynomial dimensional improvements relative to previous Markovian matrix concentration results when applied to Wigner-type matrices, and how one can recover sharp limiting values for a model used to study of spectral clustering techniques.

A key challenge in the proof is that techniques based only on classical cumulants, which can be used when summands are independent, are not sufficient on their own for efficient estimates in a Markovian setting. Our approach exploits Boolean cumulants and a change–of–measure argument.

Published works

Detection and evaluation of clusters within sequential data. With Albert Senen-Cerda, Gianluca Kosmella, and Jaron Sanders. Data Mining and Knowledge Discovery (2025). [pdf] [arXiv] [Journal] [GitLab] [PyPi]
▷Abstract

Sequential data is ubiquitous—it is routinely gathered to gain insights into complex processes such as behavioral, biological, or physical processes. Challengingly, such data not only has dependencies within the observed sequences, but the observa- tions are also often high-dimensional, sparse, and noisy. These are all difficulties that obscure the inner workings of the complex process under study. One solution is to calculate a low-dimensional representation that describes (characteristics of) the complex process. This representation can then serve as a proxy to gain insight into the original process. However, uncovering such low-dimensional representa- tion within sequential data is nontrivial due to the dependencies, and an algorithm specifically made for sequences is needed to guarantee estimator consistency. For- tunately, recent theoretical advancements on Block Markov Chains have resulted in new clustering algorithms that can provably do just this in synthetic sequential data. This paper presents a first field study of these new algorithms in real-world sequen- tial data; a wide empirical study of clustering within a range of data sequences. We investigate broadly whether, when given sparse high-dimensional sequential data of real-life complex processes, useful low-dimensional representations can in fact be extracted using these algorithms. Concretely, we examine data sequences containing GPS coordinates describing animal movement, strands of human DNA, texts from English writing, and daily yields in a financial market. The low-dimensional rep- resentations we uncover are shown to not only successfully encode the sequential structure of the data, but also to enable gaining new insights into the underlying complex processes.

Cokernel statistics for walk matrices of directed and weighted random graphs. Combinatorics, Probability and Computing (2024). [pdf] [arXiv] [Journal]
▷Abstract

The walk matrix associated to an n×n integer matrix X and an integer vector b is defined by W:=(b,Xb,…,Xn−1b). We study limiting laws for the cokernel of W in the scenario where X is a random matrix with independent entries and b is deterministic. Our first main result provides a formula for the distribution of the pm-torsion part of the cokernel, as a group, when X has independent entries from a specific distribution. The second main result relaxes the distributional assumption and concerns the Z[x]-module structure.

The motivation for this work arises from an open problem in spectral graph theory which asks to show that random graphs are often determined up to isomorphism by their (generalized) spectrum. Sufficient conditions for generalized spectral determinacy can namely be stated in terms of the cokernel of a walk matrix. Extensions of our results could potentially be used to determine how often those conditions are satisfied. Some remaining challenges for such extensions are outlined in the paper.

Estimates for zero loci of Bernstein-Sato ideals. With Nero Budur and Robin van der Veer. Publications of the Research Institute for Mathematical Sciences (2024). [pdf] [arXiv] [Journal]
▷Abstract

We give estimates for the zero loci of Bernstein-Sato ideals. An upper bound is proved as a multivariate generalisation of the upper bound by Lichtin for the roots of Bernstein-Sato polynomials. The lower bounds generalise the fact that log-canonical thresholds, small jumping numbers of multiplier ideals, and their real versions provide roots of Bernstein-Sato polynomials.

Singular value distribution of dense random matrices with block Markovian dependence. With Jaron Sanders. Stochastic Processes and their Applications (2023). [pdf] [arXiv] [Journal]
▷Abstract

A block Markov chain is a Markov chain whose state space can be partitioned into a finite number of clusters such that the transition probabilities only depend on the clusters. Block Markov chains thus serve as a model for Markov chains with communities. This paper establishes limiting laws for the singular value distributions of the empirical transition matrix and empirical frequency matrix associated to a sample path of the block Markov chain whenever the length of the sample path is Θ(n²) with n the size of the state space.

The proof approach is split into two parts. First, we introduce a class of symmetric random matrices with dependence called approximately uncorrelated random matrices with variance profile. We establish their limiting eigenvalue distributions by means of the moment method. Second, we develop a coupling argument to show that this general-purpose result applies to block Markov chains.

Theses

Latent structures in Markovian processes and associated random matrices. Awarded with honors (top 5% of PhD graduates), Eindhoven University of Technology (2025). [pdf] [TU/e] [Honors distinction]
▷Summary

Sequential data is essential for the understanding of processes occurring in various areas, ranging from natural language processing to ecology and microbiology. However, deciphering the meaning hidden within the sequential order of data points can be difficult in practice as observations are often noisy and high-dimensional, and the underlying processes are typically complex. Adopting a mathematical perspective, this thesis pursues fundamental understanding of algorithms that can uncover latent structures encoding the key dynamics of such processes. Chapter 1 introduces the main themes and summarizes our contributions.

The first part of this thesis investigates a stochastic process whose movements are hindered by barriers in the environment, referred to as Brownian motion with semipermeable barriers. Given an observed sequence of samples from the process, we consider the problem of recovering the shape and location of the barriers. Our goal is to understand how the difficulty of this recovery problem depends on the structure of the environment, including the geometry of the barriers and their permeability, and how it depends on the nature of the data determined by the intersample time and the total observation period. We start these investigations in Chapter 2 by proving basic properties related to the well-definedness of the process, and by giving key proof techniques used in the subsequent chapters.

Our results regarding the recovery problem are rigorously stated and proved in Chapters 3 and 4 and reveal that the problem has qualitatively different behavior depending on the nature of the available data. The observation period dictates if one can recover the full barriers or only certain pieces. The sampling rate is found to influence the rate of decay for the approximation error significantly: while this rate is polynomial in a fixed-frequency regime, it is exponentially fast given high frequency data. How the environment enters the problem is also found to be regime-dependent. For instance, the curvature of the barriers turns out to be key in the convergence rate for fixed-frequency data but becomes asymptotically irrelevant for the convergence rate in a high-frequency regime; see Theorems 3.1 and 4.3. Two of the parameters used in the results depend on the mixing properties of the process, and their direct dependence on the environment is not necessarily transparent. We hence investigate to what extent the mixing properties can be controlled in terms of geometric and permeability-based quantities in Chapter 5 where we establish worst-case estimates.

The second part of this thesis investigates spectral methods for uncovering structure from data, including clustering algorithms for sequential data and the spectral characterization of networks. The term “spectral” indicates that such methods exploit the eigendecomposition or singular value decomposition of a matrix constructed based on the data. When the data is random to some extent, the constructed matrix will also be so. A fundamental challenge in understanding spectral methods is hence to analyze the properties of random matrices. While the properties of matrices constructed from independent data are well-understood, however, this is not so for matrices constructed from sequential data because dependencies make the mathematical analysis of a system more difficult. Throughout Chapters 6 to 10 we develop new proof techniques applicable to random matrices in the presence of sequential dependencies.

The motivation for Chapters 6 to 9 arises from block Markov chains, a mathematical model for Markov chains with ground-truth clusters in their state space. Understanding the properties of random matrices is here essential for the analysis of spectral clustering algorithms which recover the underlying clusters based on a sequence of observations. Chapter 6 characterizes the limiting singular value distributions in a regime where the sample path is sufficiently long to construct a dense matrix. The analysis of spectral algorithms in particular requires bounds on the distance of the constructed random matrix to its expected value measured in the operator norm. Chapter 7 hence establishes general-purpose matrix concentration results for random matrices generated by Markov chains. Those results are illustrated by example with block Markov chains in Chapter 8 to get a concentration estimate whose leading-order term is asymptotically sharp, and we there also improve the results from Chapter 6 regarding singular value distributions to permit sparse matrices. Aside from our focus on proof-technical challenges coming from the dependence, Chapter 9 also briefly explores the application of clustering methods and our theoretical results to real-world sequential data.

Finally, the results in Chapter 10 are motivated by an open problem in spectral graph theory which asks whether eigenvalues of associated matrices typically provide enough information to characterize a network uniquely. Deterministic sufficient conditions for a graph to be characterized by its generalized spectrum can be stated in terms of the number-theoretic properties of the walk matrix of the graph, but it is not known how often those conditions are satisfied. To make progress towards the latter question, we develop new proof techniques for studying the walk matrix of a random graph, exploiting that the columns of this matrix can be interpreted as a stochastic process in the analysis.

Relative Holonomicity and Estimation of Bernstein-Sato Zero Loci. Masters’ degree obtained Summa cum laude with congratulations of the Board of Examiners, KU Leuven (2021). [Thesis]
▷Summary

This masters’ thesis concerned topics in algebraic geometry surrounding D-modules and Bernstein-Sato ideals. After some preliminaries on derived category theory and spectral sequences, the first two chapters established new estimates for the zero loci of Bernstein-Sato ideals that generalized previous monovariate results from the literature. (These results have since been published in PRIMS; see also arXiv:2111.03334.) The final chapter explored how a notion from from D-module theory known as holonomicity can be used to derive recursions for the moments of a random variable.

Figures

They say a figure speaks thousand words. The following figures illustrate some of my research from the papers and preprints above.

  • A simulation of a Brownian motion with semipermeable barriers. My paper “Recovering semipermeable barriers from reflected Brownian motion” with Jaron Sanders investigated the statistical problem whose goal is to recover the underlying barriers given a finite sequence of samples from the process.
  • My paper “Recovering semipermeable barriers from reflected Brownian motion” with Jaron Sanders investigated the statistical problem of uncovering barriers to a stochastic process. This figure shows some natural barriers which were uncovered by applying the algorithms from that paper to tracking data of reindeer in Svalbard.
  • Spectral graph theory is a powerful mathematical discipline which studies properties of graphs using the eigenvalues of associated matrices. A significant open question asks whether it is typical for spectral information to uniquely characterize the graph. I recently made some progress towards this problem by developing new proof techniques using random matrix theory; see arXiv:2401.12655.
  • A block Markov chain is a Markovian model which comes equipped with a ground-truth notion of clusters in the state space. Some of my work in random matrix theory is motivated by a desire to understand the performance of spectral clustering algorithms whose goal is to uncover the clusters based on an observed trajectory.
  • Clusters detected based on movements of bisons in a field. Note that the borders of the clusters uncover geographical features, such as rivers, in the landscape. The employed clustering algorithm does not have prior access to these features but rather uncovers them based on solely the animal movements.
  • My paper “Singular value distribution of dense random matrices with block Markovian dependence.” with Jaron Sanders investigated a question in random matrix theory: what do the singular values of a random matrix generated by a block Markov chain look like? This figure shows our theoretical predictions as the dark blue line as well as the empirically observed distributions as the histogram.
  • My paper “Estimates for zero loci of Bernstein-Sato ideals” with Nero Budur and Robin van der Veer investigated a topic in algebraic geometry related to invariants of singularities. This figure visualizes our key interest for that paper: the zero locus of a Bernstein-Sato ideal.

Talks, posters, and more

Upcoming talk in the “New Researcher Session” at the IMS annual meeting in Salzburg, Austria. (6-9 July 2026)
Session organizer: Combinatorial aspects of random matrix theory at the IMS annual meeting in Salzburg, Austria. (6-9 July 2026)
Upcoming talk at the BeNeLux mathematical congress. (7-8 April 2026)
Session organizer: Random Matrices and Random Graphs With Arno Kuijlaars, at the BeNeLux mathematical congress. (7-8 April 2026)
Upcoming talk: Random matrices for the working probabilist at the SPOON Seminar of the University of Münster. (22 January 2026)
Upcoming talk: On the spectral characterization of random graphs at the Forschungsseminar Stochastische Geometry und räumliche Statistik of the University of Ulm. (13 January 2026)
Talk: Spectral characterization of graphs, and random abstract-algebraic objects for Oberseminar Stochastik at the University of Münster. (2025)
Talk: Towards generalized spectral determinacy of random graphs at Kolloquium über Kombinatorik (Bielefeld University). (2025)
Talk: How much information is in a network’s spectrum? for the NEO Seminar at Université Côte d’Azur. (2025)
Talk: Universality-based concentration for matrices generated by a Markov chain for Séminaire de Probabilités et Statistique at Labratoire Jean Alexandere Dieudonné. (2025)
Talk: Universality-based concentration for matrices generated by a Markov chain for the DACO Seminar at ETH Zurich. (2025)
Newsletter feature: Making sense of complex processes — Uncovering hidden structure in sequential data by Danai Bouri at Eindhoven University of Technology. (2025) [link]
Poster: Towards generalized spectral determinacy of random graphs at the 22nd INFORMS Applied Probability Society Conference (APS Atlanta). (2025) [pdf]
Talk: Recovering semipermeable barriers from reflected Brownian motion at the 22nd INFORMS Applied Probability Society Conference (APS Atlanta). (2025) [slides]
Poster: Sharp concentration for sums of matrices with Markovian dependence through universality at the 20th Young European Probabilists workshop.(YEPXX, Eurandom). (2025)
Poster: Towards generalized spectral determinacy of random graphs at the 60th Dutch Mathematical Congress. (2025) [pdf]
Talk: Recovering semipermeable barriers from reflected Brownian motion for the Royal Dutch Mathematical Society’s PhD Prize competition. (2025)
Talk: Towards generalized spectral determinacy of random graphs at the Algebraic Graph Theory seminar. (2025) [slides] [YouTube]
Talk: Towards generalized spectral determinacy of random graphs at the Dutch Network Science Symposium. (2024)
Newsletter feature: Visiting ICERM as a Graduate Student Researcher. With Han Le. Published in ICERM’s newsletter. (2024) [link]
Poster: Matrix concentration inequalities with dependent summands and sharp leading-order terms at Bernoulli-IMS 11th World Congress in Probability and Statistics. (2024) [pdf]
Talk: Cokernel statistics of walk matrices: a random matrix approach towards generalized spectral determinacy of random graphs at 6th Dutch-Belgian discrete mathematics seminar. (2024) [slides]
Talk: Cokernel statistics of walk matrices: Towards generalized spectral determinacy of random graphs at TU/e lunch seminar in Combinatorial Optimization. (2024) [slides]
Talk: Matrix concentration inequalities with dependent summands and sharp leading-order terms at SNAPP Seminar. (2023) [slides] [YouTube]
Discussion Paper Contribution: ‘Vintage Factor Analysis with Varimax Performs Statistical Inference’ by Rohe & Zeng published at Journal of the Royal Statistical Society Series B: Statistical Methodology. (2023) [Journal]
Talk: Matrix concentration inequalities with dependent summands and sharp leading-order terms at KU Leuven Classical Analysis Seminar. (2023) [slides]
Talk: Detection and evaluation of clusters within animal movements at Stochastic Models in Life Science (Eurandom). (2023) [slides]
Poster: Universality-based concentration for sums of dependent random matrices at 21st INFORMS Applied Probability Society Conference (APS Nancy). (2023) [pdf] [Best poster award]
Talk: Detection and evaluation of clusters within sequential data at 21st INFORMS Applied Probability Society Conference (APS Nancy). (2023) [slides]
Talk: Matrix concentration inequalities with dependent summands and sharp leading-order terms at 10th High Dimensional Probability conference (Będlewo). (2023) [abstract] [slides]
Talk: Universality-based concentration for sums of dependent random matrices at 9th SOR PhD Colloquium. (2023) [abstract]
Poster: Universality-based concentration for sums of dependent random matrices at 18th Young European Probabilists workshop (YEP XVIII). (2023)
Talk: Clusters within Markov chains: Detection, evaluation, and spectral fingerprints at 14th NETWORKS training week. (2022) [slides]
Talk: Bulk of random matrices generated by Markov chains with community structure at fourth ZiF summer school on randomness in physics and mathematics. (2022) [slides]
Talk: Spectra of random matrices with Markovian dependence and non-constant variance profile at 50th Saint-Flour probability summer school. (2022) [abstract]
Poster: Singular value distribution of random matrices with block Markovian dependence at XVII Brunel-Bielefeld Workshop. (2021) [pdf]
Transdisciplinary experience (as a student): Designing future perspectives for dairy farming in Flanders with Coetzer et. al. Published at Transdisciplinary insights (2019) [Journal]

Contact info:
a.van.werde@uni-muenster.de
Seminarraumzentrum building, Room 304
Orléans-Ring 12, 48149 Münster, Germany

© 2026 Alexander Van Werde