MIF++ is the interdisciplinary seminar organised by the DSTA group
- Vision of the MIF++ seminar : enrich materials science with mathematics.
- Style of MIF++ seminars : who is welcome to participate and contribute.
- MIF++ timetable : every two weeks on average, on zoom from March 2020.
- Annual conference MACSMIN is a focused 1-week version of MIF++ seminars.
Vision of the MIF++ seminar : enrich materials science with mathematics
- The Materials Innovation Factory (MIF) is the £82M research institute, which was officially opened in 2018.
- The vision is to provide collaborative opportunities for colleagues on the interfaces of mathematics and materials.
- Before Covid, the MIF++ seminar was run for 2 hours including a talk combined with lunch and informal discussions.
- The aim is to connect colleagues, who are experts from different areas and would like to start a joint research project.
- Since March 2020, the MIF++ seminar is on zoom for 45-50 min plus 10-15 min for questions, no free lunch online, sorry.
Back to Top of this page | Back to Home page
Style of network seminars and who is welcome to participate and contribute
- An initial presentation could be self-contained and understandable to non-experts, aimed at first year undergraduates.
- An applied expert (in a broad sense including chemistry, physics and structural biology) could explain the basic concepts and clearly state a real problem, where computational methods can potentially help, and describe the state-of-the-art.
- A computational expert (in computer science, mathematics, electrical engineering) could start from the basic concepts and explain their approach on examples, and describe real data that their on which methods guarantee correct results.
- Since September 2021 our group presents regular advances in a new research area of Geometric Data Science.
Back to Top of this page | Back to Home page
Network seminars (in the reverse chronological order) : 2024 | 2023 | 2022 | 2021 | 2020 | 2019
The MIF++ seminar is usually on Mondays at 2 pm UK time. If you would like to join or give a talk, please e-mail. See easier and more educational talks on Data Science at the PhD training sessions joint with the DSTA group seminar.
- 9 December 2024 14-15 UK time.
- Elspeth F. Garman (Oxford University, UK)
- Title. Two different challenges in biological structure determination: radiation damage and metal identification.
- Abstract. Following on from Professor Wlodawer’s 18th November 2024 salutory MIF++ talk on a smorgasboard of infelicities in Protein Data Bank (PDB) structures, in this talk I will describe two additional challenges in macromolecular structure determination. The PDB is a critical resource for researchers worldwide. In 2021, there were on average 1.86 million downloads per day in the US alone which suggests that over 350,000 models downloaded per day may not contain the correct metal.
The first of these challenges is that of radiation damage (RD) artefacts induced by X-rays absorbed by the crystalline sample during the experiment. The diffracted intensity spot fading caused by RD progression during the experiment has long been used as a metric for monitoring the state of both room temperature [1] and cryo-condition [2] samples. Recent developments in micro-electron diffraction (µED) have noted similar pathologies [e.g. 3]. Being aware of this and other RD symptoms is important [4], since even for samples held at 100 K, RD is a limiting problem. It can prevent structure determination and the effects can mislead the experimenter when interpreting the relevant biology of the structure. Pertinent to this challenge, I will describe our single metric, Bnet, by which the level of damage in a single PDB entry from a cryo-cooled crystal (i.e. not accompanied by a dose series) can be assessed [5], and which we have used to assess RD in 93,978 deposited 100 K structures.
Secondly, I will address the issue of accurately identifying metal atoms bound to protein structures. Metalloproteins comprise over one-third of proteins, with approximately half of all enzymes requiring metal to function. Identifying the bound metal and its environment is a prerequisite to understanding biological mechanism. However, there are no routine analysis methods with the sensitivity and quantitative accuracy to do this unambiguously. We have previously developed microProton Induced X-ray Emission (PIXE) as a tool for quantifying metals in liquid and crystalline proteins using the known sulphur content (methionines and cysteines) as an internal standard [6]. We have now automated this method to permit unattended high throughput analysis of many samples, validating the approach by using it to analyse three distinct sets of 30 proteins identified as metalloproteins in the PDB. In all three sets, we found that over half of the metals had been misidentified in the deposited PDB models. Some of the PIXE-detected metals not seen in the models were explainable as artefacts from promiscuous crystallization reagents. For others, using the correct metal improved the structural models and identified new functionality. This has profound implications for those using the models, whose understanding of them may therefore be flawed.
References:
[1] C.C.F. Blake, D.C. Phillips In Proceedings of the Symposium on the Biological Effects of Ionising Radiation at the Molecular Level (Vienna: International Atomic Energy Agency), (1962) pp. 183–191.
[2] A. Gonzales, C. Nave, Acta Cryst. D (1994) 50, 874-877.
[3] J Hattne Methods Mol. Biol. (2021) 2215, 309-319.
[4] EF Garman, M Weik, (2023) Current Opinion Struct. Biol. 2, 102662.
[5] KL Shelley, EF Garman (2022) Nature Communications 13,1314- 1325.
[6] GW Grime, EF Garman (2023) Nuclear Inst. & Methods in Physics Research, B 540, (2023) 237–245.
- 2 December 2024 14-15 UK time.
- Abbie Trewin (Lancaster University, UK)
- Title. Mesoscale artificial synthesis of conjugated microporous polymers [based on this paper]
- Abstract. This work reports the mesoscale artificial synthesis of a conjugated microporous polymer, CMP-1, using a hybrid coarse-grained methodology. Whilst using a coarse grain approach does give a lower density and surface area when compared to the all-atom equivalent, this allowed a simulation cell volume scale-up of up to 64 times, and an overall speed-up factor of 44% when compared to the all-atom equivalent.
- 25 November 2024 14-15 UK time.
- Gregory McColm (University of South Florida, US)
- Title. The Crystal Net Isomorphism Problem.
- Abstract. A covalent crystal may be represented as a graph, with atoms or molecular building blocks as vertices and bonds as edges. Such a graph can be called a crystal net, a crystal structure, or a periodic graph. Two such graphs are often regarded as equivalent if they are isomorphic, so isomorphism is critical to many modern cataloguing systems. While isomorphism is readily established for certain nice classes of crystal nets, the general case is at least as hard as the (finite) Graph Isomorphism Problem, for when no fast algorithm is known. We look at the Crystal Net Isomorphism Problem from the point of view of voltage graphs.
- 18 November 2024 14.00-15.00 UK time. Video (44 min) Slides (81 M)
- Alex Wlodawer (Center for Structural Biology, National Cancer Institute, US)
- Title. How not to publish structural results.
- Abstract. High-resolution macromolecular structures determined using crystallography, NMR, and cryo-EM provide a gold standard for evaluation of important properties of biomolecules, but the quality of some structures, as well as their presentation, is not always fully acceptable. Whereas quality checking tools provided by the PDB during the deposition process may flag some common problems, the resulting red flags are not always addressed by deposition authors. Some journals require that manuscripts be accompanied by validation reports in order to assist reviewers in evaluation of the validity of presented structures, whereas other journals do not have such requirements. Additionally, validation reports are more helpful in identifying global problems, while some local problems may not be apparent. Utilization of additional tools and interactive software might assist readers in making the best use of published structural data. Using examples extracted from the Protein Data Bank, as well as from journal publications, some common problems will be identified and suggestions will be made on how to avoid their re-occurrence.
- 11 November 2024 14.00-15.00 UK time.
- Soledad Villar (Mathematical Institute for Data Science, Johns Hopkins University, US)
- Title. Learning functions on symmetric matrices and point clouds via lightweight invariant features.
- Abstract. In this work, we present a mathematical formulation for machine learning of (1) functions on symmetric matrices that are invariant with respect to the action of permutations by conjugation, and (2) functions on point clouds that are invariant with respect to rotations, reflections, and permutations of the points. To achieve this, we construct O(n^2) invariant features derived from generators for the field of rational functions on n×n symmetric matrices that are invariant under joint permutations of rows and columns. We show that these invariant features can separate all distinct orbits of symmetric matrices except for a measure zero set; such features can be used to universally approximate invariant functions on almost all weighted graphs. For point clouds in a fixed dimension, we prove that the number of invariant features can be reduced, generically without losing expressivity, to O(n), where n is the number of points. We combine these invariant features with DeepSets to learn functions on symmetric matrices and point clouds with varying sizes. We empirically demonstrate the feasibility of our approach on molecule property regression and point cloud distance prediction.
- 28 October 2024 14-15 UK time.
- Sidney Lisanza (Prescient Design and the lab of Nobel laureate David Baker, US).
- Title. Multistate and functional protein design using RoseTTAFold sequence space diffusion (based on this paper)
- Abstract. Protein denoising diffusion probabilistic models are used for the de novo generation of protein backbones but are limited in their ability to guide generation of proteins with sequence-specific attributes and functional properties. To overcome this limitation, we developed ProteinGenerator (PG), a sequence space diffusion model based on RoseTTAFold that simultaneously generates protein sequences and structures. Beginning from a noised sequence representation, PG generates sequence and structure pairs by iterative denoising, guided by desired sequence and structural protein attributes. We designed thermostable proteins with varying amino acid compositions and internal sequence repeats and cage bioactive peptides, such as melittin. By averaging sequence logits between diffusion trajectories with distinct structural constraints, we designed multistate parent–child protein triples in which the same sequence folds to different supersecondary structures when intact in the parent versus split into two child domains. PG design trajectories can be guided by experimental sequence–activity data, providing a general approach for integrated computational and experimental optimization of protein function.
- 7 October 2024 14.00-15.00 UK time.
- Miloslav Torda (Materials Innovation Factory, Liverpool, UK)
- Title. Geometric Modeling in the Prediction of Molecular Crystal Structures: The Close-Packing Principle Revisited.
- Abstract. A. I. Kitaigorodsky’s Close-Packing Principle states that “the mutual arrangement of the molecules in a crystal is always such that the ‘projections’ of one molecule fit into the ‘hollows’ of adjacent molecules,” highlighting the underlying geometries of molecular interactions in crystalline ground states. We revisit this principle to demonstrate that a geometric approach can substantially reduce the computational effort required to predict stable phases of molecular crystals. By leveraging the duality between interaction energy and packing density through geometry-informed programming, we generate first-approximation structures during the initial stages of Crystal Structure Prediction (CSP) within minutes on a standard laptop. Focusing on close packings of volumetric molecular models—where interactions are predominantly van der Waals forces without significant charges or electron exchange—we apply this purely geometric method to various rigid organic molecules. Our results show that integrating the Close-Packing Principle as an inductive bias in CSP workflows significantly reduces computational complexity, leading to more efficient and accurate predictions of molecular crystals.
- 30 September 2024 14-15 UK time. Slides (pdf, 971K)
- Erica L. Flapan (Emerita Professor at Pomona College, US)
- Title. A tile model of entangled proteins (based on this paper).
- Abstract. It has been established that knots, slipknots, and other complex topologies exist in cellular proteins and have been conserved throughout evolution. Yet the global topology of a chain only provides a limited view of its topological entanglement. Even if the chain is unknotted, there can be tangling of sites which may be distant on the amino acid sequence but are twisted together and held in place by molecular forces. We model this type of entanglement by starting with a collection of 1-string and 2- string tiles that are based on local entanglements that have been observed in protein chains. We then combine our tiles according to a well defined set of operations. Using the algebra of tiles and operations we symbolically represent all entanglements that can be built up in this way, and determine which knot types can occur.
- 18 June 2024 13.00-14.00 UK time. Video (52 min)
- Anthony Nixon (School of Mathematics, Lancaster University, UK)
- Title. Quotient graphs of symmetrically rigid frameworks.
- Abstract. A bar-joint framework (G,p) is the combination of a finite simple graph G = (V,E) and a map p assigning positions in Euclidean d-space to the vertices and hence lengths to the edges. The framework is flexible if there exists an edge-length-preserving continuous deformation of the vertices that does not arise from an isometry of the space. If no such deformation exists then the framework is rigid. In recent years many groups have extended rigidity theoretic results to symmetric frameworks motivated, e.g., by applications to crystals. In particular we concentrate on the following question. Given a graph G with a non-trivial automorphism group and a framework (G, p) which is realised with the corresponding symmetry group, does there exist a deformation that preserves the symmetry of the framework? Detailed combinatorial characterisations of symmetric rigidity are known for a range of symmetry groups in the plane. After giving a gentle introduction to the general subject of rigidity I will report on joint work with Dewar, Grasegger and Kastis where we provide a variety of combinatorial and probabilistic results for symmetric frameworks in d-dimensions.
- 11 June 2024 15.00-16.00 UK time. Video (51 min)
- Stephen Power (School of Mathematics, Lancaster University, UK)
- Title. The mathematics of crystal flexes and vibrations.
- Abstract. A crystal framework is a periodic bar-joint framework in Euclidean space. A number of characterisations of their rigidity, flexibility and zero modes (rigid unit modes of oscillation) have been obtained by mathematicians in the last fifteen years or so. Also, in materials science, over a much longer period, there have been extensive studies of stability, zero modes and phase transitions. Most of these accounts assume some form of periodic boundary conditions in modelling the dynamics. I will outline some of the mathematical results, especially the connection between the zero modes, first-order flexes, the rigidity matrix and the transfer function (or symbol function). It is the transfer function that determines the RUM spectrum (of interest in materials science) as well as a more general geometric spectrum that is of significance for surface modes. I shall also discuss necessary and sufficient conditions obtained for "absolute" first order rigidity, for which no boundary conditions are assumed (joint work with E. Kastis).
- 4 June 2024 14-15 UK time. Video (41 min)
- Gregory McColm (University of South Florida, US)
- Title. Voltage Graphs and their Realizations.
- Abstract. A periodic graph may be conveniently packaged as a labeled finite “voltage graph”, from which the original periodic graph may be derived. But this representation was not possible when dealing with a crystal structure and a group that did not act freely on that structure. A recent generalization of voltage graphs now enables a voltage graph representation of any crystal structure and any crystallographic group acting on that structure. We outline the older voltage graph construction and the recent generalization and explore the problem of realizing them in Euclidean space.
- 14 May 2024 14.00-15.00 UK time.
- Jonathan Balasingham (Materials Innovation Factory, University of Liverpool, UK)
- Title. Applications of Isometry Invariants on Material Property Prediction.
- Abstract. Periodic material or crystal property prediction using machine learning has grown popular in recent years as it provides a computationally efficient replacement for classical simulation methods. A crucial first step for any of these algorithms is the representation used for a periodic crystal. While similar objects like molecules and proteins have a finite number of atoms and their representation can be built based upon a finite point cloud interpretation, periodic crystals are unbounded in size, making their representation more challenging. Isometry invariants offer a consistent way to represent and compare crystal structures. In the present work, we adapt two isometry invariants, the Pointwise Distance Distribution (PDD) and Average Minimum Distance (AMD) as a representation for our learning algorithm. The PDD and AMD distinguished all (more than 660 thousand) periodic crystals in the Cambridge Structural Database as purely periodic sets of points without atomic types. For machine learning models, the composition of a crystal contains crucial information needed for making accurate predictions. We introduce a method for combining the geometric information contained in the isometry invariants with compositional information and show its effectiveness on the commonly used Jarvis-DFT and Materials Project datasets.
- 30 April 2024 14.00-15.00 UK time. Video (52 min)
- Philip Smith (Henry Royce Institute, University of Manchester, UK)
- Title. A data-driven analysis of HDPE post-consumer recyclate for sustainable bottle packaging [based on the RCR paper].
- Abstract. The packaging industry faces mounting demand to integrate post-consumer recyclate (PCR). However, the complex structure-property relationships of PCRs often obscure their performance compared to virgin equivalents, posing challenges in selecting suitable PCRs for applications. Focused on extrusion blow moulding grade high-density polyethylene (HDPE), this study presents the most extensive characterisation of HDPE PCR to date, encompassing 23 resins (3 virgin, 20 PCR). Employing Fourier-transform infrared spectroscopy (FTIR), differential scanning calorimetry (DSC), thermogravimetric analysis (TGA), rheology, colour analysis, and mechanical testing, we established a feature-rich dataset with 56 distinctive characteristics. Utilising a data science approach based on principal component analysis, with the virgin samples as a benchmark, we identified that combining FTIR, TGA and mechanical testing provided effective identification of PCRs that closely match the properties of virgin HDPE. The pipeline created can be utilised for new PCRs to determine suitability as a replacement for virgin plastic in a desired application.
- 25 March 2024 14.00-15.00 UK time. Video (26 min)
- Daniel Widdowson (Materials Innovation Factory, University of Liverpool, UK)
- Title. Ultra-fast detection of (near-)duplicate structures across major crystal databases.
- Abstract. The Cambridge Structural Database (CSD) and the
Crystallography Open Database (COD) contain thousands of structures.
Large portions of these databases overlap, often because their entries
originate from the same publication. Constructing a list of CSD-COD
cross entries is difficult because data can be reformatted in the
curation process, many data points do not reliably distinguish or
identify crystals, and most ways of finding matches are slow,
requiring the order of 10^10 comparisons. Using geometry-based
structural invariants (PDD [1]: Pointwise Distance Distribution and
its simplified version AMD [2]: Average Minimum Distance), we compared
all-vs-all entries in the CSD and COD, discovering the extent of their
overlap for the first time. Using these invariants, we compared
1,214,848 entries from the CSD against 508,392 from COD, taking only
17 minutes on a typical desktop computer. Over 400,000 crystals were
matched, an overlap of 33% of the CSD and 80% of COD. We also found a
significant overlap of COD with the ICSD (over 50,000 entries), as
well as at least minor overlaps with the Materials Project database.
We additionally searched for duplicate entries, finding several
thousand in both the CSD and COD, many of which are not listed
anywhere as known duplicates.
[1] Neural Information Processing Systems 35, 24625-24638 (2022).
[2] MATCH Communications Math. Computer Chemistry 87, 529-559 (2022).
- 19 March 2024 14.00-15.00 UK time.
- Kirill Cherednichenko (University of Bath, UK)
- Title. Operator-norm homogenisation for Maxwell equations on periodic singular structures.
- Abstract. I will discuss a new approach to obtaining uniform operator asymptotic estimates in periodic homogenisation. Based on a novel uniform Poincaré-type inequality, it bears similarities to the techniques I developed with Cooper (ARMA, 2016) and Velčić (JLMS, 2022). In the context of the Maxwell system, the analytic framework I will present leads to a new representation for the asymptotics obtained by Birman and Suslina in 2007 for the full system and by Suslina in 2004 for the electric field in the presence of currents. As part of the new asymptotic construction, I will link the leading-order approximation to a family of ``homogenised" problems, which was not possible using the earlier method. The analysis presented applies to a class of inhomogeneous structures modelled by arbitrary periodic Borel measures. However, the results are new even for the particular case of the Lebesgue measure. This is joint work with Serena D'Onofrio.
- 26 February 2024 14.00-15.00 UK time.
- Laurent Younes (Johns Hopkins University, US)
- Title. Incorporating growth models in Riemannian shape spaces.
- Abstract. We review and introduce some Riemannian metrics and evolution equations in the context of diffeomorphic shape analysis. After a short discussion of various approaches at building Riemannian metrics on shape spaces (with a special focus on the foundations of the large deformation diffeomorphic metric mapping algorithm), we introduce elastic metrics and growth models that can be derived from them. In the latter context, a new class of metrics, involving an infinitesimal growth tensor, is considered and some of its properties are studied.
- 20 February 2024 14.00-15.00 UK time.
- Borjan Geshkovski (INRIA, Sorbonne Université, France)
- Title. A mathematical perspective on Transformers.
- Abstract. Transformers play a central role in the inner workings of large language models. We develop a mathematical framework for analyzing Transformers based on their interpretation as interacting particle systems, which reveals that clusters emerge in long time. Our study explores the underlying theory and offers new perspectives for mathematicians as well as computer scientists.
- 13 February 2024 15.00-16.00 UK time (9 am Minnesota time). Video (57 min)
- Peter J. Olver (School of Mathematics, University of Minnesota, US)
- Title. Reassembly of broken objects.
- Abstract. The problem of reassembling and analyzing broken objects appears in a broad range of applications, including jigsaw puzzle assembly, paleontology (broken fossils and egg shells), archaeology (broken pots and statues), surgery (broken bones and reassembly of histological sections), and anthropology (broken bones and lithics). I will discuss recent progress on such problems based on new and classical mathematical tools, including differential geometry, invariant signatures, integral invariants, and machine learning.
- 6 February 2024 15.00-16.00 UK time (9 am Chicago time). Video (57 min)
- YoonHaeng Hur (Department of Statistics, University of Chicago, US)
- Title. Matching via Distance Profiles.
- Abstract. In this paper, we introduce and study matching methods based on distance profiles. For the matching of point clouds, the proposed method is easily implementable by solving a linear program, circumventing the computational obstacles of quadratic matching. Also, we propose and analyze a flexible way to execute location-to-location matching using distance profiles. Moreover, we provide a statistical estimation error analysis in the context of location-to-location matching using empirical process theory. Furthermore, we apply our method to a certain model and show its noise stability by characterizing conditions on the noise level for the matching to be successful. Lastly, we demonstrate the performance of the proposed method and compare it with some existing methods using synthetic and real data.
- 23 January 2024 14.00-15.00 UK time. Video (67 min) Slides
- Daniel Huson (Department of Computer Science, University of Tübingen, Germany)
- Title. A galaxy of periodic tilings.
- Abstract. This talk has four parts. In the first part - topology -, we recall the classification of two-dimensional manifolds and orbifolds, and introduce their Conway-Macbeath notation. In the second part - geometry -, we discuss two-dimensional periodic tilings and their classification. In the third part - combinatorics -, we introduce Delaney-Dress symbols and indicate how they can be used to systematically enumerate periodic tilings. Finally, in the fourth part - algorithms and software, we present Tegula (http://tegula.husonlab.org), a computer program that can be used the explore the “galaxy” of all 2.4 billion periodic tilings of the euclidean plane, sphere and hyperbolic plane, of low Dress-complexity. This is joint work with Olaf Delgado and Rüdiger Zeller (https://doi.org/10.1016/j.cagd.2021.102027).
Back to All seminars | Back to Top of this page | Back to Home page
- 12 December 2023 14.00-15.00 UK time. Video (67 min)
- Alexey Garber (Mathematical and Statistical Sciences, University of Texas Rio Grande Valley, US)
- Title. On the metric spaces of lattices and periodic point sets.
- Abstract. In the talk I will describe some properties of the family of discrete point sets in R^d as the metric space equipped with the bottleneck distance. Mainly, I will provide different conditions, under which the spaces of lattices or periodic point sets are Lipschitz or coarsely (non)embeddable into Hilbert space. The various conditions are phrased in terms of density, packing radius, covering radius, the cardinality of the motif, and the diameter of the unit cell. While the main motivation will be to provide some structure for the smaller family of lattices and periodic point sets, I will also review some known results for quasiperiodic point sets and connections to geometry of numbers, and aperiodic order. The talk is based on a joint work with Žiga Virk and Nicolò Zava.
- 5 December 2023 16.00-17.00 UK time (9-10 am Utah time).
- Yekaterina Epshteyn (Department of Mathematics, University of Utah, US)
- Title. Grain Boundaries in Polycrystals: Modeling, Analysis and Simulation.
- Abstract. Many technologically useful materials are polycrystals composed of small monocrystalline grains that are separated by grain boundaries of crystallites with different lattice orientations. One of the central problems in materials science is to design technologies capable of producing an arrangement of grains that delivers a desired set of material properties. A method by which the grain structure can be engineered in polycrystalline materials is through grain growth (coarsening) of a starting structure. Grain growth in polycrystals is a very complex multiscale multiphysics process. It can be regarded as the anisotropic evolution of a large cellular network and can be described by a set of deterministic local evolution laws for the growth of an individual grain combined with stochastic models for the interaction between them. In this talk, we will present new perspectives on modeling, simulation, and analysis of the evolution of the grain boundary network in polycrystalline materials. Relevant recent experiments will be discussed as well.
- 28 November 2023 17.00-18.00 UK time (9-10 am Berkeley time). Video (51 min)
- David Prendergast (Molecular Foundry, Lawrence Berkeley lab, US)
- Title. Unsupervised learning of representative local atomic arrangements in molecular dynamics data.
- Abstract. Molecular dynamics (MD) simulations present a data-mining challenge, given that they can generate a considerable amount of data but often rely on limited or biased human interpretation to examine their information content. By not asking the right questions of MD data we may miss critical information hidden within it. We combine dimensionality reduction (UMAP) and unsupervised hierarchical clustering (HDBSCAN) to quantitatively characterize the coordination environment of chemical species within MD data. By focusing on local coordination, we significantly reduce the amount of data to be analyzed by extracting all distinct molecular formulas within a given coordination sphere. We then efficiently combine UMAP and HDBSCAN with alignment or shape-matching algorithms to classify these formulas into distinct structural isomer families. The outcome is a quantitative mapping of the multiple coordination environments present in the MD data. The method was employed to reveal details of cation coordination in electrolytes based on molecular liquids.
- 21 November 2023 14.00-15.00 UK time. Video (53 min)
- Daniel Rigden (Institute of Systems, Molecular and Integrative Biology, University of Liverpool, UK).
- Title. Tertiary structure assessment at CASP15.
- Abstract. The results of tertiary structure assessment at CASP15 are reported. For the first time, recognizing the outstanding performance of AlphaFold 2 (AF2) at CASP14, all single-chain predictions were assessed together, irrespective of whether a template was available. At CASP15, there was no single stand-out group, with most of the best-scoring groups—led by PEZYFoldings, UM-TBM, and Yang Server—employing AF2 in one way or another. Many top groups paid special attention to generating deep Multiple Sequence Alignments (MSAs) and testing variant MSAs, thereby allowing them to successfully address some of the hardest targets. Such difficult targets, as well as lacking templates, were typically proteins with few homologues. Local divergence between prediction and target correlated with localization at crystal lattice or chain interfaces, and with regions exhibiting high B-factor factors in crystal structure targets, and should not necessarily be considered as representing error in the prediction. However, analysis of exposed and buried side chain accuracy showed room for improvement even in the latter. Nevertheless, a majority of groups produced high-quality predictions for most targets, which are valuable for experimental structure determination, functional analysis, and many other tasks across biology. These include those applying methods similar to those used to generate major resources such as the AlphaFold Protein Structure Database and the ESM Metagenomic atlas: the confidence estimates of the former were also notably accurate.
- 14 November 2023 14.00-15.00 UK time. Video (28 min)
- Flaviu Cipcigan (IBM Research, Hartree Centre, Daresbury, UK).
- Title. Discovery of Novel Reticular Materials for Carbon Dioxide Capture using GFlowNets.
- Abstract. Artificial intelligence holds promise to improve materials discovery. GFlowNets are an emerging deep learning algorithm with many applications in AI-assisted discovery. By using GFlowNets, we generate porous reticular materials, such as metal organic frameworks and covalent organic frameworks, for applications in carbon dioxide capture. We introduce a new Python package (matgfn) to train and sample GFlowNets. We use matgfn to generate the matgfn-rm dataset of novel and diverse reticular materials with gravimetric surface area above 5000 m2/g. We calculate single- and two-component gas adsorption isotherms for the top-100 candidates in matgfn-rm. These candidates are novel compared to the state-of-art ARC-MOF dataset and rank in the 90th percentile in terms of working capacity compared to the CoRE2019 dataset. We discover 15 materials outperforming all materials in CoRE2019.
- 7 November 2023 14.00-15.00 UK time. Video (39 min)
- Alexandra Volokhova (Mila AI Quebec Institute, Canada).
- Title. Crystal-GFN: sampling crystals with desirable properties and constraints.
- Abstract. Accelerating material discovery holds the potential to greatly help mitigate the climate crisis. Discovering new solid-state crystals such as electrocatalysts, ionic conductors or photovoltaics can have a crucial impact, for instance, in improving the efficiency of renewable energy production and storage. In this paper, we introduce CrystalGFlowNet, a generative model of crystal structures that sequentially samples a crystal’s composition, space group and lattice parameters. This domain-inspired approach enables the flexible incorporation of physical and geometrical constraints, as well as the use of any available predictive model of a desired property as an objective function. We evaluate the capabilities of Crystal-GFlowNet by using as objective the formation energy of a crystal structure, as predicted by a new proxy model trained on MatBench. The results demonstrate that Crystal-GFlowNet is able to sample diverse crystals with low formation energy.
- 31 October 2023 14.00-15.00 UK time. Video (55 min)
- Ruslan Davidchack (Computing and Mathematical Sciences, University of Leicester, UK).
- Title. Excess chemical potential and surface free energy of a hard-sphere fluid at curved walls: deviations from Morphometric Thermodynamics.
- Abstract. Excess chemical potential (or solvation-free energy) is an important thermodynamic quantity that characterises the properties of fluids in contact with surfaces and the behaviour of fluid mixtures. General theoretical analysis of this quantity is rather complicated and available only for simple systems (like hard-sphere fluids) in various approximations (Perkus-Yevick integral equations, Scaled Particle Theory, density-functional theory). A significant simplification can be obtained under the assumptions of the so-called Morphometric Thermodynamics (MT), based on Hadwiger's theorem from integral geometry. We use a combination of Gibbs-Cahn integration and Widom's particle insertion methods to obtain high-precision measurements of the excess chemical potential and surface-free energy. With these results, we are able to observe deviations from the MT expressions and thus test the extent of applicability of the MT assumptions.
- 17 October 2023 14.00-15.00 UK time. Video (57 min)
- Andreas Alpers (Mathematical Sciences, University of Liverpool, UK).
- Title. Optimal Diagram Representations of Polycrystalline Microstructures.
- Abstract. Voronoi diagrams and their generalizations, particularly so-called generalized balanced power diagrams (GBPDs), are increasingly used to represent grain shapes of polycrystalline materials at the microstructural scale. These diagrams can be described by few parameters and, in principle, only these parameters need to be recovered from tomographic measurements. Experimentalists have reported, however, that some parameters seem more important than others for describing certain families of materials. In this talk, I will focus on this finding and will provide some theoretical explanations within the framework of statistical learning theory. I will also discuss how the task of finding an optimal set of parameters can be formulated and solved via a support vector machine approach. (The talk is based on recent joint work with P. Gritzmann, M. Fiedler, and F. Klemm.)
- 10 October 2023 14.00-15.00 UK time. Video (47 min)
- Leonid Polterovich (Mathematical Sciences, Tel Aviv University, Israel).
- Title. Oscillations and topology.
- Abstract. I'll discuss an approach to studying oscillations of functions based on ideas of topological data analysis. Applications include generalizations of two classical results, Courant's nodal domain theorem in spectral geometry and Bezout's theorem in algebraic geometry. Joint with Lev Buhovsky, Jordan Payette, Iosif Polterovich, Egor Shelukhin, and Vukašin Stojisavljević.
- 28 September 2023 14.00-15.00 UK time.
- Saulius Gražulis (Vilnus University Institute of Biotechnology, Lithuania).
- Title. Open Crystallographic Databases: COD, TCOD, and the sisters.
- Abstract. Crystallographic data are of great value for modern science. Indeed, any observed properties of the matter must be compatible with atomic positions determined by crystallography. With the advent of the Internet, unobstructed access to data has become a must, and open databases are being created in all branches of science. Crystallography Open Database (COD, https://crystallography.net) is currently the world's largest open-access database of experimental chemical crystallography results. Its sister databases Theoretical COD (TCOD) and Predicted COD (PCOD) publish results of corresponding theoretical computations in a compatible form. In my talk, I would like to present the scope, organisation principles, and possible application areas of these databases.
- 14 September 2023 14.00-15.00 UK time. Video (55 min) Slides
- Alex Hernandez-Garcia (Mila AI Quebec Institute, Canada).
- Title. Multi-fidelity active learning with GFlowNets for drug and materials discovery.
- Abstract. In recent years, machine learning has been increasingly adopted by researchers from different disciplines as a tool to accelerate the pace to scientific discoveries. However, many relevant scientific problems present challenges where current machine learning methods cannot yet efficiently leverage the available data and resources. For example, such tasks involve exploring very large, high-dimensional spaces, where querying a high fidelity, black-box objective function is very expensive. Progress in machine learning methods that can efficiently tackle such problems would help accelerate currently crucial areas such as drug and materials discovery. In this talk, I will present our work on the use of generative flow networks (GFlowNets) for multi-fidelity active learning, where multiple approximations of the black-box function are available at lower fidelity and cost. GFlowNets are a recently proposed framework for amortised probabilistic inference that have proven efficient for exploring large, high-dimensional spaces and can hence be practical in the multi-fidelity setting too. I will provide a gentle introduction to GFlowNets, describe our algorithm for multi-fidelity active learning with GFlowNets and present the results on both well-studied synthetic tasks and practically relevant applications of molecular discovery. Finally, I will discuss future directions in materials discovery applications where we plan to use multi-fidelity active learning with GFlowNets.
- 18 August 2023 14.00-15.00 UK time. Video (29 min)
- Daniel Widdowson (Materials Innovation Factory, University of Liverpool, UK).
- Title. Resolving the data ambiguity for periodic crystals and detecting (near-)duplicate structures (practice for IUCr 2023).
- Abstract. The most common definition of a crystal using a motif and unit cell is ambiguous since any crystal can be described by many different choices of motif and cell. We introduce our work on crystal descriptors which are ‘isometry invariants’ meaning any choice of motif and cell gives the same result. These invariants have mathematical properties that allow us to talk about individual crystals as being points in a ‘crystal space’ where the invariant acts as a coordinate, and geometrically similar crystals can be measured as such by simply comparing their coordinates. Fast comparisons between these invariants have enabled us to compare all crystals in the Cambridge Structural Database (CSD), Crystallography Open Database and Materials Project database to find geometrically identical or similar crystals. At least five examples in the CSD turned out to be worthy of investigation as data was potentially duplicated, but modified to avoid detection.
- 17 August 2023 14.00-15.00 UK time.
- Matthew Bright (Materials Innovation Factory, University of Liverpool, UK).
- Title. Continuous Maps of 2D Lattices (practice for the contributed 15-min talk at A023 IUCr 2023). Video (18 min)
- Abstract. Given the massive growth in the number of periodic structures deposited in research databases, and the ability of Crystal Structure Prediction to generate hundreds of thousands of structures, many of which will be near-identical, it is becoming important to have a classification approach for structures that is finer-grained than the finite number of discrete symmetry groups into which they fall, and which allows for fast numerical comparison. We have approached the problem from the ‘bottom up’, by making a rigorous analysis of the very simplest non-trivial periodic structure - the two dimensional lattice - in a way that we hope to extend to more complex structures. The result is the Root Invariant. from which we derive the scale-agnostic Projected Invariant, both with orientation aware versions that distinguish lattices related by a reflection) - an easily computable, complete isometry invariant which can be modified to differentiate lattices related by a reflection and from which a lattice can be uniquely reconstructed up to isometry (or rigid motion, or similarity). In this talk we will give a brief overview of the theory behind the invariant and use it to display a map of 2D lattices derived from a large database of crystal structures, showing that modulo physical constraints real world geometric structures continuously occupy the space of all possible lattices.
- Title. Continuous Chiral Distances on 2D Lattices (practice for the invited 25-min talk at A065 IUCr 2023). Video (23 min)
- Abstract. Useful properties of many periodic materials can arise from asymmetry - that is, how far deformed those structures are from one with higher symmetry. We are therefore interested in developing some numerical quantifier of this deformation for periodic structures, beginning with two dimensional lattices and rigorously developing this simple but non-trivial case in a way that is applicable to higher dimensional structures. In previous publications (and in a talk at A023) we have developed complete invariants of two dimensional lattices which have the crucial property of changing continuously - that is, a small perturbation of the lattice leads to a small perturbation of the invariant. We use this invariant to develop a true metric between any pair of lattices and show how this can be used to define an easily computable family of chiral distances that measure how close any given lattice is to having a particular Bravais system. We apply this measure initially to a very large database of 2D lattices generated from 3D crystal structures, and more practically to public databases of actual and theoretical 2D monolayer structures to observe their asymmetry when part of and separate from their parent material, and also to isolate candidate materials from which we might synthesise monolayers with highly asymmetric lattice geometries.
- 14 June 2023 10.00-11.00 UK time.
- Bartosz Naskręcki (Adam Mickiewicz University, Poland)
- Title. Crystallographic growth functions.
- Abstract. In the sequel of the previous talk, I will discuss with examples the notion of crystallographic growth functions. Building on the previously defined topological ones we will discuss the benefit of using these more elaborate functions. In particular, I will show the 2D version, which we pictorially represent with the so-called orphic diagrams. I will also make a demo of our Python script that generates orphic diagrams.
- 9 June 2023 14.00-15.00 UK time.
- Bartosz Naskręcki (Adam Mickiewicz University, Poland)
- Title. Growth functions in crystallography.
- Abstract. The topic of this talk is concerned with the discussion of counting the number of cells in filtrations of infinite CW-complexes. This has applications in crystallography and is related to the earlier notion of coordination sequences. I will present some results related to mathematical underpinnings of these notions and present some new results in the context of counting cells against the growing reference frame. In the final part I will sketch a new idea of using Ehrhardt polynomials to explain some symmetry phenomena related to the growth functions.
- 21 April 2023 14.00-15.00 UK time. Video (44 min)
- Ezra Peisach (Rutgers University, US)
- Title. The wwPDB Validation Process.
- Abstract. The Protein Data Bank (PDB) was founded in 1971 and is the world wide repository of over 200,000 macro-molecular structures of proteins, nucleic acids and their complexes with small ligands and peptides. The PDB is managed by the World Wide Protein Data Bank (wwPDB). In order to aid depositors, reviewers and consumers of biological structural data in the archive, the wwPDB has established a validation pipeline for all structures. This talk will cover a brief presentation on X-ray structure refinement followed by the wwPDB validation pipeline to show how the validation process is of use to depositors and end users alike.
- 31 March 2023 14.00-15.00 UK time. Video (58 min)
- Matthew Bright (Materials Innovation Factory, University of Liverpool, UK)
- Title. Three Applications of 2D Lattice Invariants.
- Abstract.
The first application will be a deeper investigation of our first map - that of 2.7 million lattices derived from 3D structures in the CSD. We will
discuss the key structural features of the complete map, before demonstrating how invariants can be used to investigate the impact of
chemical properties such as molecular weight on lattice symmetry.
The second will be an application to the geometry of 2D Materials. Since the isolation of graphene, the number of both actual and theoretical 2D materials in the literature has been rapidly growing. Materials with a strongly asymmetric lattice are of interest to chemists, since they are likely to have anisotropic properties - conductivity, for example, may be higher in one direction and lower in another.
We will discuss the available data on these materials, and demonstrate the application of chirality distances in two cases: 2DMatPedia - one of the largest publicly available databases of materials [1] - and the smaller 2D Materials Database [2]. The use of chirality distances allows us to seek out potentially highly asymmetric structures, and to investigate changes in the symmetry of potential 2D materials as layers are extracted from the bulk crystal.
Finally, we consider the more theoretical topic of ‘random lattices’. Randomly selecting from the space of all possible lattices (at a given fixed scale) is equivalent to defining a ‘uniform’ probability measure on the space of all lattices modulo isometry and basis change.
The work of Prof Jens Marklof et al. [3] on the explicit definition of this measure can be used to implement random lattice generation. An interesting question is whether the distribution of naturally occurring 2D lattices in chemistry is similar to this ‘Haar Random Lattice’ distribution - that is, does nature explore all of ‘lattice space’? We might expect this not to be the case, but our early investigations have shown surprising similarities between the two distributions.
[1] J.Zhou et al, ‘2DMatPedia, an open computational database of two-dimensional materials from top-down and bottom-up approaches’ Scientific Data 86, 669 (2019).
[2] N.Mounet et al, ‘Two-dimensional materials from high-throughput computational exfoliation of experimentally known compounds’. Nature Nanotechnology 13, 246 (2018).
[3] J.Marklof. ‘Random lattices in the wild: from Polya’s orchard to quantum oscillators’. LMS newsletter 493 43-50 (2021).
- 24 March 2023 14.00-15.00 UK time. Video (60 min)
- Matthew Bright (Materials Innovation Factory, University of Liverpool, UK)
- Title. Theory of 2D Lattice Invariants.
- Abstract.
The central problem for our group is the representation of periodic structures as points in some continuous space, in the sense
that a small perturbation of the structure itself gives rise to a small change in the position of the point which represents.
Our solution [1] to this problem for a simple periodic structure - the 2D Lattice - allows for many different kinds of representation, all topologically equivalent to a punctured sphere. The solution gives rise to a very natural quantification of the ‘asymmetry’ of a lattice - that is, the distance through which it would need to be distorted such that it became a lattice with a particular symmetry group. We will briefly recap the development of 2D Lattice Invariants and the associated chiral distances [2].
[1] V.Kurlin. Mathematics of 2D lattices. Foundations of Computational Mathematics.
[2] M.Bright et al. Geographic-style maps for 2D lattices. Acta Cryst A v.79 (2023), p.1-13.
- 15 March 2023 13.00-14.00 UK time (in person at the Chemistry department).
- Nicholas Kotov (University of Michigan, US)
- Title. Chirality-Complexity Relations and Graph Theory of Nanostructures.
- Abstract. Since Leonardo Da Vinci discoveries in science and engineering were inspired by evolution-optimized geometry of molecules, tissues, and organisms found in biology using non-biological preparatory techniques. Chiral nanostructures – a large and rapidly evolving class of metal, semiconductor, and ceramic materials is one of these materials. Besides fascinating optical, catalytic, and biological properties, the studies of chiral nanostructures revealed something more. Unlike other geometric properties, mirror asymmetry is invariant to scales. Thus, the synthesis and self-assembly of chiral nanostructures showed how basic geometric properties of Nature’s smallest building blocks can produce highly complex and adaptable structures at macroscale.
Analysis of the hierarchically organized micro- and macrostructures obtained by self-assembly of the chiral nanoparticles (NPs) demonstrated the mechanism of emergence of effective complexity in such systems and how such diversity of the building blocks contributes to it. These findings became possible by applying graph theory (GT) for calculation of the quantitative measures of their complexity by describing the constituent NPs as nodes and the interfaces between them as edges of graphs. Taking an example of hierarchically organized particles with twisted spikes from polydisperse Au-cystein nanoplatelets [1], we found that (a) formation of complex structures does not require monodispersity; (b) complexity index (CI) of the synthetic particles can be higher than biological prototypes; and (c) complexity emerges from competing chirality-dependent assembly restrictions. The GT description of chiral hedgehogs can also be expanded to other nanoscale structures creating analogs of chemical formulas for particle systems [2]. Among other outcomes of the analysis of the chirality-complexity relations, GT-based description of nanostructures leads to quantitative description of biomimetic materials combining order and disorder that is essential to their functionality. Expansion of GT principles from particles to composites enabled transition from inexact approach of their good-luck-based engineering to function-driven design encompassing multiple properties. While this work is still in progress, the methods of GT-based biomimetic materials engineering can be demonstrated by the multiparameter optimization of complex networks of aramid nanofibers for batteries for robotics [3] and biomedical implants [4]. Full abstract with images in pdf.
References
[1] W. Jiang, et al, Emergence of Complexity in Hierarchically Organized Chiral Particles, Science, 2020, 368, 6491, 642.
[2] S. Zhou, et al, Chiral assemblies of pinwheel superlattices on substrates, Nature, 2022, 612, 259.
[3] Wang, M.; Vecchio, D.; et al Biomorphic Structural Batteries for Robotics. Sci. Robot. 2020, 5 (45), eaba1912.
[4] H. Zhang, et al Graph Theoretical Design of Biomimetic Aramid Nanofiber Nanocomposites as Insulation Coatings for Implantable Bioelectronics, MRS Bulletin, 2021, 46, 7, 576.
- 10 March 2023 14.00-15.00 UK time.
- Luigi Vitagliano (Institute of Biostructures and Bioimaging, Italian National Research Council)
- Title. The fine structure of proteins: detection, characterization and implications. Part 2.
- Abstract. Continuation of the talk on 24th February.
- 24 February 2023 14.00-15.00 UK time.
- Luigi Vitagliano (Institute of Biostructures and Bioimaging, Italian National Research Council)
- Title. The fine structure of proteins: detection, characterization and implications. Part 1.
- Abstract. Proteins combine an extraordinary structural complexity with fine structural regulations that operates in response to environmental stimuli. Indeed, fundamental protein activities often rely on extremely subtle structural features that are frequently dictated by physical forces of limited strengths. Therefore, the elucidation of the basis of protein functionalities frequently requires accurate determinations of their intricate structures. Exploiting the impressive amount of data reported in structural databases (PDB and CSD), we have highlighted the interplay occurring between the geometry of the peptide bond, expressed in terms bond lengths, valence bond and plane distortions, and the local conformation of the polypeptide chains. The elucidation of this correlation has interesting implications (a) for protein structure validation and quality assessment, (b) for the definition of the structural bases of amino-acid conformational preferences, (c) for the identification of secondary forces impacting protein structures, and for the evaluation of force fields commonly used in molecular modelling and dynamics.
- 20 February 2023 (unusual day and time) 11.00-13.00 UK time.
- Nadav Dym (Technion's Faculty of Mathematics, Israel)
- Title. Efficient Invariant Embeddings for 3D point sets.
- Abstract.
In many machine learning tasks, the goal is to learn an unknown function which has some known group symmetries. Equivariant machine learning algorithms exploit this by devising architectures (=function spaces) which have these symmetries by construction. Especially relevant examples for chemistry based applications are neural networks for graphs or sets which respect their permutation symmetries, or neural networks for 3D point sets which additionally respect Euclidean symmetries.
A common theoretical requirement of symmetry based architecture is that they will be able to separate any two objects which are not related by a group symmetry (this property can be used to prove stronger universality results which we will shortly describe). We will review results showing that under very general assumptions such a symmetry preserving separating mapping f exists, and the embedding dimension m can be taken to be roughly twice the dimension of the data. We will then propose a general methodology for efficient computation of such f using random invariants. This methodology is a generalization of the algebraic geometry argument used for the well known proof of phase retrieval injectivity. We will show several applications of this result, and in particular explain how this can be combined with the results in Kurlin's paper (arxiv:2207.08502) to achieve (relatively) efficient separating mappings for 3D point clouds.
Based on work with Steven J. Gortler, Snir Hordan and Tal Amir and on the papers "Low Dimensional Invariant Embeddings for Universal Geometric Learning" (arxiv:2205.02956) by Nadav Dym and Steven J. Gortler, and "Complete Neural Networks for Euclidean Graphs" (arxiv:2301.13821) by Snir Hordan, Tal Amir, Steven J. Gortler and Nadav Dym.
- 17 February 2023 14.00-15.00 UK time. Video (37 min)
- Jonathan Balasingham (Liverpool Computer Science department, UK)
- Title. Graph representation of crystals using Pointwise Distance Distributions.
- Abstract. Use of graphs to represent crystal structures has become popular in recent years as they provide a natural translation from atoms and bonds to nodes and edges. Graphs capture structure, while remaining invariant to the symmetries that crystals display. Several works in property prediction, including those with state-of-the-art results, make use of the Crystal Graph. The present work offers a graph based on Pointwise Distance Distributions which retains symmetrical invariance, decreases computational load, and yields similar or better prediction accuracy on both experimental and simulated crystals.
- 10 February 2023 14.00-15.00 UK time. Video (39 min)
- Danny Ritchie (Materials Innovation Factory, University of Liverpool, UK)
- Title. Exploiting Powder X-ray Diffraction techniques for automated phase isolation in the laboratory.
- Abstract. A chemical composition consisting of n elements can be represented as Euclidean, where the components give the relative amount of each element. Consider an experimentally prepared composition s that has been heated such that it reaches thermodynamic equilibrium. At equilibrium it will decompose into a set of phases {p}. A ubiquitous method for the identification of new phases is to trial an arbitrary s in the hope that a p is formed which is new. The problem that this work aims to address is how to find the composition of the new p. The {p} are separated into the new phase, denoted u, and the known phases, denoted {k}. Powder X-Ray Diffraction (PXRD) is an experimental technique that can be used to estimate the relative ratios of {k}, and so the average composition of the {k} can be calculated; K. The importance of this is that s will be a weighted average of u and K, and so u will lie on the line segment extending from s away from K. Unfortunately the estimation of the relative ratios from PXRD is not exact, and so rather than being a point, K must be represented as a random variable. Assuming the distribution of K to be Gaussian, the induced probability density for u is estimated by a projection of K through s.
- 3 February 2023 14.00-15.00 UK time. Video (58 min) Slides (6M)
- Gabriel Stoltz (CERMICS, Ecole des Ponts ParisTech, France).
- Title. Coarse-graining and efficiently sampling with autoencoders.
- Abstract. A coarse-grained description of atomistic systems in molecular dynamics is provided by reaction coordinates. These nonlinear functions of the atomic positions are a basic ingredient to compute more efficiently average properties of the system of interest, such as free energy profiles. However, reaction coordinates are often based on an intuitive understanding of the system, and one would like to complement this intuition or even replace it with automated tools. One appealing tool is autoencoders, for which the bottleneck layer provides a low dimensional representation of high dimensional atomistic systems. In order to have an efficient numerical method, autoencoders should be combined with importance sampling techniques based on adaptive biasing methods. The algorithm then iterates between an update of the reaction coordinate, and free energy biasing. I will discuss some mathematical foundations of this method, and present illustrative applications for biophysical systems, including alanine dipeptide and chignolin. Some on-going extensions to more demanding systems, namely HSP90, will also be hinted at.
- 27 January 2023 14.00-15.00 UK time
- Jack Wang (Rice University, US).
- Title. RetMol: Retrieval-based Controllable Molecule Generation.
- Abstract. Generating new molecules with specified chemical and biological properties via generative models has emerged as a promising direction for drug discovery. However, existing methods require extensive training/fine-tuning with a large dataset, often unavailable in real-world generation tasks. In this work, we propose a new retrieval-based framework for controllable molecule generation. We use a small set of exemplar molecules, i.e., those that (partially) satisfy the design criteria, to steer the pre-trained generative model towards synthesizing molecules that satisfy the given design criteria. We design a retrieval mechanism that retrieves and fuses the exemplar molecules with the input molecule, which is trained by a new self-supervised objective that predicts the nearest neighbor of the input molecule. We also propose an iterative refinement process to dynamically update the generated molecules and retrieval database for better generalization. Our approach is agnostic to the choice of generative models and requires no task-specific fine-tuning. On various tasks ranging from simple design criteria to a challenging real-world scenario for designing lead compounds that bind to the SARS-CoV-2 main protease, we demonstrate our approach extrapolates well beyond the retrieval database, and achieves better performance and wider applicability than previous methods.
- 13 January 2023 14.00-15.00 UK time (joint with the journal club at AstraZeneca).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Mapping the spaces of crystalline materials and proteins.
- Abstract. Same or different? This question remained open for many real objects including periodic crystals and proteins. Since crystal structures are determined in a rigid form, there is little sense to distinguish them modulo rigid motion (a composition of translations and rotations). Considering the chirality or sign of orientation, it suffices to distinguish crystals modulo isometry, which is any transformation (for example, mirror reflection) maintaining inter-point distances. Geometric Data Science develops complete invariants that are DNA-style descriptors uniquely identifying classes of real objects modulo rigid motion, isometry, or other important equivalences. Such invariants practically distinguished all (660+ thousand) periodic crystals in the Cambridge Structural Database via 200+ billion pairwise comparisons. This experiment was completed over two days on a modest desktop, while traditional RMSD comparisons are estimated to require 34+ thousand years. The new invariants unexpectedly detected five pairs of duplicate structures that are geometrically identical, but one atomic identity was replaced with a different one (Cd with Mn in the pair HIFCAB vs JEPLIA). Since such a replacement seems physically impossible without perturbing geometry, five journals are investigating the integrity of the underlying publications. The more important conclusion is that all known and undiscovered crystals live in a common space of isometry classes of periodic point sets parameterized by complete invariants playing the role of geographic-style coordinates in this crystal universe whose first maps appeared in Acta Cryst A, v.79, p.1-13. A similar rigorous approach to proteins detected unexpected coincidences in the Protein Data Bank.
Back to All seminars | Back to Top of this page | Back to Home page
- 16 December 2022 14.00-15.00 UK time. Video (57 min)
- Semen Gorfman (Tel Aviv University, Israel).
- Title. The algorithms of target transformation of basis vectors of crystal lattice.
- Abstract (pdf with bio). The crystal lattice is the mathematical object that describes the long-range order (three-dimensional periodicity) of crystal structures. The lattice is set by non-coplanar vectors (also known as basis vectors), whose linear combinations with the integer coordinates describe the position of individual lattice points. While the lattice parameters (the lengths and the angles between the basis vectors) are often used as one of the fingerprints of a crystal structure, the choice of basis vectors is not ambiguous. Specifically, one and the same lattice can be created by an infinitely large number of sets of basis vectors. Transformation of basis vectors is the key mathematical operation that appears to be useful for the comparison of polymorphs crystal structures, phase transitions analysis, the understanding of twinning laws, indexing of X-ray diffraction peaks, and lattice reduction. While such operations are often built into larger computer programs and serve as only small blocks, the need for stand-alone algorithms of specific transformation of basis vectors of a crystal lattice still exist. The goal of this presentation is to introduce a simple numerical approach for the transformation of lattice basis vectors to a specific target: in the first case, one of the new basis vectors is aligned to a predefined lattice direction. In the second case (for a three-dimensional lattice) two of the basis vectors are brought to the lattice “plane” with specific Miller indices. Two, three and multidimensional versions of the algorithms will be presented. The applications for the simulation of zone planes and the transformation of a unit cell setting will be shown.
- 9 December 2022 14.00-15.00 UK time. Video (52 min)
- Guo-Wei Wei (Michigan State University, US).
- Title. Mathematics and AI are revolutionizing Biosciences.
- Abstract (doc with bio). Mathematics underpins fundamental theories in physics such as quantum mechanics, general relativity, and quantum field theory. Nonetheless, its success in modern biology, namely cellular biology, molecular biology, biochemistry, genomics, and genetics, has been quite limited. Artificial intelligence (AI) has fundamentally changed the landscape of science, technology, industry, and social media in the past few years and holds a great promise for discovering the rules of life. However, AI-based biological discovery encounters challenges arising from the structural complexity of macromolecules, the high dimensionality of biological variability, the multiscale entanglement of molecular, cell, tissue, organ, and organism networks, the nonlinearity of genotype, phenotype, and environment coupling, and the excessiveness of genomic, transcriptomic, proteomic, and metabolomic data. We tackle these challenges mathematically. Our work focuses on reducing the complexity, dimensionality, entanglement, and nonlinearity of biological data. We have introduced persistent cohomology and various topological Laplacians, including evolutionary Hodge Laplacian, persistent Laplacian, persistent sheaf Laplacian, and persistent path Laplacian to model complex, heterogeneous, multiscale biological systems and thus significantly enhance AI's ability to handle biological data. Using our mathematical AI approaches, my team has been the top winner in D3R Grand Challenges, a worldwide annual competition series in computer-aided drug design and discovery for years. By further integrating with millions of genomes isolated from patients, we reveal the natural selection mechanisms of SARS-CoV-2 evolution and accurately forecast emerging SARS-CoV-2 variants.
- 18 November 2022 14.00-15.00 UK time. Video (42 min).
- April Lynne D. Say-awen (Ateneo de Manila University, Philippines).
- Title. On the Frequency Module of the Hull of a Primitive Substitution Tiling.
- Abstract.
This talk is based on a joint paper with Dirk Frettlöh and Ma. Louise Antonette N. De Las Peñas where a method will be presented on how to arrive at the frequency module of the hull of a primitive substitution tiling. The frequency module of a tiling T is the minimal Z-module that contains the absolute frequency of each patch of T. The method involves introducing a new substitution tiling T' that is derivable from T.
Keywords: frequency module, primitive substitution tilings, aperiodic tilings, quasicrystals
- 21 October 2022 14.00-15.00 UK time. Video (55 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Density functions of periodic sequences.
- Abstract. The past work at SoCG 2021 introduced an infinite sequence of density functions, which are continuous isometry invariants of a periodic point set. For any integer k>=0, the k-th density function of a variable radius measures the fractional volume of the ambient space covered by exactly k balls centred at all given points. These density functions are highly non-trivial even in dimension 1 for periodic sequences of points in the line. The new work at DGMM 2022 fully describes the density functions of any periodic sequence and their symmetry properties. The explicit description confirms coincidences of density functions for a pair of non-isometric sequences that were previously computed via finite samples.
- 14 October 2022 12.00-13.00 UK time. Video (40 min).
- José Miguel Hernández Lobato (University of Cambridge, UK).
- Title. Meta-learning Adaptive Deep Kernel Gaussian Processes for Molecular Property Prediction.
- Abstract. We propose Adaptive Deep Kernel Fitting with Implicit Function Theorem (ADKF-IFT), a novel framework for learning deep kernel Gaussian processes (GPs) by interpolating between meta-learning and conventional deep kernel learning. Our approach employs a bilevel optimization objective where we meta-learn generally useful feature representations across tasks, in the sense that task-specific GP models estimated on top of such features achieve the lowest possible predictive loss on average. We solve the resulting nested optimization problem using the implicit function theorem (IFT). We show that our ADKF-IFT framework contains previously proposed Deep Kernel Learning (DKL) and Deep Kernel Transfer (DKT) as special cases. Although ADKF-IFT is a completely general method, we argue that it is especially well-suited for drug discovery problems and demonstrate that it significantly outperforms previous SOTA methods on a variety of real-world few-shot molecular property prediction tasks and out-of-domain molecular property prediction and optimization tasks.
- 7 October 2022 14.00-15.00 UK time. Video (55 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Degree-k Voronoi domains of periodic point sets.
- Abstract. The talk is based on a joint paper with Phil Smith. Degree-k Voronoi domains of a periodic point set are concentric regions around a fixed centre consisting of all points in Euclidean space that have the centre as their k-th nearest neighbour. Periodic point sets generalise the concept of a lattice by allowing multiple points to appear within a unit cell of the lattice. Thus, periodic point sets model all solid crystalline materials (periodic crystals), and degree-k Voronoi domains of periodic point sets can be used to characterise the relative positions of atoms in a crystal from a fixed centre. We describe the first algorithm to compute all degree-k Voronoi domains up to any degree k>0 for any two or three-dimensional periodic point set.
- 17 June 2022 14.00-15.00 UK time. Video (59 min). Slides (pdf)
- Michele Ceriotti (EPFL, Switzerland).
- Title. A unified theory of atom cloud representations for machine learning.
- Abstract. Simulations of matter at the atomic scale are precious to provide a mechanistic
understanding of chemical processes, and to design molecules and materials with
predictive accuracy. As with many fields of science, machine learning have become
an essential part of the modeling toolbox, with many frameworks having become
well-established, and many more being developed in new research directions.
The most effective frameworks incorporate fundamental physical principles, such as symmetry, locality, and hierarchical decompositions of the interactions between atoms, in the construction of the ML model.
I will discuss a general framework that unifies several of the most recent developments in the field, including the representation of structures in terms of systematically-convergent atom-centered correlations of the neighbor density, as well as equivariant message-passing schemes that build automatically descriptors with equivalent information content.
I will discuss a few examples of the implications of these fundamental findings, for both chemical machine learning and in general for problems that require a description of three-dimensional objects in terms of point clouds, and present some examples that highlight the limitations of some common approaches and point to strategies to overcome them.
- 10 June 2022 15.00-16.00 UK time. Video (45 min).
- Jan Rohlíček (Institute of Physics of the Czech Academy of Sciences).
- Title. CrystalCMP : automatic comparison of molecular structures.
- Abstract. This article describes new developments in the CrystalCMP software. In particular, an automatic procedure for comparison of molecular packing is presented. The key components are an automated procedure for fragment selection and the replacement of the angle calculation by root-mean-square deviation of atomic positions. The procedure was tested on a large data set taken from the Cambridge Structural Database (CSD) and the results of all the comparisons were saved as an HTML page, which is freely available on the web. The analysis of the results allowed estimation of the threshold for identification of identical packing and allowed duplicates and entries with potentially incorrect space groups to be found in the CSD.
- 26 May 2022 14.00-15.00 UK time. Video (67 min). Slides (pdf)
- Ryoko Tomiyasu (Kyushu University, Japan).
- Title. Reduction theory for determining the Bravais type of unit-cell parameters containing observation errors.
- Abstract. In crystallography, the parameters of the unit cells or crystal structures recovered from experimental data are required to be output using their conventional cells. Namely, the lattice-basis reduction must be applied, and the software also transforms the parameters of the primitive cells into those of the conventional cells, based on their symmetries (centering types).
The influence of observational errors cannot be ignored during this algebraic calculation. In 2012, the speaker provided an algorithm for 3D lattices containing observational errors; similar results have been obtained for 2D lattices. The algorithm is new in the sense that all the candidates for nearly reduced bases were explicitly given to speed up the algorithm, and we proved based on the Venkov reduction that it works correctly for parameters with errors of various sizes, ranging from rounding errors to observation errors. The Eisenstein and Selling reductions (known as the Niggli and Delaunay reductions in crystallography) can be regarded as a special case of the Venkov reduction. The code has been used in the author's research in geometry of numbers, and in the ab-initio indexing software CONOGRAPH for PXRD and EBSD distributed by and for experimental scientists.
In the course of the research, the speaker had the opportunity to study the history of the reduction theory in crystallography, which will be mentioned as well as the background in mathematics.
- 20 May 2022 14.00-15.00 UK time. Video (52 min)
- Cyril Cayron (EPFL, Switzerland).
- Title. A method to reduce N-dimensional lattices and solve N-dimensional Bézout's identities.
- Abstract. In order to predict deformation twins in metals and minerals, simple shears on different integral (reticular) planes should be determined. Calculating the shear vectors associated with a given shear plane requires to find a reduced unit cell attached to this plane. Different connected mathematical problems thus emerge: Bézout's identities, integer relations, and cell/lattice reduction. Quite ironically, instead of applying a pre-established lattice reduction method such as LLL, it appeared that simple shearing itself (hyperplanar shearing in dimension N > 3) can be used to reduce the unit cell, and solve the associated N-dimensional Bézout's identity. When applied recursively on different integral planes, hyperplane shearing also permits to reduce N-dimensional lattices by minimizing their metric "rhombicity". The talk will explain the different steps of the method and their geometrical meanings. Some examples with real crystals (N = 3) will be given to explain how the reduction method permits determine the Bravais lattice from diffraction measurements.
- 13 May 2022 14.00-15.00 UK time. Video (44 min)
- James Cumby (University of Edinburgh, UK).
- Title. Encoding chemical similarity using pairwise distances.
- Abstract. In order to design new materials that fulfil a desired function, we require tools to rapidly predict physical properties from the underlying atomic structure. Statistical machine learning is accelerating advances in this area, but a key limitation remains; accurately predicting properties requires an effective measure of similarity between crystal structures. Here, we show that a representation based on grouped pairwise inter-atomic distances (GRID) combined with earth mover's distance (EMD) gives an accurate reflection of chemical space, and can be used to predict multiple physical properties using a simple regression model.
- 6 May 2022 14.00-15.00 UK time. Video (55 min)
- Emre S. Tasci (Hacettepe University, Turkey).
- Title. Using Group Theory to Relate and Compare Structures.
- Abstract. Application of group theory in crystallography and solid state physics brings powerful impositions that drastically limit the possible outcomes or intermediate pathways to manageable potential cases. The talk will begin by a brief introduction of group theory and symmetry, along with symmetry related concepts such as symmetry operators, site symmetries, Wyckoff positions, group settings, group-subgroup relations and will continue on how to relate given structures along with quantitative means of comparison. The presentation will employ tools served by the Bilbao Crystallographic Server (BCS) [https://www.cryst.ehu.es/] such as WYCKPOS, SUBGROUPGRAPH, MINSUP, WYCKSPLIT, PSEUDO, COMPSTRU and STRUCTURE RELATIONS, all of which are freely available at BCS.
- 29 April 2022 14.00-15.00 UK time. Video (55 min)
- Gregory McColm (University of South Florida, US).
- Title. Using Symmetry-Based Equivalence Relations in Classifying Crystallographic Structures
- Abstract. One way to classify geometric figures - including figures of interest to crystallographers - is to fix a hierarchy of groups acting on the underlying geometric space, and from these define a hierarchy of equivalence relations arising from these groups. We look at some of the more popular groups acting on Euclidean spaces, with some kinds of figures popular among crystallographers. Time permitting, we look at some variants within this classification system.
- 26 April 2022 14.00-15.00 UK time (joint with LIV.DAT seminar). Video (44 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. The Crystal Isometry Principle.
- Abstract. Any solid crystalline material (periodic crystal) consists of periodically translated unit cells of atoms or molecules. Since crystal structures are determined in a rigid form, the strongest and most practical equivalence of crystals is rigid motion (a composition of translations and rotations) or isometry (also including reflections). Periodic crystals were classified by coarser equivalences such as space groups into 219 types (230 if mirror images are distinguished). However, the world's largest Cambridge Structural Database (CSD) of 1.1M+ existing crystals requires finer classifications.
The Data Science Theory and Applications group at the Materials Innovation Factory developed generically complete and continuous isometry invariants for periodic sets of atomic centres. Computing these invariants for all 660K+ periodic crystals (full 3D structure; no disorder) in the CSD through 200 billion+ pairwise comparisons over two days on a modest desktop detected five pairs of "identical needles in a haystack". For example, the CSD crystals HIFCAB and JEPLIA are truly isometric, but one atom of Cadmium is replaced by Manganese, which should inevitably perturb a local geometry of atoms. As a result, five journals are now investigating the data integrity of the underlying publications.
These experiments justified the Crystal Isometry Principle, which states that any periodic crystal is uniquely determined by its geometry of atomic centres without chemical information. Then all known and undiscovered periodic crystals live in a common Crystal Isometry Space (CRISP), so that Mendeleev’s periodic table representing individual elements, categorised by two discrete parameters (atomic number and group), can be extended into a continuous space for all solid crystalline materials. For instance, diamond and graphite, both consisting purely of carbon, have different locations in CRISP.
- 8 April 2022 14.00-15.00 UK time. Video (59 min).
- Larry Andrews (Ronin Institute, US).
- Title. Comparing crystal lattices.
- Abstract. Comparing crystal lattices is necessary for several uses: databases of unit cell parameters, Bravais lattice type determination, clustering of cells from serial crystallography at XFELs and synchrotrons elimination of duplicate and near duplicates in crystal structure prediction. The talk will include a brief, simple review of linear algebra. The mechanics of measuring the difference of pairs of unit cells will be discussed ("distance" between lattices). That leads naturally to a discussion of how to determine Bravais lattice types.
- 1 April 2022 14.00-15.00 UK time. Slides (pdf, 1.4M).
- Open table with MaThCryst colleagues and other crystallographers.
- Title. A discussion of continuous invariants and metrics on crystals.
- Abstract. During the great discussion on 25th March with many crystallographers across the world, we agreed that the strongest possible equivalence of periodic crystals is rigid motion (or isometry), which cannot really change any crystal stucture. We also decided to continue next week in a similar format. The next step is to understand which invariants can reliably distinguish crystals up to isometry and how to avoid arbitrary thresholds to continuously quantify the similarity of crystals. One motivation is to automatically identify near duplicates that appear in experimental databases and among simulated structures obtained as different approximations of the same energy minimum in Crystal Structure Prediction.
- 25 March 2022 14.00-15.00 UK time. Slides (pdf, 1M).
- Open table with MaThCryst colleagues and other crystallographers.
- Title. A discussion of equivalences and invariants of periodic crystals.
- Abstract. Periodic lattices and more general periodic point sets model any solid crystalline materials and can be studied modulo various equivalence relations. Historically, all early crystals looked very symmetric and were distinguished by their symmetries. This traditional classification splits all periodic crystals into 219 space-group types (230 if mirror images are distinguished). The recent review by Massimo Nespolo, Mois Aroyo, and Bernd Souvignier gave excellent examples of potential confusion even between "groups" and "group types". Since many similar words have different meanings in closely related areas such as crystallography and mathematics (algebra and geometry), it makes sense to discuss the basic concepts, for example, lattices, equivalences, invariants, abstract groups and their linear (matrix) representations.
- 11 March 2022 14.00-15.00 UK time. Video (60 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Continuous metrics and chiral distances for 2-dimensional lattices.
- Abstract. The paper "Mathematics of 2-dimensional lattices" (the latest pdf, an early version in arxiv:2201.05150) introduced continuous metrics on the spaces of 2-dimensional lattices considered up to isometry, rigid motion, similarity (with uniform scaling) in the plane. These spaces contain the non-oblique Bravais classes (hexagonal, tetragonal (square), rectangular, centered-rectangular) as low-dimensional strata. A continuous distance from a lattice to the subspace of non-oblique lattices can be considered as a real-valued chiral measure for a deviation of a given lattice from its closest higher-symmetry neighbor. We will discuss explicit formulae of metrics and chiral distances, and their computations for millions of crystal lattices extracted from the Cambridge Structural Database in the joint work with Matt Bright and Andy Cooper.
- 4 March 2022 14.00-15.00 UK time.
- James Darby (University of Cambridge, UK).
- Title. Compressing local atomic neighbourhood descriptors.
- Abstract. Many atomic descriptors are currently limited by their unfavourable scaling with the number of chemical elements S. For instance, the length of body-ordered descriptors typically scales as (NS)^v where v+1 is the body-order and N is the number of radial basis functions used in the density expansion. We introduce two distinct approaches which can be used to overcome this scaling for the Smooth Overlap of Atomic Positions (SOAP) power spectrum. Firstly, we show that the power spectrum is amenable to lossless compression with respect to both S and N, so that the descriptor length can be reduced from O(N^2 S^2) to O(NS). Secondly, we introduce a generalized SOAP kernel, where compression is achieved through the use of the total, element agnostic density, in combination with radial projection. The ideas used in the generalized kernel are equally applicably to any other body-ordered descriptors and we demonstrate this for the Atom Centered Symmetry Functions (ACSF). Finally, both compression approaches are shown to offer comparable performance to the original descriptor across a variety of numerical tests.
- 28 January 2022 14.00-15.00 UK time. Video (54 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Geographic-style maps of crystal lattices from the Cambridge Structural Database.
- Abstract. The Crystal Isometry Principle (first presented at MACSMIN 2021) says that all real non-equivalent crystals should be geometrically recognizable by their periodic structures of atomic centres without any chemical data. As a consequence, all known and not yet discovered periodic crystals live in a common Crystal Isometry Space (CRISP) similarly to all chemical elements in Mendeleev's table. The space CRISP can be continuously and unambiguously parametrised by complete isometry invariants, which found unexpected duplicates in the Cambridge Structural Database (CSD), now under investigation by five journals. The talk will present visual maps of CRISP for hundreds of thousands of crystal lattices from the CSD. These maps support the natural preference for higher symmetry crystals and experimentally justify the continuity of CRISP. The latest versions of the key papers are linked at the Geometric Data Science page.
- 14 January 2022 14.00-15.00 UK time.
- Michael White (University of Manchester, UK).
- Title. Digital Fingerprinting of Microstructures.
- Abstract. A statistical framework is systematically developed for compressed characterisation of a population of images, which includes some classical computer vision methods as special cases. The focus is on materials microstructure. The ultimate purpose is to rapidly fingerprint sample images in the context of various high-throughput design/make/test scenarios. This includes, but is not limited to, quantification of the disparity between microstructures for quality control, classifying microstructures, predicting materials properties from image data and identifying potential processing routes to engineer new materials with specific properties. Here, we consider microstructure classification and utilise the resulting features over a range of related machine learning tasks, such as supervised, semi-supervised, and unsupervised learning.
Back to All seminars | Back to Top of this page | Back to Home page
- 10 December 2021 14.00-15.00 UK time. Video (55 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Visual maps of the isometry space of 3-dimensional lattices.
- Abstract. Since crystal structures are determined in a rigid form, 3-dimensional periodic crystals and their lattices should be studied up to rigid motion or isometry, which are compositions of translations, rotations, and reflections. The resulting space of isometry classes of lattices is infinite and was previously studied mostly by discrete invariants, which split this continuous space into a finite number of strata or low-dimensional subspaces representing high-symmetry lattices. We extend the continuously parametrised map of all lattices in dimension 2 (joint with Matt Bright and Andy Cooper) to dimension 3 and also introduce easy metrics on lattices, which are continuous under perturbations of lattice bases.
- 26 November 2021 14.00-15.00 UK time.
- Andreas Alpers (Liverpool Mathematical Sciences).
- Title. Tomographic Imaging of Crystalline Structures: Mathematics & Materials Science.
- Abstract. Since the invention of the first CT scanner in the 1970s, tomography has evolved into a powerful imaging tool in modern medicine. By computer processing of scans acquired from multiple angles, tomography can provide 3D images of inner parts without having to cut the patient. Of course, tomography has also been found to be attractive in materials science. Here, however, new fundamental challenges arise as, for instance, crystalline objects are non-continuous, data is (or can be) acquired only from few angles, and data can be very noisy. It therefore becomes increasingly important to develop problem-specific reconstruction algorithms. As I will discuss in my talk, this can lead into areas of mathematics previously thought to be unrelated to materials science. And, vice versa, the materials science imaging tasks can motivate new developments in mathematics. I will focus in my talk on this interplay between mathematics and materials science, drawing on two examples from my research: the tomographic reconstruction of polycrystals from diffraction data and the reconstruction of nanowires from electron tomography data.
- 12 November 2021 14.00-15.00 UK time. Video (67 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Continuous metrics on isometry classes of 2-dimensional lattices.
- Abstract. A periodic lattice is an infinite set of all integer linear combinations of basis vectors in a Euclidean space. A natural equivalence on lattices is rigid motion (a composition of translations and rotations) or isometry (also including reflections). All 2-dimensional lattices can be parametrised by three real parameters, but there was no easy metric on isometry classes of lattices, which is also continuous under basis perturbations. Using a recent parametrisation of 2-dimensional lattices by root forms, we introduce metrics that are invariant up to isometry or up to rigid motion, which preserves orientation. The latter case also quantifies chirality by measuring a distance from a chiral lattice to its closest mirror-symmetric neighbour. The talk is based on the updated dimension 2 paper joint with Matt Bright and Andy Cooper.
- 29 October 2021 14.00-15.00 UK time.
- Steff Farley (Loughborough University Mathematical Sciences).
- Title. Quantitative Data Integration with Computational Models of Dewetting Processes.
- Abstract. The self-assembly of nanostructures has been of growing interest in materials science, with particular advancements in the development of computational models that describe this self-assembly. So far, however, the utility of these models have been limited by the absence of methods to integrate real experimental data with numerical simulations or the experimental and simulation conditions that generate them. We simulate images of the resulting nanostructures using two models, a kinetic Monte Carlo (KMC) model and a dynamical density functional theory (DDFT) model, and attempt to relate a dataset of 2625 real atomic force microscope (AFM) images with these models. We propose a map to a feature space of modified Minkowski functionals that meaningfully characterise the geometry of this image space as the first step of this task. These coordinate statistics show some promise of allowing us to successfully carry out the inverse problem but are insufficient for this task when used alone. We propose that drawing from the methods of Riemannian geometry in combination with Approximate Bayesian Computation should be considered as a more suitable approach.
- 15 October 2021 14.00-15.00 UK time. Video (58 min).
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Complete isometry invariants of periodic lattices in dimensions 2 and 3.
- Abstract. Motivated by crystal structures determined in a rigid form, we study lattices up to rigid motion or isometry, which is a composition of translations, rotations and reflections. The resulting Lattice Isometry Space (LISP) consists of infinitely many isometry classes of lattices. In dimensions 2 and 3, we parametrise this continuous space by root forms (consisting of three and six numbers) and introduce new metrics on root forms satisfying all metric axioms and continuity under perturbations. These parametrisations help visualise hundreds of thousands of crystal lattices in the Cambridge Structural Database. The talk is based on the dimension 2 and dimension 3 papers with Matt Bright and Andy Cooper.
- 9 July 2021 14.00-15.00 UK time.
- Daniel Widdowson (Materials Innovation Factory, University of Liverpool, UK).
- Title. Invariant-based visualisation of large crystal datasets.
- Abstract. Any solid crystalline material can be represented by a periodic set of points at atomic centres. Crystals or periodic point sets can be distinguished in the strongest possible way up to rigid motion or isometry (a composition of rotations and translations). The talk will discuss continuous isometry invariants that reliably quantify similarities between periodic point sets. These AMD invariants produce distance matrices for any crystal dataset, which may come from Crystal Structure Prediction or be more diverse and include different chemical compositions. Invariant-based distance matrices can be visualised in the form of a tree map showing all crystals connected to their closest neighbours. The AMD invariants are so fast that a map of all 229K molecular organic crystals from the Cambridge Structural Database was produced in less than nine hours on a modest desktop, see the last image in the AMD paper.
- 26 March 2021 14.00-15.00 UK time.
- Phil Smith (Materials Innovation Factory, University of Liverpool, UK).
- Title. The Density Fingerprint of a Periodic Point Set.
- Abstract. Modeling a crystal as a periodic point set, we present a fingerprint consisting of density functions that facilitates the efficient search for new materials and material properties. We prove invariance under isometries, continuity, and completeness in the generic case, which are necessary features for the reliable comparison of crystals. The proof of continuity integrates methods from discrete geometry and lattice theory, while the proof of generic completeness combines techniques from geometry with analysis. The fingerprint has a fast algorithm based on Brillouin zones and related inclusion-exclusion formulae. We have implemented the algorithm and describe its application to crystal structure prediction. The talk is based on the paper with H.Edelsbrunner, M.Wintraecken, T.Heiss, V.Kurlin, accepted at SoCG 2021.
Back to All seminars | Back to Top of this page | Back to Home page
- 17 December 2020 15.15-16.45 UK time, joint with the Computational Mathematics Seminar.
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Introduction to Periodic Geometry for applications in Crystallography and Materials Discovery.
- Abstract. A periodic crystal is modeled as a periodic set of zero-sized points in 3-space. Such a periodic set is usually given by a unit cell (a parallelepiped defined by 3 edge-lengths and 3 angles) and a motif of points with fractional coordinates in this cell. Representing a crystal as a unit cell plus a motif is highly ambiguous. Hence a reliable comparison of rigid crystals should be based on isometry invariants that are preserved under any rigid motions, hence are independent of a unit cell and a motif. The talk will discuss a stable isometry classification of periodic crystals. Most past invariants of crystals such as symmetry groups are discontinuous under atomic vibrations. The new classification establishes foundations of periodic geometry. Based on joint papers with Andy Cooper, Angeles Pulido, Herbert Edelsbrunner, Mathijs Wintraecken, Teresa Heiss, Phil Smith, Marco Mosca, Dan Widdowson, e.g. arxiv:2009.02488.
- 22 October 2020 15.00-16.00 (UK), 7.00-8.00 (California).
- Larry Andrews (Ronin Institute, US).
- Title. The Rise and Fall of the V7 Metric.
- Abstract. Having a measure of the difference between two lattices has several applications: databases of unit cells, clustering of images from serial crystallography, planning of epitaxial processes. Originally, unit cells were compared and cataloged using their sorted unit cell edge lengths. This technique has several problems. The first complete measure for lattices was the V7 metric (Andrews, Bernstein, Pelletier, 1980). It was complete, simple to understand, and in database retrievals it would never have false negative results. However, it was discovered to have a deficiency. The creation of V7, problems with it, and the evolution of methods for more correct distance measures will be discussed.
- 10 July 2020 13.30-15.00 UK time.
- Marco Mosca (Materials Innovation Factory, University of Liverpool, UK).
- Title. Distance-based isometry invariants of crystals.
- Abstract. The talk will consider the isometry classification of periodic sets motivated by the Crystal Structure Prediction. A periodic sets of points models any solid crystalline material (a crystal) and represents any atom or a molecular centre by a zero-sized point. Since most crystals are rigid bodies, the natural equivalence on periodic sets is a rigid motion (a composition of translations and rotations) or an isometry, which preserves distances between points. Traditionally crystals were distinguished by discrete invariants such as symmetry groups that are unstable under atomic vibrations. However, large datasets of simulated crystals contain too many nearly identical crystals obtained as slightly different approximations to local minima of an energy function. The new distance-based isometry invariants are provably stable under noise. The experiments on the T2 dataset from the 2017 Nature paper show that the new invariants distinguish simulated crystals better than past tools and can automatically match experimental crystals to their closest simulated versions.
- 19 May 2020 10.30-12.00 UK time.
- Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Title. Isometry classification of periodic crystals.
- Abstract. The Crystal Structure Prediction (CSP) tools output too many simulated crystals, including many nearly identical local minima of an energy, because there was no reliable way to quantify the similarity between crystals. Since most solid crystalline materials are rigid, the similarity problem is now rigorously stated as a classification modulo rigid motions or isometries that preserve inter-point distances. Most descriptors, especially if they are based on ambiguous unit cells, are either not isometry invariants or are unstable under atomic vibrations, hence can not reliably quantify crystal similarities. The talk introduces three approaches to build stable-under-noise and complete isometry invariants of periodic crystals.
- 3 February 2020 11-13 (lunch and discussion from about 12).
- Uwe Grimm (Open University School of Mathematics and Statistics).
- Location : the boardroom on the ground floor in the MIF.
- Title. An introduction to aperiodically ordered systems and their diffraction.
- Abstract. The talk will present an introductory overview on aperiodically ordered systems obtained from cut and project or inflation methods. The key methods and results will be illustrated by means of examples and pictures, leaving aside much of the mathematical details. For cut and project sets, the diffraction is well understood, and the general result and its application to examples will be discussed. In contrast, inflation systems are still only partially understood, and paradigms with different spectral type are presented. Finally, I will briefly discuss recent results on the Fourier transform of Rauzy fractals and the point spectrum of unimodular Pisot inflation tilings, which uses both the inflation and the projection structure.
Back to All seminars | Back to Top of this page | Back to Home page
- 13 December 2019 : Tom Hasell (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the 3rd floor meeting room in the MIF.
- Title. Making polymers from elemental sulfur
- Abstract. Sulfur is an industrial by-product, removed as an impurity in oil-refining. This has led to vast unwanted stockpiles of sulfur, and resulted in low bulk prices. Sulfur is therefore a promising alternative feedstock to carbon for polymeric materials. Sulfur normally exists as S8 rings – a small molecule with poor physical properties. On heating, these sulfur rings can open and polymerise to form long chains. However, because of the reversibility of sulfur bonds, these polymers are not stable, and decompose back to S8 over time, even at room temperature. Inverse vulcanisation has made possible the production of high sulfur polymers, stabilised against depolymerisation by crosslinking [1]. These high sulfur-materials show excellent potential as low cost water filters to remove mercury [2-4]. Heavy metal contamination exists in the waste streams of many industries, and mercury is of particular concern for human health. Alternative crosslinkers for inverse vulcanisation, from industrial by-products or bio-renewable sources, can be used to reduce the cost and improve the properties of the resultant polymers [4,5]. Polymers made from sulfur also have many other intriguing properties and applications in optics, electronics, insulation, and antimicrobial materials. We recently reported a catalytic route that reduces the required reaction time, temperature, and by-products – and allows otherwise unreactive crosslinkers to be used [6].
[1] Nat. Chem. 2013, 5, 518–524. [2] Chem. Commun., 2016, 52, 5383 - 5386. [3] J. Mater. Chem. A. 2017,5, 11682-11692 [4] J. Mater. Chem. A., 2017, 5, 18603 [5] Chem.-Eur. J. 2019, 25, 10433-10440 [6] Nat. Commun. 2019, 10, 647
- 4 November 2019 : Linjiang Chen (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the 3rd floor meeting room in the MIF.
- Title. Functional materials discovery: ‘Seeing’ structures and functions?
- Abstract. We have recently demonstrated a strategy for how new, functional molecular materials can be discovered. By carrying out a priori prediction of both the crystal structure and its functional properties, energy–structure–function (ESF) maps are created to aid researchers, without computational expertise, in realizing several remarkable porous materials promising for different possible applications.
As large-scale computational screening studies become routinely carried out, new opportunities have arisen for accelerating materials discovery by taking advantage of the availability of big data. It, however, remains a challenge to properly understand and use the vast amount of data generated by simulations. In my talk, I will show how we use computation to accelerate serendipitous discovery of new functional materials, as well as the prowess of data-driven machineries becoming available. I will outline our current focuses on and approaches to the following topics:- 1. ESF mapping to identify synthetic targets with desired functional properties;
- 2. Interactive visualization of high-dimensional structure–property relationships;
- 3. Smart navigation of the ESF space with machine learning.
- 21 October 2019 : Georg Osang (IST Austria Computational Geometry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the boardroom on the ground floor in the MIF.
- Title. The Multi-cover Persistence of Euclidean Balls
- Abstract. Persistent homology has become a popular tool to analyse various kinds of data, in particular in material sciences. Specifically, persistence of discrete point sets has recently been used to analyse sphere packing data, to shed light on structures arising in sphere packings at different packing densities. We generalize this notion and introduce higher-order persistence of discrete point sets. We address computational challenges and show how this notion can deal with noisy point samples. In the setting of sphere packings we show that this notion can also capture a wider variety of local structures, and in particular can distinguish between the hexagonal close packing and the face centered cubic lattice packing, two structures know to have optimal packing density in 3 dimensions.
- 1 July 2019 : John Claridge (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the 3rd floor meeting room in the MIF.
- Title : Challenges and opportunities in condensed matter science.
- Abstract. Solid state chemistry, is the study of the synthesis, structure and properties of solid state materials and particularly how these are related to each other. It is obviously closely linked to condensed matter physics, mineralogy, crystallography, ceramics, metallurgy, materials science and electronics. We'll begin with some simple illustrations of these relationships, before looking at how structural/crystallographic methods have advanced in order to both characterise and describe the materials we can now make, and where I believe there are limitations in our current tools and understanding, with examples taken from my own work and more generally, and why they are important.
- 13 May 2019 : Vanessa Robins (ANU Applied Mathematics).
- Time : 11-12.15 (with lunch at VGM from 12.30).
- Location : the boardroom on the ground floor in the MIF.
- Title : An introduction to the mathematical description and computational enumeration of periodic tilings and nets.
- Abstract. Periodic tilings and nets are simplified models of atomic arrangements in crystals. Systematic methods for describing and enumerating possible crystal structures are based on factoring out either the translational periodicity or the full crystallographic symmetry. The mathematical version of “factoring out” gives us an object called an orbifold. I will explain how orbifolds encapsulate the essential elements of a crystallographic group and provide an effective basis for the enumeration of hypothetical crystalline structures. The presentation will be informal and largely based on pictures and examples, hopefully accessible to non-specialists.
- 29 April 2019 : Vladimir Gusev (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the boardroom on the ground floor in the MIF.
- Title : New approaches to the crystal structure prediction
- 1 April 2019 : Chris Collins (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the boardroom on the ground floor in the MIF.
- Title : Computationally driven materials discovery
- Abstract.
In the previous seminar by Dr D. Antypov, we discussed the general concepts of energy calculations and crystal structure prediction (CSP). In this session are to discuss how CSP is used in practice in the search for new inorganic materials.
We present how the use of ‘probe’ structures accelerates the experimental exploration of composition space. A probe structure is a hypothetical crystal structure for a given composition, it need not be the ground state. However, it needs to be complex enough that it can contain enough co-ordination chemistry to produce an energy which is representative of potential single phases. When probe structures are generated across a range of compositions we can then construct the convex hull, and look for minima – regions of compositional space in which new compounds are likely to be found.
We will discuss the methods for creating probe structures. The development of a method is split into two streams: how can we best represent crystal structures in a computer? And how can we search through the possible configuration space? We will then present our latest codes developed for probe structure prediction (Monte Carlo – Extended Module Materials Assembly (MC-EMMA[1]) and the Flexible Unit Structure Engine (FUSE[2]), along with example phase fields computed using them and show how probe structure generation for composition prediction is a powerful, predictive tool to accelerate the discovery of materials.
1. C. Collins, M. S. Dyer, M. J. Pitcher, G. F. S. Whitehead, M. Zanella, P. Mandal, J. B. Claridge, G. R. Darling and M. J. Rosseinsky, Nature, 2017, 546, 280-284.
2. C. Collins, G. R. Darling and M. J. Rosseinsky, Faraday Discussions, 2018, 211, 117-131.
- 11 March 2019 : Dmytro Antypov (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the 3rd floor meeting room in the MIF.
- Title : Introduction to crystal structure prediction
- Abstract. Crystalline solids such as metal oxides and perovskites are functional materials used in solar cells, batteries and many other devices. These materials are often composed from 3, 4 or 5 different chemical elements arranged in a periodic three dimensional structure. Such chemical diversity on the one hand allows for fine-tuning of material properties but on the other hand makes the identification and synthesis of stable compounds difficult. To accelerate the design of such materials we use computation to predict the combinations of the constituent chemical elements that will lead to stable crystalline structures. In this talk I will explain how this chemistry problem is formalised to have a tractable computational solution and show some examples.
- 25 February 2019 : Phillip Maffettone (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the 3rd floor meeting room in the MIF.
- Title : Describing solids, and challenges for high-throughput analysis.
- Abstract. The talk will introduce reciprocal space and the reciprocal lattice as the inverse of the direct lattice, building on the previous introduction describing periodic structures. The reciprocal lattice plays a fundamental role in understanding wave mechanics in crystalline materials, and is implemented in most analytical studies of materials through the theory of diffraction. These concepts will be presented with a focus on their experimental utility, necessary limitations, and potential implications for high-throughput automation. In particular, we will explore the question: When can we meaningfully invert the diffraction pattern and the pair-wise information it contains, and how can we automate this inversion?
- 11 February 2019 : Mike Gaultois (Liverpool Chemistry).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the 3rd floor meeting room in the MIF.
- Title : Searching for new magnetic materials.
- Abstract. Magnetic materials have fascinated humans since the antiquity, but only recently since the 1800's with the work of Oersted and Maxwell did we begin to develop a deeper understanding of magnetism. Magnetic materials are now involved in many technological applications, many of which are limited by the material performance. One area of enquiry is the search for materials that can make hard permanent magnets, and another area of enquiry is the search for materials that conduct electricity without resistance (i.e., superconductors). This discussion is interested in exploring how we can best computationally predict (through first principles, or otherwise) the magnetic properties of a given material. We will describe what others have done to search for new materials using machine learning techniques, and discuss potential areas of interaction between MIF++ where advances can be made.
- 28 January 2019 : Vitaliy Kurlin (Materials Innovation Factory, University of Liverpool, UK).
- Time : 11-13 (lunch and discussion from about 12).
- Location : the boardroom on the ground floor in the MIF.
- Title : Mathematical challenges for solid crystalline materials.
- Abstract. The talk will introduce periodic structures that model all solid crystalline materials, for example molecular crystals, covalent organic frameworks and inorganic crystals based on isolated atoms or ions. The key problems of materials discovery will be stated in mathematical terms: what structures should be equivalent and how to quantify a similarity between crystals. We will discuss several requirements for potentials solutions that should work for real crystals.
Back to Top of this page | Back to Home page