Optimization of Photonic Devices: Implementation of Auto-Differentiable Numerical Methods in Open-Source Software

Séminaire GDR Ondes

Benjamin Vial

Imperial College London

Introduction

What is topology optimization?

A mathematical method that optimizes material layout within a given design space, for a given set of sources, boundary conditions and constraints with the goal of maximizing the performance of the system

Topology optimization Hello World!: Maximizing a beam stiffness with fixed volume fraction (Bleyer 2018)

Structural Engineering

Qatar National Convention Centre

Aeronautics

Airplane wing (Aage et al. 2017)

Photonics

(Molesky et al. 2018)

Topology optimization: recipes

Density function
\(p \in [0,1]\): material distribution in design domain \(\Omega_{\rm des}\)
Filtering
Convolution: \(f(\B r) = \frac{1}{A}{\rm exp}(-|\B r|^2 /R_f^2)\), with \(\int_{\Omega_{\rm des}} f(\B r) =1\) \[\begin{equation*} \densf(\B r) = p * f = \int_{\Omega_{\rm des}} p(\B r') f(\B r -\B r') {\rm d} \B r' \label{eq:gaussian_filt} \end{equation*}\] PDE: \(-R_f^2 \B\nabla ^2 \densf + \densf = p {\quad\rm on \,}\Omega_{\rm des}, \grad\densf\cdotp\B n = 0 {\quad\rm on \,}\partial\Omega_{\rm des}\) (Lazarov et al. 2011)

Topology optimization: recipes

Projection

\[\densp(\densf) = \frac{\tanh\left[\beta\nu\right] + \tanh\left[\beta(\densf-\nu)\right] }{\tanh\left[\beta\nu\right] + \tanh\left[\beta(1-\nu)\right]}\] with \(\nu=1/2\) and \(\beta>0\) increased during the course of the optimization. (Wang et al. 2010)
Interpolation

\(\varepsilon(\densp)=(\varepsilon_{\rm max}-\varepsilon_{\rm min})\,\densp^m + \varepsilon_{\rm min}\) (Bendsøe et al. 1999)

Algorithm

gradient based optimization algorithm
method of moving asymptotes (Svanberg 2002), free implementation via the nlopt package (Johnson 2007)
40 iterations or until convergence on the objective or design variables
repeated setting \(\beta =2^n\), where \(n\) is an integer between 0 and 7, restarting the algorithm with the optimized density obtained at the previous step

Computing gradients

Solution vector \(\B u\) depends on a vector of parameters \(\B p\) of size \(M\) and defined implicitly through an operator \(\B F\) as: \[ \B F(\B u, \B p) = \B 0 \qquad(1)\]

\(\B G\) is a functional of interest of dimension \(N\), representing the quantity to be optimized

Finite differences

\[ \frac{\mathrm{d}\B G}{\mathrm{d}p_i} \approx \frac{\B G(\B p + h \B e_i) - \B G(\B p)}{h} \] where \(\B e_i\) is the vector with \(0\) in all entries except for \(1\) in the \(i^{th}\) entry.

numerical inaccuracy
expensive for large \(M\) and/or \(N\)

Computing gradients

Tangent linear equation

Explicitly, the gradient can be computed applying the chain rule: \[ \frac{\mathrm{d}\B G}{\mathrm{d}\B p} = \frac{\partial \B G}{\partial \B p} + \frac{\partial \B G}{\partial \B u} \frac{\mathrm{d}\B u}{\mathrm{d}\B p}. \qquad(2)\] Taking the total derivative of Equation 1 we obtain the tangent linear equation: \[ {\frac{\partial \B F(\B u, \B p)}{\partial \B u}} {\frac{\mathrm{d}\B u}{\mathrm{d}\B p}} = {-\frac{\partial \B F(\B u, \B p)}{\partial \B p}}. \]

Adjoint equation

Assuming the tangent linear system is invertible, we can rewrite the Jacobian as: \[ \frac{\mathrm{d}\B u}{\mathrm{d}\B p} = - \left(\frac{\partial \B F(\B u, \B p)}{\partial \B u}\right)^{-1} \frac{\partial \B F(\B u, \B p)}{\partial \B p}. \] After substituting this value in Equation 2 and taking the adjoint (Hermitian transpose, denoted by \(\dagger\)) we get: \[ \frac{\mathrm{d}\B G}{\mathrm{d}\B p}^{\dagger} = \frac{\partial \B G}{\partial \B p}^{\dagger} - \frac{\partial \B F(\B u, \B p)}{\partial \B p}^{\dagger} \left(\frac{\partial \B F(\B u, \B p)}{\partial \B u}\right)^{-\dagger} \frac{\partial \B G}{\partial \B u}^{\dagger} . \]

Computing gradients

Adjoint equation

Defining the adjoint variable \(\B \lambda\) as: \[ \B \lambda = \left(\frac{\partial \B F(\B u, \B p)}{\partial \B u}\right)^{-\dagger} \frac{\partial \B G}{\partial \B u}^{\dagger} \] we obtain the adjoint equation \[ \left(\frac{\partial \B F(\B u, \B p)}{\partial \B u}\right)^{\dagger} \B \lambda = \frac{\partial \B G}{\partial \B u}^{\dagger}. \]

Automatic differentiation (AD)

A general way of taking a program which computes a value, and automatically constructing a procedure for computing derivatives of that value, accurately to working precision, and using at most a small constant factor more arithmetic operations than the original program (Griewank et al. 2008)

Not finite differences / symbolic differentiation
Procedure:
1. Decompose original code into intrinsic functions (build computational graph)
2. Differentiate the intrinsic functions, effectively symbolically
3. Multiply together according to the chain rule
Automation:
- Source code transformation
- Operator overloading

Automatic differentiation (AD)

\(f: \mathbb{R}^M \rightarrow \mathbb{R}^N\)

Example: \(f(x_1,x_2) = x_1x_2 + \sin(x_1)\)

Forward mode

more efficient if \(N\gg M\)

Reverse mode

more efficient if \(M \gg N\)
need to store intermediate values

Finite Element Method

Open-source code

Finite Element Method

Implementation

Open source libraries with bindings for the python programming language using a custom code gyptis (Vial 2022).

Geometry and mesh generation: gmsh (Geuzaine et al. 2009)
FEM library: fenics using second order Lagrange basis functions (Alnæs et al. 2015)
Gradient calculations: dolfin-adjoint library with automatic differentiation (Mitusch et al. 2019)

gyptis

Finite Element Method

Application: Bi-focal lens

Objective: focal point at two different locations depending on the excitation frequency (Vial, Whittaker, et al. 2022) \[ \max_{p(\B r)} \quad \Phi = \left|E_1(\omega_1,\B r_1)\right| + \left|E_2(\omega_2,\B r_2)\right| \]

Optimization history

Finite Element Method

Inverse design of superscatterers

Objective: maximize the normalized scattering width (Vial and Hao 2022) \[ \max_{p(\B r)} \quad \Phi = \sigma_s/2R \]

TE

TM

Finite Element Method

Inverse design of superscatterers

Spectra

Quasi Normal Modes expansion

nannos

FMM benchmark

Application: metasurface

Objective: maximize the average of the transmission coefficient in the \((1,0)\) diffracted order for both polarizations: \[ \max_{p(\B r)} \quad \Phi = \frac{1}{2} \left( T^{\rm TE}_{(1,0)} + T^{\rm TM}_{(1,0)}\right) \]

Application: metasurface

Optimized metasurface

Plane Wave Expansion Method

Open-source code

Plane Wave Expansion Method

2D, possibly \(z\)-anisotropic materials in \(\varepsilon\) and \(\mu\), non dispersive
Polarization decouple, expand the \(z\) components as: \[ u(\B{r})=\sum_{\B {G}} u_{\B {G}}\, {\rm e}^{i(\B {k}+\B {G}) \cdot \B{r}}, \label{eq:pwem1} \]
After Fourier transforming Maxwell’s equations and recombining the relevant \(z\) component of the fields, we get the following generalized eigenproblem: \[ \mathcal{Q}^{\rm T} \,\hat{\tens{\theta}_\parallel}^{-1}\,\mathcal{Q}\, \Phi = k_0^2 \,\chi_{zz}\, \Phi \qquad(3)\] \(\tens{\theta}_\parallel=\tens{\mu}_\parallel\) for TM and \(\tens{\varepsilon}_\parallel\) for TE polarization, \(\mathcal{Q} = \left[\hat{k}_{y}, -\hat{k}_{x}\right]^{\rm T}\) and \(\Phi=\left[u_{\B{G}_{1}}, u_{\B{G}_{2}}, \ldots\right]^{\rm T}\)
Reduced Bloch Mode Expansion (Hussein 2009), only solving Equation 3 at symmetry points of the first Brillouin zone and performing a second expansion using those modes as a basis set.

protis

Photonic crystals: maximizing bandgaps

TE modes, square array with enforced \(C_4\) symmetry on the unit cell, \(\varepsilon_{\rm min}=1\) (air) and \(\varepsilon_{\rm max}=9\)

Objective: open and maximize a bandgap between the \(5^{th}\) and \(6^{th}\) eigenvalues:\[\begin{equation*}\max_{p(\B r)} \quad \Phi = \min_{\B k} \omega_{6}(\B k) - \max_{\B k} \omega_{5}(\B k)\end{equation*}\]

Final distribution in agreement with simple geometrical rules: the walls of an optimal centroidal Voronoi tessellation with \(n=5\) points (Sigmund et al. 2008)

Photonic crystals: dispersion engineering

TM modes, symmetry with respect to \(y\)

Objective: obtain a prescribed dispersion curve for the \(6^{th}\) band \[\begin{equation*}\min_{p(\B r)} \quad \Phi = \left\langle\left|\omega_{6}(k_x) - \langle \omega_{6}\rangle - \omega_{\rm tar}(k_x) \right|^2\right\rangle \end{equation*}\] with \[\begin{align*}\omega_{\rm tar}(k_x) =& -0.02 \cos(k_x a) + 0.01 \cos(2 k_x a) \\ &+ 0.007 \cos(3 k_x a)\end{align*}\] \(\langle f\rangle =\frac{1}{M}\sum_{m=0}^M f_m\)

Open source

Free software: low cost, portable, customizable, vendor-independent
Widely used programming language, is easily installable and integrates with the rich and growing scientific Python ecosystem
Reproducible and collaborative research
Auto-differentiation: inverse design of photonic structures and metamaterials with improved performances and explore intriguing effects

Get the code

Development on gitlab: continuous integration for testing and documentation deployment
Install / fork it / run it online / report bugs!

FEM

pip install gyptis

conda install -c conda-forge gyptis

FMM

pip install nannos

conda install -c conda-forge nannos

PWEM

pip install protis

Freeware list

https://github.com/joamatab/awesome_photonics

Thank you!

Appendix

Bibliography

Aage, Niels, Erik Andreassen, Boyan S. Lazarov, and Ole Sigmund. 2017. “Giga-Voxel Computational Morphogenesis for Structural Design.” Nature 550 (7674): 84–86. https://doi.org/10.1038/nature23911.

Alnæs, Martin, Jan Blechta, Johan Hake, August Johansson, Benjamin Kehlet, Anders Logg, Chris Richardson, Johannes Ring, Marie E. Rognes, and Garth N. Wells. 2015. “The FEniCS Project Version 1.5.” Archive of Numerical Software 3 (100). https://doi.org/10.11588/ans.2015.100.20553.

Bendsøe, M. P., and O. Sigmund. 1999. “Material Interpolation Schemes in Topology Optimization.” Arch. Appl. Mech. Ing. Arch. 69 (9-10): 635–54. https://doi.org/10.1007/s004190050248.

Bleyer, Jeremy. 2018. Numerical Tours of Computational Mechanics with FEniCS. Manual. Zenodo. https://doi.org/10.5281/zenodo.1287832.

Bradbury, James, Roy Frostig, Peter Hawkins, Matthew James Johnson, Chris Leary, Dougal Maclaurin, George Necula, et al. 2018. “JAX: Composable Transformations of Python+NumPy Programs.”

Geuzaine, Christophe, and Jean-François Remacle. 2009. “Gmsh: A 3-D Finite Element Mesh Generator with Built-in Pre- and Post-Processing Facilities.” Int J Numer Meth Engng 79 (11): 1309–31. https://doi.org/10.1002/nme.2579.

Granet, G., and B. Guizal. 1996. “Efficient Implementation of the Coupled-Wave Method for Metallic Lamellar Gratings in TM Polarization.” JOSA A 13 (5): 1019–23. https://doi.org/10.1364/JOSAA.13.001019.

Griewank, Andreas, and Andrea Walther. 2008. Evaluating Derivatives. Other Titles in Applied Mathematics. Society for Industrial and Applied Mathematics. https://doi.org/10.1137/1.9780898717761.

Harris, Charles R., K. Jarrod Millman, Stéfan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, et al. 2020. “Array Programming with NumPy.” Nature 585 (7825): 357–62. https://doi.org/10.1038/s41586-020-2649-2.

Hussein, Mahmoud I. 2009. “Reduced Bloch Mode Expansion for Periodic Media Band Structure Calculations.” Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 465 (2109): 2825–48. https://doi.org/10.1098/rspa.2008.0471.

Johnson, Steven G. 2007. “The NLopt Nonlinear-Optimization Package.” https://github.com/stevengj/nlopt.

Lalanne, Philippe, and G. Michael Morris. 1996. “Highly Improved Convergence of the Coupled-Wave Method for TM Polarization.” Journal of the Optical Society of America A 13 (4): 779–84.

Lazarov, B. S., and O. Sigmund. 2011. “Filters in Topology Optimization Based on Helmholtz-type Differential Equations.” International Journal for Numerical Methods in Engineering 86 (6): 765–81. https://doi.org/10.1002/nme.3072.

Liu, Victor, and Shanhui Fan. 2012. “S4 : A Free Electromagnetic Solver for Layered Periodic Structures.” Computer Physics Communications 183 (10): 2233–44. https://doi.org/10.1016/j.cpc.2012.04.026.

Maclaurin, Dougal, David Duvenaud, and Ryan P Adams. 2015. “Autograd: Effortless Gradients in Numpy.” In ICML 2015 AutoML Workshop, 238:5.

Mitusch, Sebastian K., Simon W. Funke, and Jørgen S. Dokken. 2019. “Dolfin-Adjoint 2018.1: Automated Adjoints for FEniCS and Firedrake.” Journal of Open Source Software 4 (38): 1292. https://doi.org/10.21105/joss.01292.

Molesky, Sean, Zin Lin, Alexander Y. Piggott, Weiliang Jin, Jelena Vucković, and Alejandro W. Rodriguez. 2018. “Inverse Design in Nanophotonics.” Nature Photonics 12 (11): 659–70. https://doi.org/10.1038/s41566-018-0246-9.

Paszke, Adam, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. “Automatic Differentiation in PyTorch.” In NIPS-W.

Paszke, Adam, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, et al. 2019. “PyTorch: An Imperative Style, High-Performance Deep Learning Library.” In Advances in Neural Information Processing Systems 32, edited by H. Wallach, H. Larochelle, A. Beygelzimer, F. dAlché-Buc, E. Fox, and R. Garnett, 8024–35. Curran Associates, Inc.

Sigmund, Ole, and Kristian Hougaard. 2008. “Geometric Properties of Optimal Photonic Crystals.” Physical Review Letters 100 (15): 153904. https://doi.org/10.1103/PhysRevLett.100.153904.

Svanberg, Krister. 2002. “A Class of Globally Convergent Optimization Methods Based on Conservative Convex Separable Approximations.” SIAM Journal on Optimization 12 (2): 555–73. https://doi.org/10.1137/S1052623499362822.

Vial, Benjamin. 2022. “Gyptis.” Zenodo. https://doi.org/10.5281/zenodo.6636134.

Vial, Benjamin, and Yang Hao. 2022. “Open-Source Computational Photonics with Auto Differentiable Topology Optimization.” Mathematics 10 (20): 3912. https://doi.org/10.3390/math10203912.

Vial, Benjamin, Tom Whittaker, Shiyu Zhang, William G. Whittow, and Yang Hao. 2022. “Optimization and Experimental Validation of a Bi-Focal Lens in the Microwave Domain.” AIP Advances 12 (2): 025103. https://doi.org/10.1063/5.0074062.

Virtanen, Pauli, Ralf Gommers, Travis E. Oliphant, Matt Haberland, Tyler Reddy, David Cournapeau, Evgeni Burovski, et al. 2020. “SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python.” Nature Methods 17: 261–72. https://doi.org/10.1038/s41592-019-0686-2.

Wang, Fengwen, Boyan Stefanov Lazarov, and Ole Sigmund. 2010. “On Projection Methods, Convergence and Robust Formulations in Topology Optimization.” Struct Multidisc Optim 43 (6): 767–84. https://doi.org/10.1007/s00158-010-0602-y.

Whittaker, D. M., and I. S. Culshaw. 1999. “Scattering-Matrix Treatment of Patterned Multilayer Photonic Structures.” Physical Review B 60 (4): 2610–18. https://doi.org/10.1103/PhysRevB.60.2610.

Optimization of Photonic Devices: Implementation of Auto-Differentiable Numerical Methods in Open-Source Software

Introduction

Introduction

Topology optimization: recipes

Topology optimization: recipes

Algorithm

Computing gradients

Computing gradients

Finite differences

Computing gradients

Tangent linear equation

Adjoint equation

Computing gradients

Adjoint equation

Automatic differentiation (AD)

Automatic differentiation (AD)

Finite Element Method

Finite Element Method

Implementation

gyptis

Finite Element Method

Application: Bi-focal lens

Optimization history

Finite Element Method

Inverse design of superscatterers

TE

TM

Finite Element Method

Inverse design of superscatterers

Spectra

Quasi Normal Modes expansion

Fourier Modal Method

Fourier Modal Method (FMM)

Implementation

nannos

FMM benchmark

Application: metasurface

Application: metasurface

Optimized metasurface

Plane Wave Expansion Method

Plane Wave Expansion Method

protis

Photonic crystals: maximizing bandgaps

Photonic crystals: dispersion engineering

Open source

Open source

Get the code

Freeware list

Thank you!

Appendix

Bibliography