Chengping Ro, Ho Sun*, Yng Liu,*
a Department of Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USA
b Department of Civil and Environmental Engineering, Northeastern University, Boston, MA 02115, USA
c Department of Civil and Environmental Engineering, MIT, Cambridge, MA 02139, USA
Keywords: Physics-informed neural networks (PINN); Deep learning; Fluid dynamics; Incompressible laminar flow
ABSTRACT Physics-informed deep learning has drawn tremendous interest in recent years for solving computational physics problems. Its basic concept is to embed physical laws to constrain/inform neural networks, so that less data is needed to train a reliable model. This can be achieved by incorporating the residual of the physics equations into the loss function. By minimizing the loss function, the network can approximate the solution. In this paper, we propose a mixed-variable scheme of physics-informed neural network (PINN) for fluid dynamics and apply it to simulate steady and transient laminar flows at low Reynolds numbers. A parametric study indicates that the mixed-variable scheme can improve the PINN trainability and the solution accuracy. The velocity and pressure fields predicted by the proposed PINN approach are also compared with the reference numerical solutions. Simulation results demonstrate the great potential of the proposed PINN for accurate fluid flow simulation.
Deep learning (DL) has attracted tremendous attention in recent years in the field of computational mechanics due to its powerful capability in nonlinear modeling of complex spatiotemporal systems. According to a technical report [1] by the U.S. Department of Energy, a DL-based approach should be domain-aware, interpretable and robust in order to serve as a general approach for solving science and engineering problems. Recent studies leveraging DL to model physical systems include, to name just a few, Refs. [2-6]. These applications can be categorized into two types based on how the DL model is constructed: in a data-driven or a physics-informed manner. In a data-driven framework, the DL model is constructed as a black box to learn a surrogate mapping from the formatted input x ∈ R^m to the output y ∈ R^n. The exceptional approximation ability of deep neural networks (DNNs) makes it possible to learn the mapping even when the dimensionalities m and n are very high. The training dataset {x, y}, typically very rich, can be obtained by conducting high-fidelity simulations using exact solvers (e.g., see Refs. [3, 4, 7]). Nevertheless, obtaining a sufficiently rich dataset from simulations for training a reliable DL model is computationally expensive and requires careful case design. To address this fundamental challenge, physics-informed DL explicitly embeds the physical laws (e.g., the governing partial differential equations (PDEs), initial/boundary conditions, etc.) into the DNN, constraining the network's trainable parameters within a feasible solution space. The objective of exploiting physical laws in the DNN is to (1) reduce the strong dependency of the model on the available dataset in terms of both quality and quantity, and (2) improve the robustness and interpretability of the DL model.
In this regard, DNNs essentially have the capacity to approximate the latent solutions of PDEs [8, 9], with distinct benefits summarized as follows: (1) the superior interpolation ability of DNNs, (2) the approximated solution has a closed form with continuous derivatives of all orders, and (3) state-of-the-art hardware advances make the numerical implementation and parallelization extremely convenient.
More recently, Raissi et al. [5, 6] introduced a general framework of PINN and demonstrated its capacity in modeling complex physical systems, such as solving/identifying PDEs. A major difference from some of the previous studies is that, in addition to the physical laws, the PINN can also exploit the available measurement data, making it possible to discover systems whose physics are not fully understood. In particular, seminal contributions of using PINN to model fluid flows have been made recently. For example, Kissas et al. [10] employed PINN to predict the arterial blood pressure based on MRI data and the conservation laws. Sun et al. [11] proposed a PINN approach for surrogate modeling of fluid flows without any simulation data. Zhu et al. [12] proposed a physics-constrained convolutional encoder-decoder network and a generative model for modeling of stochastic fluid flows.
In this paper, we formulate a mixed-variable PINN scheme for simulation of viscous incompressible laminar flows without any measurement data. The remainder of this paper is organized as follows. We first introduce the methodology of the mixed-variable PINN and the mathematical formulation for fluid dynamics. The steady and transient laminar flows past a circular cylinder are then modeled using the proposed PINN scheme without any simulation data. A comparison study is made to demonstrate the improved solution accuracy and network trainability of the proposed scheme.
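Let us consider the incompressible Newtonian flow governed by the following Navier–Stokes equations:
$$\nabla \cdot \mathbf{v} = 0, \tag{1}$$
$$\frac{\partial \mathbf{v}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{v} = -\frac{1}{\rho}\nabla p + \frac{\mu}{\rho}\nabla^{2}\mathbf{v} + \mathbf{g}, \tag{2}$$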
where ∇ is the nabla operator, v = (u, v) is the velocity vector, p is the pressure, μ is the viscosity of the fluid, ρ is the density of the fluid and g is the gravitational acceleration. When leveraging PINN to solve the aforementioned PDEs, minimizing the residual loss resulting from Eq. (2) is intractable due to its complex form with multiple latent variables (e.g., v and p) and high-order derivatives (e.g., ∇²). In order to design an easily trainable PINN, we convert the Navier–Stokes equation in Eq. (2) to the following continuum and constitutive formulations:
$$\frac{\partial \mathbf{v}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{v} = \frac{1}{\rho}\nabla\cdot\boldsymbol{\sigma} + \mathbf{g}, \tag{3}$$
$$\boldsymbol{\sigma} = -p\,\mathbf{I} + \mu\left(\nabla\mathbf{v} + \nabla\mathbf{v}^{T}\right), \tag{4}$$
where σ is the Cauchy stress tensor and p = −tr σ/2. The benefits of using the continuum-mechanics-based formulation are twofold: (1) reducing the order of derivatives when a mixed-variable scheme in PINN is used and (2) improving the trainability of the DNN, as found in the comparison of numerical results.
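As a consistency check, the continuum and constitutive formulation can be shown to recover the Navier–Stokes momentum equation. The sketch below assumes the standard Newtonian constitutive law with constant viscosity μ and a two-dimensional flow (so that tr I = 2); the intermediate steps are not spelled out in the original text.

```latex
\begin{aligned}
\nabla\cdot\boldsymbol{\sigma}
  &= \nabla\cdot\left[-p\,\mathbf{I} + \mu\left(\nabla\mathbf{v} + \nabla\mathbf{v}^{T}\right)\right] \\
  &= -\nabla p + \mu\,\nabla^{2}\mathbf{v} + \mu\,\nabla\left(\nabla\cdot\mathbf{v}\right) \\
  &= -\nabla p + \mu\,\nabla^{2}\mathbf{v}
  \qquad \text{since } \nabla\cdot\mathbf{v} = 0, \\[4pt]
\operatorname{tr}\boldsymbol{\sigma}
  &= -2p + 2\mu\,\nabla\cdot\mathbf{v} = -2p
  \quad\Longrightarrow\quad p = -\tfrac{1}{2}\operatorname{tr}\boldsymbol{\sigma}.
\end{aligned}
```

Dividing the first result by ρ and adding g gives the right-hand side of Eq. (2), so both formulations describe the same flow.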
The proposed mixed-variable scheme is used in this paper to solve the aforementioned PDEs (see Eqs. (1), (3) and (4)) that govern the laminar flow dynamics. The salient feature of PINN is that the physical fields are approximated globally by a DNN. In particular, the DNN maps the spatiotemporal variables {t, x}^T to the mixed-variable solution {ψ, p, σ}, where the stream function ψ is employed rather than the velocity v to ensure the divergence-free condition of the flow. In this way, the continuity equation will be satisfied automatically. For a two-dimensional problem, the velocity components can be computed by [u, v, 0] = ∇ × [0, 0, ψ]. Note that v = [u, v] is taken as the latent variable. Automatic differentiation is used to obtain the partial derivatives of the DNN output with respect to time and space (e.g., t, x and y). The loss function is composed of the data loss J_d (if measurements are available) and the physics loss J_p. The physics loss J_p is the summation of the governing equation loss J_g and the initial and boundary condition loss J_i/bc, given by
where r(·) denotes the residual, ||·|| denotes the ℓ2 norm and N(·) denotes the number of collocation points (subscripts g for governing equation, I for initial condition, nb for Neumann boundary, and db for Dirichlet boundary). The total physics loss J_p is defined as
$$J_p = J_g + \beta\, J_{i/bc},$$
where β > 0 is a user-defined weighting coefficient for the initial and boundary condition loss. Notably, including measurement data makes the fluid flow modeling data-driven; this is, however, not a prerequisite. The architecture of the proposed PINN for fluid dynamics simulation is presented in Fig. 1. In this paper, no measurement data from simulations or physical experiments are used for training the PINN.
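The way the mixed-variable residuals feed the physics loss can be illustrated with a small, self-contained sketch. It is not the authors' TensorFlow implementation: central finite differences stand in for automatic differentiation, an analytic plane Poiseuille flow stands in for the network output {ψ, p, σ}, each loss term is assumed to be a mean of squared residuals, and the steady, gravity-free residual set is one reading of the momentum equation, the constitutive law and p = −tr σ/2. All function names are hypothetical; β = 2 and μ = 2×10⁻² kg/(m·s) are taken from the steady example.

```python
import random

MU, RHO, BETA = 0.02, 1.0, 2.0   # viscosity, density, boundary-loss weight
H = 1e-3                         # finite-difference step (stand-in for autodiff)

def poiseuille(x, y):
    """Exact plane Poiseuille flow in mixed variables (psi, p, s11, s22, s12)."""
    psi = y**2 / 2 - y**3 / 3    # u = d(psi)/dy = y (1 - y), v = 0
    p = -2.0 * MU * x            # pressure drop balancing the viscous stress
    return psi, p, -p, -p, MU * (1.0 - 2.0 * y)

def ddx(f, x, y):
    """Central difference in x of every component returned by f."""
    return [(a - b) / (2 * H) for a, b in zip(f(x + H, y), f(x - H, y))]

def ddy(f, x, y):
    """Central difference in y of every component returned by f."""
    return [(a - b) / (2 * H) for a, b in zip(f(x, y + H), f(x, y - H))]

def velocity(fields, x, y):
    """u = d(psi)/dy, v = -d(psi)/dx, so the velocity is divergence-free."""
    return ddy(fields, x, y)[0], -ddx(fields, x, y)[0]

def residuals(fields, x, y):
    """Steady, gravity-free residuals of the mixed-variable formulation."""
    _, p, s11, s22, s12 = fields(x, y)
    vel = lambda a, b: velocity(fields, a, b)
    u, v = vel(x, y)
    u_x, v_x = ddx(vel, x, y)
    u_y, v_y = ddy(vel, x, y)
    ds_dx, ds_dy = ddx(fields, x, y), ddy(fields, x, y)
    return [
        RHO * (u * u_x + v * u_y) - (ds_dx[2] + ds_dy[4]),  # x-momentum
        RHO * (u * v_x + v * v_y) - (ds_dx[4] + ds_dy[3]),  # y-momentum
        s11 - (-p + 2 * MU * u_x),                          # constitutive, 11
        s22 - (-p + 2 * MU * v_y),                          # constitutive, 22
        s12 - MU * (u_y + v_x),                             # constitutive, 12
        p + 0.5 * (s11 + s22),                              # p = -tr(sigma)/2
    ]

def mean_sq(rows):
    """Mean over points of the squared l2 norm of each residual vector."""
    return sum(sum(r * r for r in row) for row in rows) / len(rows)

def physics_loss(fields, interior, wall):
    j_g = mean_sq([residuals(fields, x, y) for x, y in interior])
    j_bc = mean_sq([velocity(fields, x, y) for x, y in wall])  # no-slip: u = v = 0
    return j_g + BETA * j_bc

def wrong_pressure(x, y):
    """Same flow with an inconsistent pressure gradient, for comparison."""
    psi, p, _, _, s12 = poiseuille(x, y)
    return psi, 1.5 * p, -1.5 * p, -1.5 * p, s12

rng = random.Random(0)
interior = [(rng.random(), rng.random()) for _ in range(200)]
wall = [(rng.random(), y) for y in (0.0, 1.0) for _ in range(20)]

loss_exact = physics_loss(poiseuille, interior, wall)
loss_wrong = physics_loss(wrong_pressure, interior, wall)
print(f"exact fields: {loss_exact:.2e}, wrong pressure: {loss_wrong:.2e}")
```

The exact fields give a loss near zero (limited by the finite-difference step), while the inconsistent pressure field is penalized through the momentum residual. In a PINN, the same quantity would be minimized with respect to the network weights and biases.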
In this section, we employ the proposed PINN to model the steady and transient flows past a circular cylinder. A parabolic velocity profile is applied on the inlet while the zero pressure condition is applied on the outlet, as shown in Fig. 2. No-slip conditions are enforced on the wall and cylinder boundaries. Gravity is ignored in both cases. The proposed PINN is implemented in TensorFlow [13] and the source code can be found at https://github.com/Raocp/PINN-laminar-flow.
For the steady case, the dynamic viscosity and density of the fluid are 2×10⁻² kg/(m·s) and 1 kg/m³, respectively. The normal velocity profile is defined as
Fig. 1. Architecture of the physics-informed neural network for fluid dynamics. Note that α is a user-defined weighting coefficient. w and b are the weights and biases of the DNN. The constraints of the initial and boundary conditions can be converted into residuals that are added to the loss function based on Lagrange multipliers. The data loss J_d is present only when data is available.
Fig. 2. Diagram of the computation model
with U_max equal to 1.0 m/s, which results in a small Reynolds number so that the flow is laminar. A total of N_g = 50000 collocation points, which includes N_db = 1200 Dirichlet boundary (cylinder, wall, inlet) points and N_nb = 200 Neumann boundary (outlet) points, are generated using Latin hypercube sampling (LHS) for training the network. It should be noted that the collocation points are refined near the cylinder to better capture the details of the flow. A grid search strategy is used to find an optimal combination of depth and width for the network. The relative ℓ2 error defined by
is used as the metric for comparison, where f is the physical quantity of interest, and M is the total number of reference points. The Adam [14] and limited-memory BFGS (L-BFGS) [15] optimizers are employed to train the DNN due to their good convergence speed demonstrated in the tests. We also implement the traditional scheme for fluid dynamics employed in Refs. [5, 6], where the stream function and pressure {ψ, p} act as the output variables. From the relative ℓ2 errors of the velocity field (see Table 1), it can be seen that the 8×40 network achieves the best result among all the configurations. The mixed-variable PINN improves the accuracy of numerical results over the traditional PINN.
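A relative ℓ2 error of this kind can be computed as in the sketch below. It assumes the common definition (the ℓ2 norm of the difference between predicted and reference values, divided by the ℓ2 norm of the reference values, over the M reference points); the function name and the sample values are hypothetical and are not data from the paper.

```python
import math

def relative_l2_error(predicted, reference):
    """Relative l2 error between two equal-length sequences of field values."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    diff_sq = sum((p - r) ** 2 for p, r in zip(predicted, reference))
    ref_sq = sum(r ** 2 for r in reference)
    return math.sqrt(diff_sq / ref_sq)

# Hypothetical values at M = 4 reference points (not data from the paper).
u_ref = [1.0, 2.0, 2.0, 4.0]
u_pinn = [1.0, 2.0, 2.0, 4.5]
err = relative_l2_error(u_pinn, u_ref)
print(f"relative l2 error: {err:.4f}")  # relative l2 error: 0.1000
```

For a vector quantity such as the velocity field, the sums would run over all components at all reference points.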
The velocity and pressure fields predicted by the PINNs with the mixed-variable and traditional schemes are shown in Fig. 3(b, c).
Table 1 Relative ℓ2 errors (unit: 10⁻²) of the velocity field for different DNN configurations with β = 2 (left: the traditional scheme; right: the proposed mixed-variable scheme)
The reference solution is obtained from the ANSYS Fluent 18.1 package (finite volume-based) [16] (see Fig. 3a). It can be observed that the PINN with the traditional scheme fails to model the flow. In particular, the traditional scheme fails to enforce the no-slip condition on the lower and upper boundaries. However, the steady velocity and pressure fields are well reproduced by the PINN with the mixed-variable scheme. It is worth mentioning that the pressure distribution on the cylinder surface is typically of interest for computing the resultant drag and lift forces. Therefore, we compare the pressure distributions obtained by the two types of PINN and ANSYS Fluent, as shown in Fig. 4. The overall agreement between the mixed-variable PINN and ANSYS Fluent is very good.
We also compare the performance of these two schemes under various values of β, which controls the weight of the boundary condition loss. As shown in Fig. 5, the convergence of the traditional scheme is significantly affected by β, though the final loss can be reduced by increasing the value of β up to 10. However, the mixed-variable scheme yields consistent results for various values of β. The improvement of the mixed-variable scheme is attributed to the reduced order of derivatives required to construct the loss function, in comparison with the traditional scheme [5, 6], which makes the optimization problem easier.
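The derivative orders can be made explicit. The sketch below assumes the two-dimensional stream-function convention u = ∂ψ/∂y, v = −∂ψ/∂x (consistent with [u, v, 0] = ∇ × [0, 0, ψ]) and lists the highest derivative of each network output that enters the residuals; it is an illustration rather than a restatement of the authors' derivation.

```latex
\begin{aligned}
&\text{Traditional scheme, outputs } \{\psi, p\}: \\
&\qquad \nabla^{2}u = \frac{\partial^{3}\psi}{\partial x^{2}\,\partial y} + \frac{\partial^{3}\psi}{\partial y^{3}}, \qquad
\nabla^{2}v = -\frac{\partial^{3}\psi}{\partial x^{3}} - \frac{\partial^{3}\psi}{\partial x\,\partial y^{2}}
\quad\Rightarrow\quad \text{third-order derivatives of } \psi. \\[4pt]
&\text{Mixed-variable scheme, outputs } \{\psi, p, \boldsymbol{\sigma}\}: \\
&\qquad \nabla\cdot\boldsymbol{\sigma} \ \Rightarrow\ \text{first-order derivatives of } \boldsymbol{\sigma}, \qquad
\nabla\mathbf{v} = \begin{bmatrix} \psi_{xy} & \psi_{yy} \\ -\psi_{xx} & -\psi_{xy} \end{bmatrix}
\ \Rightarrow\ \text{second-order derivatives of } \psi.
\end{aligned}
```

Each additional derivative order requires another pass of automatic differentiation through the network, so lowering the highest order from three to two plausibly reduces both the cost and the stiffness of the loss.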
The transient flow with the same computation domain depicted in Fig. 2 is considered in this case. The dynamic viscosity of the fluid is μ = 5×10⁻³ kg/(m·s) while the density is ρ = 1 kg/m³. The time duration for the modeling is 0.5 s. Three virtual pressure probes P1(0.15, 0.2) m, P2(0.2, 0.25) m and P3(0.25, 0.2) m are installed on the surface of the cylinder. The flow is initially still while a time-varying parabolic inlet velocity profile is applied subsequently, which is defined as
Fig. 3. Velocity and pressure fields of the steady flow past a circular cylinder. (a) Reference solution from ANSYS Fluent. (b) Mixed-variable scheme solution with the 8×40 network. (c) Traditional scheme solution with the 8×40 network. The hyperparameters and collocation points for training these two PINNs are kept the same.
Fig. 4. Distribution of pressure on the cylinder. An 8×40 network and β = 2 are used.
Fig. 5. Comparison of convergence curves with respect to the coefficient β. An 8×40 network is used in all the cases. Each network is trained for 10000 iterations with the Adam optimizer, followed by the L-BFGS optimizer.
Fig. 6. Transient normal velocity profile.
where U_max equals 0.5 m/s and the period T is 1.0 s. The remaining boundary conditions are the same as those in the previous example. The inflow velocity as a function of t and y is visualized in Fig. 6. The width and depth of the network are selected to be 50 and 7, respectively, while the coefficient β is set to 2. A total of N_g = 120000 collocation points, which include N_db = 9600 points on the cylinder, wall and inlet boundaries, N_nb = 3200 points on the outlet, and N_I = 3500 points at the initial time, are used to train the network.
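Latin hypercube sampling of spatiotemporal collocation points can be sketched with the standard library alone. This is a minimal illustration, not the sampling code used by the authors: the domain extents X_MAX and Y_MAX are hypothetical placeholders (the text does not state them), the cylinder centre and radius are inferred from the three probe locations on its surface, and any refinement of points near the cylinder is not reproduced. Only the 0.5 s duration is taken directly from the text.

```python
import random

def latin_hypercube(n, bounds, seed=0):
    """Draw n points; each dimension is split into n strata with one point per stratum."""
    rng = random.Random(seed)
    columns = []
    for lo, hi in bounds:
        strata = list(range(n))
        rng.shuffle(strata)  # random pairing of strata across dimensions
        columns.append([lo + (hi - lo) * (k + rng.random()) / n for k in strata])
    return list(zip(*columns))

T_END = 0.5                  # modeling duration in seconds (from the text)
X_MAX, Y_MAX = 1.0, 0.4      # hypothetical domain extents in metres
CX, CY, R = 0.2, 0.2, 0.05   # cylinder inferred from probes P1, P2 and P3

samples = latin_hypercube(2000, [(0.0, T_END), (0.0, X_MAX), (0.0, Y_MAX)])
# Discard points that fall inside the solid cylinder.
collocation = [(t, x, y) for t, x, y in samples
               if (x - CX) ** 2 + (y - CY) ** 2 > R ** 2]
print(len(samples), len(collocation))
```

Boundary and initial-condition points would be sampled separately on the corresponding curves and at t = 0, for example by fixing one coordinate and sampling the others in the same way.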
Fig. 7. Snapshots of the PINN-predicted transient flow fields past a circular cylinder.
Fig. 8. Pressure time histories at probes (a) P1, (b) P2 and (c) P3.
Three snapshots of the predicted flow fields are presented in Fig. 7, which shows the evolution of the flow as the inlet velocity increases over time. The reference flow fields obtained by ANSYS Fluent are not shown here since the PINN-predicted results match them very well. The pressure time histories at the three probes obtained from the proposed PINN are compared with those from ANSYS Fluent, as depicted in Fig. 8. It can be seen that the proposed PINN approach predicts the pressure time histories in the transient flow well.
We propose a mixed-variable PINN scheme for modeling fluid flows, with particular applications to incompressible laminar flows. The salient features of the proposed scheme include (1) employing the general continuum equations together with the material constitutive law rather than the derived Navier–Stokes equations, and (2) using the stream function to ensure the divergence-free condition of the flow in a mixed-variable setting. The comparison study indicates the benefits (high accuracy and good trainability) of the proposed mixed-variable scheme. In both the steady and transient flow cases, the results produced by the PINN show good agreement with the reference numerical solutions.
It should be noted that the applications in this paper are limited to laminar flows at low Reynolds numbers, although the approach is in theory applicable to turbulent flows at large Reynolds numbers. However, such flows require sampling the computation domain with a much denser set of collocation points, which will lead to computer memory issues and drastically increase the computational cost. Our future work aims to address this challenge by developing a "divide-and-conquer" training scheme in the context of transfer learning, that is, to divide the time domain into multiple steps and re-train the network partially while fixing the weights and biases from the previous step [17].
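For orientation, the Reynolds numbers of the two examples can be estimated from the stated fluid properties. The estimate below assumes a cylinder diameter D = 0.1 m, inferred from the probe coordinates P1, P2 and P3 lying on the cylinder surface, and uses U_max as the velocity scale. Other conventions (e.g., the mean inlet velocity, 2U_max/3 for a parabolic profile) give proportionally smaller values, and the instantaneous value in the transient case depends on the time-varying inlet profile.

```latex
\begin{aligned}
Re &= \frac{\rho\,U_{\max}\,D}{\mu}, \\
Re_{\text{steady}} &= \frac{1 \times 1.0 \times 0.1}{2\times10^{-2}} = 5, \qquad
Re_{\text{transient}} = \frac{1 \times 0.5 \times 0.1}{5\times10^{-3}} = 10 .
\end{aligned}
```

Both values sit well inside the laminar regime for flow past a circular cylinder, consistent with the scope stated above.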
Theoretical & Applied Mechanics Letters, 2020, Issue 3