Training a DNN with stochastic inputs with analysis of the robustness against input process and weight matrix fluctuations

x(t) € is the input process. y(t) € is the desired output process. It may for example be obtained from the input via nonlinear distortions with additive noise:

y(t) = ^(^(t)) + v(t)

The DNN output , assuming that it has just one layer is

z(t) = f(Wx(t) + b)

where W e RJWxd is the weight matrix, b € R‘v/ is the bias vector and f : R‘v/ -> is a nonlinear map. If it has two layers then the output will be of the form

z(t) = f2(W2f1(W1x(t) + + b2)

More generally, if it has L layers, then

Z(t) = /L(lVL/L-l(WT-l/£-2(..J2(VV2/l(IVl^(i) + h) + M-) + *>l)

We write this as

z(t) = f(W,b,x(t))

where

W = (W1,...,WL),fe=(b1,...,6L)

Suppose the bias b is fixed so we can write

z(t) = f(W,x(t))

We choose W so that

E[G/(t)-z(t))2]

is a minimum. Assume that this minimum is attained when W = Wo. Now suppose Wo gets perturbed slightly to Wo + JW and simultaneously the input process x(t) gets perturbed slightly to x(t) + 8x(t). We wish then to design the weight matrix perturbation fiW so that the mean square error E(y(t) — /(Wo + fiW, x(t) + fe(t)))2 is still a minimum. Using Taylor series, we expand

/(Wo + d'W, x + fe) = ea;p((JW Vlv) + (fe, V,))/( Wo, x)

« /(Wo, rro) + [(¿W, VW) + (fe, VJ]/(Wo, ¿’o)

+(1/2)[(<5W, Vvv) + (fix, Vx)]2f(W0,x0)

In this approximation, we have retained only upto quadratic order terms in <5W, fix. Then,

E(d'W) = E(y(t) - /(Wo + SW, x(t) + fe(t)))2

« E[(e(t) - ((¿W, Vvv) + (fe(t), Vx.))/(w0, x(t)) -(1/2)[(d'W, Vlv) + (fix, VJ]2/(W0, x(t))]2

Quantum Boltzmann equation

An approximate derivation. The mixed N particle state p(t) Є is

assumed to have all same marginals of any given order. Thus, if k > 1 and ¿i < ¿2 < ••• < ik < N and (Ji,..., jN-i) is the complement of {«i,..., ik), then we are assuming that

Pi2...k(t) = Trk+1....,N(p(.t)) = Trjl.....jN_k)(p(t))

The Hamiltonian of the system is

N

Н = ^На+ E

a=l l

where Ha acts in Ha while Vah acts in Tia Here, 'Ha,a = 1,2, ...,N are identical copies and so are Vai>, 1 < a < b < N. The Schrodinger-Von-Neumann evolution equation for the density pit) is

ip ) = [H,p(t)]

This gives on taking partial trace over 2,3,.... N,

ip'^t) = [H^p^t)] + (AT - l)Tr2[V12,P12(t)]

Again taking the trace over 3,4,..... N gives

*Ріг(0 = l^i + H2 + Vi2,pi2(t)] + (AT — 2)Тгз[1із + І23,Р12з(і)]

We write

Pi2Pi(0 <8>Pi(t) + 912(0,

P123 = P12 ® Pl + 9123 = Pl ® pl <8> pl + 912 ® Pl + 9123

Then,

*/12(0 = Up'i ® pl + Pi ® Pl + 912') =

[Hi + H‘2 + Vi2, p ® pi + 9i2] + (A — 2)Тгз[Тіз + +23,pi2 ® pi + 9133]

Using the equation of motion for pi, this gives the following exact equation followed by the approximate equation based on treating 9123 as small and of the second order of smallness as compared to Vai>,

19'12 = [Hi + H2 + +12,912] + (Лг — 2)Тгз[кіз + +23,912 ® pi + 9123]

~ [Hi + H2 + +12,912] + (A — 2)Тгз[Уіз + +23, Pi ® pi ® pi +912 <8> pi]

= [Hi + H2 + +12,912] + (Ar - 2)7V3[Vi3 + +23,912 ® Pi]

If we no regard 912 as the same order of smallness as Vab, then this last equation further approximates to

*P12 №1 + H2 + +12,912]

which gives

£12^) = exp{-itad{Hx + H2 + V12))(pi2(0)) = Ti2(t)(gi2(0))

say. Then, the equation for pi becomes with this approximation

ip'i = [Hi, pi] + (N - l)Tr2[Vi2,pi ® pi + 512]

= [Hi, pi] + (N - l)Tr2[Vi2,pi <8> pi] + (H - l)Tr2[V12,T12(i)(i?i2(0))]

This may be termed as the quantum Boltzmann equation.

List of Ph.D scholars supervised by Har-ish Parthasarathy with a brief summary of their theses

[1] S.N.Sharma: Applications of nonlinear filtering theory to certain problems in classical mechanics.

(NSIT-DU, 2004)

Summary: The stochastic two body gravitational problem in the presence of interplanetary dust modeled using Brownian motion has the form

r"(t) - r(t)0'[1] (t) = -GM/r1 + A(t),

r(t)6»"(i) + = fe(t)

where fr,fe are white noise processes, ie, formal differentials of Brownian motion. These equations are cast in the from of four coupled nonlinear Ito stochastic differential equations and mean and covariance propagation equations for this system are obtained by expanding the variables r(t), 0(t), r'(t), 0'(t) about their mean values and applying Ito’s formula for Brownian motion to obtain the covariance propagation equations. Further, by taking noisy measurements on the position of the body, the EKF is applied to obtain real time estimates of the trajectory. Sharma then applies the Kushner nonlinear filtering theory to the nonlinear Van-der-Pol and Duffing oscillator taking cubic corrections into account, ie, he goes a step further than the EKF which is based on expanding upto quadratic terms. More accurate filtering results are obtained by considering the joint evolution of the first three conditional moments.

phase algorithms using stochastic differential equations driven by Brownian motion.

(NSIT-DU, 2009)

[4] Sudipta Majumdar:Modeling and parameter estimation in nonlinear transistor circuits using Volterra approximations combined with wavelet based compression for data storage for the purpose of estimation.

(NSIT-DU, 2010)

[5] Arthi Vaish:Finite element method for determining the modes in waveguides having various kinds of cross section and with inhomogeneous and anisotropic media filling the guide taking into account background gravitational perturbations in the form of a curved space-time metric.

(NSIT-DU, 2011)

[6] Akash Rathee:Study of higher harmonic generation in nonlinear transistor circuits using Fourier series and perturbation theory.

(NSIT-DU, 2011)

[7] Raj eev Srivastava: Image modeling, smoothing and enhancement using partial differential equations with emphasis on diffusion equations with intensity dependent diffusion matrix coefficient.

(NSIT-DU, 2009)

[8] Rajveer S.Yaduvanshi: Magneto-hydrodynamic antenna construction analysis using Navier-Stokes and Boltzmann kinetic transport equation.

(NSIT-DU, 2010)

[9] Lalit Kumar: Studies in transmission line and waveguide analysis taking hysteresis and capacitive nonlinearities and quantum mechanical effects of transmission line and waveguide fields on atoms and quantum harmonic oscillators.

(NIT, 2018)

[10] Kumar Gautam: Quantum gate design by perturbing real quantum systems with electromagnetic fields.

(NSIT-DU, 2017)

[11] Rohit Singla: Studies in robot trajectory tracking and dynamic parameter estimation in the presence of noise and in master-slave teleoperation based tracking using adaptive control algorithms.

(Yet to submit, has published three technical papers in the impact factor-4 Springer journal ’’Nonlinear Dynamics”)

[12] Navneet Sharma: Quantum parameter estimation using search algorithms with applications to quantum communication and quantum gate design.

Given an unperturbed Hamiltonian Hq and a time dependent perturbation operator V(t) = ^fcVfc(t), the problem is to estimate the parameter vector 0 = (0fc) based on measuring observables Xa, a = 1, 2,..., n on the system state at different times taking into account the collapse postulate of quantum mechanics. For example, let U(t|0) denote the unitary evolution:

p

iU'(t, s|0) = (Ho + E Wt))U{t, s|0), t > s, k=l

U(s,s) = I

Let {Ma : a = 1,2, ..., r} be a POVM. Then, if p(0) is the initial state of the system, the probability of measuring the outcomes a-i,02,...,as at times ¿1 < Î2 < ••• < ts is given by

F(ai, ...,as,ti,...,ts) =

where

Ea = x/M^.

We then estimate 9 by applying the maximum likelihood method to this probability.

(Yet to defend, all examiner reports recommend award of degree)

[13] Pravin Malik: Antenna design using numerical solution of integral equations arising from the boundary conditions on the antenna surface.

[14] Manisha Khulbe: Studies in electromagnetic wave propagation in inhomogeneous, anisotropic and field dependent (nonlinear) media with applications to estimating the medium parameters from discrete measurements of the electromagnetic field at different space-time points. Applications to antenna design for wave propagation in nonlinear media are also considered. For this, perturbative expansions of the medium permittivity and permeability as Taylor series in the electric and magnetic fields as well as expansion of the field independent coefficients of the permittivity and permeability in terms of basis functions is performed. These expansions are substituted into the Maxwell equations to obtain a sequence of linear equations for each perturbative order. Boundary conditions of the electromagnetic fields on the antenna surface are applied to derive integral equations for the induced surface current density.

  • [1] Vipin Behari Vats: Parameter estimation algorithms in nonlinear systems using nonlinear LMS algorithm with a study of the behaviour of the Lyapunov exponents of autonomous nonlinear systems for small initial perturbations around a fixed point. (NSIT-DU, 2007) 2 Tarun K.Rawat: Applications of stochastic nonlinear filtering theory to trajectory maneouvering of spacecrafts and convergence analysis of least mean
 
Source
< Prev   CONTENTS   Source   Next >