About Equation (6) in the paper #1

Open
ZhaoRunyi opened this issue Jul 25, 2024 · 1 comment

ZhaoRunyi commented Jul 25, 2024

Hi! I'm reading your paper and am really fascinated by your idea that controllability, measured in Gramian form, equals the trajectory covariance; it exposes a promising way to treat the reinforcement learning problem as a traditional control theory problem.
However, I got stuck on your proof of this idea, namely Equation (6), shown in the screenshot below, in Supplementary Note 2.1 of your paper. The equation in the red frame confuses me, as it turns the product of two integrals into a single integral whose integrand is simply the product of the two integrands.

[Screenshot: Equation (6) from Supplementary Note 2.1, with the step in question framed in red]

I tried to use Green's theorem to prove it, only to find that P and Q are too hard to construct.
I am new to the field and I don't know whether it's appropriate to ask a question unrelated to the code through a GitHub issue, but I really can't find another way to contact you. If this issue bothers you, please let me know.
I hope to hear from you soon. Best regards!

@GabrielJardimPP

@ZhaoRunyi I believe this is Ito's isometry, but I am unsure because the notation is nonstandard for Ito processes.
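
For reference, if the framed step is indeed Itô's isometry, the general identity for deterministic, square-integrable f and g and a standard Wiener process W is E[(∫₀ᵀ f dW)(∫₀ᵀ g dW)] = ∫₀ᵀ f(t) g(t) dt. The expectation is what collapses the product of two stochastic integrals into one ordinary integral; it is not a Green's-theorem style manipulation. Below is a minimal Monte Carlo sketch that checks this identity numerically. The function name `ito_product_check` and the test integrands (cos t and exp(-t)) are illustrative assumptions and are not taken from the paper, whose notation may differ.

```python
import numpy as np

def ito_product_check(f, g, T=1.0, n_steps=200, n_paths=20000, seed=0):
    """Compare E[(int f dW)(int g dW)] with int f*g dt on [0, T].

    Hypothetical helper for illustration only; f and g must be
    deterministic functions that accept a NumPy array of times.
    """
    rng = np.random.default_rng(seed)
    dt = T / n_steps
    t = np.arange(n_steps) * dt  # left endpoints, as in the Ito construction
    # Independent Brownian increments, one row per simulated path.
    dW = rng.normal(0.0, np.sqrt(dt), size=(n_paths, n_steps))
    I_f = dW @ f(t)  # discretized stochastic integral of f, per path
    I_g = dW @ g(t)  # discretized stochastic integral of g, per path
    lhs = np.mean(I_f * I_g)         # sample estimate of the expectation
    rhs = np.sum(f(t) * g(t)) * dt   # Riemann sum of the ordinary integral
    return lhs, rhs

lhs, rhs = ito_product_check(np.cos, lambda t: np.exp(-t))
print(f"E[product of stochastic integrals] ~ {lhs:.4f}")
print(f"integral of f*g dt                 ~ {rhs:.4f}")
```

The two printed values should agree up to Monte Carlo error, because the cross terms E[dW_i dW_j] vanish for i ≠ j and only the diagonal terms, each contributing f(t_i) g(t_i) dt, survive.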
