GSoC 2025 Project #18 :: Automatic Bayesian Inference over Virtual Brain Models using Probabilistic Programming Languages (350h)

Mentors: Meysam Hashemi <meysam.HASHEMI@univ-amu.fr> and Daniele Marinazzo <daniele.marinazzo@gmail.com>

Skill level: Intermediate/advanced

Required skills: At least intermediate coding skills. Comfortable with Python and JAX.

Experience with probabilistic inference and tools such as NumPyro or PyMC would be beneficial.

Time commitment: Full time (350 hours)

Forum for discussion

About: Bayesian inference is indispensable for hypothesis testing and uncertainty quantification in understanding complex brain (dys)function and cognition. Meanwhile, Virtual Brain Models, implemented through neuroinformatic tools like The Virtual Brain (TVB), have gained significant popularity due to their potential for clinical translation. Bayesian inference on Virtual Brain Models translates into probabilistic estimation of latent and observed states within systems driven by network input and stimuli, modeled by high-dimensional nonlinear differential equations with potentially correlated parameters. To address these challenges, advanced MCMC sampling and inference algorithms embedded in Probabilistic Programming Languages (PPLs) have shown remarkable results. In particular, gradient-based algorithms such as the No-U-Turn Sampler, as well as automatic Laplace approximation, have proven effective at achieving reliable Bayesian inference even in the presence of multimodal parameter distributions. These methods have been successfully applied in contexts such as dynamic causal modeling of event-related potentials (see here) and inferring seizure propagation at the whole-brain scale (see here). However, significant challenges persist for fMRI BOLD data inference at whole-brain scales, e.g., in resting state. Addressing these challenges requires reparameterization techniques to improve convergence, as well as implementation in high-level tools like NumPyro or PyMC to streamline the inference process. The aim of this project is to extend the automatic Bayesian estimation methods available in PPLs such as NumPyro or PyMC to enable estimation of bifurcation parameters from fMRI data at large scales. By leveraging existing Python packages and prior expertise, the expected outcomes will significantly advance Bayesian inference for VBMs, addressing current limitations and enhancing scalability for studying brain dynamics and supporting clinical applications.
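
As a brief illustration of what such a workflow looks like in practice, here is a minimal sketch of gradient-based sampling with NUTS in NumPyro, assuming a deliberately tiny toy model; the priors and names (G, sigma) are illustrative assumptions, not TVB or project code.

```python
# Minimal sketch (illustrative only): posterior sampling with NUTS in NumPyro.
# The toy model, priors, and names (G, sigma) are assumptions, not project code.
import jax.random as random
import numpyro
import numpyro.distributions as dist
from numpyro.infer import MCMC, NUTS

def toy_model(y_obs=None):
    # A scalar coupling-like parameter and observation noise with weakly informative priors
    G = numpyro.sample("G", dist.LogNormal(0.0, 1.0))
    sigma = numpyro.sample("sigma", dist.HalfNormal(1.0))
    numpyro.sample("y", dist.Normal(G, sigma), obs=y_obs)

rng_key = random.PRNGKey(0)
y_fake = 1.5 + 0.1 * random.normal(rng_key, (100,))  # synthetic observations
mcmc = MCMC(NUTS(toy_model), num_warmup=500, num_samples=1000)
mcmc.run(rng_key, y_obs=y_fake)
mcmc.print_summary()  # posterior means, credible intervals, n_eff, r_hat
```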

Aims:

  • Improve the implementation of Virtual Brain Models using JAX-based frameworks, such as NumPyro or PyMC, to enhance efficiency and scalability, validated in silico
  • Develop reparameterization techniques to decorrelate parameters, facilitating improved gradient calculations for convergence, and visualize the outcomes (a minimal sketch appears after this list)
  • Benchmark existing algorithms within NumPyro or PyMC to systematically identify their strengths and weaknesses in handling high-dimensional, multimodal problems
  • Monitor algorithm convergence and provide comprehensive guidelines for ensuring reliable inference in the presence of multimodal parameter distributions
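
To illustrate the reparameterization and convergence-monitoring aims, the following is a minimal sketch assuming a toy hierarchical model whose names (mu, tau, theta) and priors are illustrative rather than project code. NumPyro's LocScaleReparam handler rewrites the centered prior into its non-centered form, which typically decorrelates the parameters seen by the sampler, and print_summary reports r_hat and effective sample sizes for convergence monitoring.

```python
# Hedged sketch: non-centered reparameterization and convergence diagnostics in NumPyro.
# The hierarchical toy model and names (mu, tau, theta) are illustrative assumptions.
import jax.random as random
import numpyro
import numpyro.distributions as dist
from numpyro.infer import MCMC, NUTS
from numpyro.infer.reparam import LocScaleReparam

def hierarchical_model(n=8, y_obs=None):
    mu = numpyro.sample("mu", dist.Normal(0.0, 5.0))
    tau = numpyro.sample("tau", dist.HalfNormal(5.0))
    with numpyro.plate("regions", n):
        # Centered prior; the reparam handler below rewrites it as
        # theta = mu + tau * theta_base with theta_base ~ Normal(0, 1).
        theta = numpyro.sample("theta", dist.Normal(mu, tau))
        numpyro.sample("y", dist.Normal(theta, 1.0), obs=y_obs)

noncentered_model = numpyro.handlers.reparam(
    hierarchical_model, config={"theta": LocScaleReparam(centered=0)}
)

rng_key = random.PRNGKey(0)
y_fake = random.normal(rng_key, (8,))
mcmc = MCMC(NUTS(noncentered_model), num_warmup=500, num_samples=1000,
            num_chains=2, chain_method="sequential")
mcmc.run(rng_key, y_obs=y_fake)
mcmc.print_summary()  # r_hat close to 1 and large n_eff indicate convergence
```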

Website: The Virtual Brain: Delivering practical results for novel clinical applications.

Tech keywords: Python, probabilistic ML/AI

Hello Dr. @mhashemi and Dr. @Daniele_Marinazzo,

My name is Aaron Kim, and I am a third-year Applied Mathematics student at Texas A&M University. I am particularly interested in probabilistic modeling, Bayesian inference, and computational neuroscience, and I would love the opportunity to contribute to your project.

To provide some background, I have experience with Python, JAX, NumPyro, machine learning, and data analysis. At the Brain Networks Laboratory at Texas A&M, I am assisting with research on computational modeling of neural architectures, including recurrent and dual-pathway CNNs inspired by the V1 cortex. Additionally, my work at the Air Force Research Laboratory involves time-series anomaly detection and sequential data modeling, where I have gained exposure to probabilistic inference techniques and time-series forecasting.

I’m eager to learn more and prepare, whether through research papers, existing implementations, or preliminary materials. I’d appreciate your guidance on the best way to get started.

Best regards,
Aaron Kim

To be more precise,

We have implemented a DCM model in NumPyro/PyMC using Hamiltonian Monte Carlo and Laplace methods:

DCM_ERP_PPLs Repository
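
For orientation before diving into the repository, here is a minimal sketch of how NUTS sampling and an automatic Laplace approximation can be run on the same NumPyro model. The toy model and variable names are illustrative assumptions, not the repository's DCM code.

```python
# Hedged sketch: NUTS sampling and automatic Laplace approximation on one toy model.
# This is not the repository's code; the model and names are illustrative.
import jax.random as random
import numpyro
import numpyro.distributions as dist
from numpyro.infer import MCMC, NUTS, SVI, Trace_ELBO
from numpyro.infer.autoguide import AutoLaplaceApproximation

def toy_model(y_obs=None):
    theta = numpyro.sample("theta", dist.Normal(0.0, 5.0))
    sigma = numpyro.sample("sigma", dist.HalfNormal(1.0))
    numpyro.sample("y", dist.Normal(theta, sigma), obs=y_obs)

rng_key = random.PRNGKey(0)
y_fake = 2.0 + 0.5 * random.normal(rng_key, (200,))

# 1) Hamiltonian Monte Carlo (NUTS)
mcmc = MCMC(NUTS(toy_model), num_warmup=500, num_samples=1000)
mcmc.run(rng_key, y_obs=y_fake)
mcmc.print_summary()

# 2) Automatic Laplace approximation (Gaussian fit around the MAP estimate)
guide = AutoLaplaceApproximation(toy_model)
svi = SVI(toy_model, guide, numpyro.optim.Adam(0.01), Trace_ELBO())
svi_result = svi.run(rng_key, 2000, y_obs=y_fake)
laplace_samples = guide.sample_posterior(
    random.PRNGKey(1), svi_result.params, sample_shape=(1000,)
)
```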

The main goal is to scale this work to larger and more complex models for fMRI, such as the Montbrio model.

This model has already been implemented in another tool using a simulation-based inference method:

Montbrio Implementation Example

Our objective is to implement a whole-brain network of the Montbrio model in NumPyro or PyMC and estimate parameters such as the global coupling parameter G. Simply put:

reproduce Figure S8 from this paper, but using NumPyro/PyMC.
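
To make the target concrete, below is a minimal sketch of a whole-brain network of Montbrio (Montbrió-Pazó-Roxin) neural masses in JAX. The parameter values, the linear coupling term G * (W @ r), and the simple Euler integration are illustrative assumptions rather than the final implementation; in the project the simulated activity would additionally be mapped through a BOLD observation model and wrapped in a NumPyro/PyMC likelihood so that G can be inferred.

```python
# Hedged sketch: whole-brain network of Montbrio-Pazo-Roxin neural masses in JAX.
# Parameter values, coupling form, and Euler integration are illustrative assumptions.
import jax
import jax.numpy as jnp

def mpr_derivatives(state, W, G, eta=-4.6, J=14.5, Delta=0.7, tau=1.0):
    """Right-hand side of the MPR mean-field equations for all nodes at once."""
    r, v = state  # firing rate and mean membrane potential, each of shape (n_nodes,)
    coupling = G * (W @ r)  # network input via the structural connectivity W
    dr = (Delta / (jnp.pi * tau) + 2.0 * r * v) / tau
    dv = (v ** 2 + eta + J * tau * r + coupling - (jnp.pi * tau * r) ** 2) / tau
    return jnp.stack([dr, dv])

def simulate(W, G, n_steps=1000, dt=0.01):
    n = W.shape[0]
    state0 = jnp.stack([jnp.full(n, 0.1), jnp.full(n, -2.0)])

    def euler_step(state, _):
        state = state + dt * mpr_derivatives(state, W, G)
        state = state.at[0].set(jnp.clip(state[0], 0.0))  # keep firing rates non-negative
        return state, state[0]  # carry the state, record the rates

    _, rates = jax.lax.scan(euler_step, state0, None, length=n_steps)
    return rates  # (n_steps, n_nodes) firing-rate time series

# Example: a small random 4-node connectome
key = jax.random.PRNGKey(0)
W = jax.random.uniform(key, (4, 4))
rates = simulate(W, G=0.5)
```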

Let me know if you’re interested or have any questions!

Hi Dr. @mhashemi,

Thank you for the clarification and for sharing these resources! I’m very interested in working on this.

To start, I’ll review the DCM_ERP_PPLs repository and the Montbrio implementation example to understand the existing workflows. Are there specific parts of the codebase or documentation that you’d recommend I prioritize to align with the project’s main objectives?