RadLab data products enhancements

Hello all,

I am Rutuja Gurav and I was part of the FDL team that used RadLab data to forecast radiation exposure using ML. I have been in communication with @kirill who invited me to join this WG and he mentioned that moving forward the RadLab AWG wants to focus on applications alongside data acquisition efforts.

To that effect, based on my experience of using RadLab data for ML, @kirill and I discussed a couple of considerations for the next version of RadLab.

  1. Some application-agnostic data cleaning.
  2. Including a “Segments” table in the DB providing info about data availability and gaps on a per-instrument basis.

I have described this in some detail in these slides. Please reach out to me if this topic is of interest.

1 Like

Thanks for joining Rutuja! Tagging a few other @RLWG members: @j_miller @svcostes @ambrozova @calexyoung @l.lunati @vcdaoust

If you all aren’t familiar, @rutujagurav was part of this past summer’s team, described in a separate post:

1 Like

Hi @rutujagurav, happy to see you here! Thanks for a very fruitful discussion that we had. I have the segments table on my todo list now (it’s gonna benefit both the users and us internally, actually). As for application-agnostic data cleaning, this can undoubtedly be a value add. Data PIs already perform a level of cleaning before the data gets into the database (so – and that’s just semantics – actually what you’re referring to as “level 0” is, technically, already level 2). But preprocessing specifically for ML applications and having a common target spec for that is important – please keep me in these discussions and we welcome all suggestions!

2 Likes

When is the next RadLab @RLWG meeting? Is this summer challenge with Helio data going to be presented to an AWG anytime soon?

I’d love if you @kirill & @rutujagurav present the progress at the October ALSDA AWG (Oct 15, 8-9:30a) - you should see it on your Forum calendars

1 Like