• Home
  • About
    • Anirban photo
    • About Me
    • Email
  • Blog Posts
    • Writings
    • Tags
  • Skill Set

data.table work diary: Mar 18 - 22

  • Mar 18, ‘24:

1) After some debugging, I solved the issues with the other examples that Doris worked on.

2) Added more test cases for checking on historical performance regressions in data.table.

3) PR #3 on my fork replicates the case for #4440.

  • Mar 19, ‘24:

1) Create a blog post (the file) regarding my work on the GitHub Action, to be published @Raft.

2) Trying to dig and understand the parallelization code (OpenMP) for documentation purposes.

  • Mar 20, ‘24:

1) Trying my best to thoroughly research the parts where OpenMP is used in the different C files (17 as per this filter), the objective of those for loops under #pragma omp parallel for ... for each of those 12 types of functions, the R functions that come from it, factors regarding speedup, and whatever detail I can provide. Going to send a PR to update the man page code and collect feedback on what I wrote.

  • Mar 21, ‘24:

1) Solved an issue related to using git.

2) Making decent progress - completed between(), CJ(), fcoalesce(), fifelse(), nafill(), types.c. Reading fread and fwrite (long files).

  • Mar 22, ‘24:

1) Finished writing about the remaining test cases (took more time to understand forder) and completed my version of the documentation of OpenMP use in data.table, sent a PR.