Add ability to make n-way tables #22

mbcann01 · 2020-02-15T02:44:10Z

Overview

Currently, I freqtables will only create one- and two-way tables. It will not create n-way tables. We want to add the ability to create n-way tables.

What I had in mind was something like:

demo_nih %>% 
    freqtables::freq_table(ethnicity_nih, race_nih, sex_nih)

However, what I've been doing in the meantime is:

make_table_section <- function(cat) {
  demo_nih %>% 
    filter(ethnicity_nih == cat) %>% 
    freqtables::freq_table(race_nih, sex_nih) 
}

And then:

purrr::map_dfc(
  .x = c("Not Hispanic or Latino", "Hispanic or Latino", "Unknown/Not Reported"),
  .f = make_table_section
)

Obviously, this is more verbose, but it gets the job done and is very versitile (e.g., user can return a list instead of a data frame). However, the spirit of freqtables isn't really to be the most "versitile" package. It's to be the easiest to use "out of the box" for 85%+ of normal use. Give this some thought.

The suggestion from a user on RStudio Community could also be useful:

mtcars %>% 
  gather(variable,category,cyl,vs,am,factor_key = TRUE)%>%
  group_by(variable,category)%>%
  summarize(n=n())

Left off at

2023-03-17: Working on test.Rmd as part of #40.

I created two data files for comparing freqtables with Stata and SAS.
The data files are called /inst/extdata/freq_study.dta and /inst/extdata/freq_study.xpt.
These data files are created using data-raw/study.R.
I also created a do file - /inst/extdata/compare_freqtables.do - and a SAS script - /inst/extdata/compare_freqtables.sas.
I added all of these files to buildignore.

2020-06-11: Created test.Rmd on the plane to Minnesota to test out different ways of doing this. test.Rmd is git ignored and build ignored.

Tasks

Complete one, two, and n-way tables in Stata (/inst/extdata/compare_freqtables.do). Use them for comparison.
Complete one, two, and n-way tables in SAS (/inst/extdata/compare_freqtables.sas). Use them for comparison.
Figure out how you want freq_tbl to treat n-way tables.
Figure out how you want freq_table to treat n-way tables.
Figure out how you want freq_test to calculated stats for n-way tables.

The text was updated successfully, but these errors were encountered:

mbcann01 · 2020-06-14T16:11:59Z

Here's how Stata does it:

mbcann01 · 2020-06-14T16:13:42Z

2020-06-15:
We may want to distinguish between an n-way freq_table (shows overall n, prop, ci like Stata) and a grouped_by n-way freq_table that uses row n's and percents instead.

mbcann01 · 2020-06-23T21:33:07Z

2020-06-11 - Notes from while I was on the plane:

If you allow group_by to work again, then you may need to change the descriptive analysis vignette.
If you allow 3-way tables then you will have to change some of the wording in the descriptive analysis vignette. Specifically, in Bivariate percentages and 95% log transformed confidence intervals.
If you change the row/column terminology to group/subgroup terminology then you will have to change some of the wording in the descriptive analysis vignette.

Row/column labels

What is the best way to create a contingency table? Then row/column makes sense.
group level 1, group level 2, group level 3
group, subgroup 1, subgroup 3

mbcann01 · 2020-06-24T01:11:00Z

Do this in a new branch

mbcann01 · 2020-07-21T01:59:39Z

So, meantables uses group_by. Then, the output is labeled "response_var" and "group_var". It might be worth considering keeping this consistent.

One var

mtcars %>% freq_table(am)

Two+ vars

mtcars %>% group_by(mpg) %>% freq_table(am)

Could even have "response_var" (or something similar) and "group_var" in table of results.

mbcann01 · 2020-08-19T18:07:20Z

Needed it on the Sun Study for this (as an example):

map_student %>% 
  filter(!is.na(ss_application_f)) %>% 
  freq_table(period_f, teacher_f, ss_application_f)

Tested this in stata with: by period_f, sort : tabulate teacher_f ss_application_f, chi2. It returns this:

Tried it in SAS using :

proc freq data=map_student;
	tables period_f * teacher_f * ss_application_f;
run;

Which returned

I also tried proc surveyfreq, but that won't return chisq for three-way tables.

mbcann01 added this to In progress in Bug fixes and enhancements Jun 1, 2021

mbcann01 added this to In progress in n-way tables Jun 1, 2021

mbcann01 removed this from In progress in Bug fixes and enhancements Jun 1, 2021

mbcann01 added this to To do in Bug fixes and enhancements Dec 6, 2021

mbcann01 mentioned this issue Jul 30, 2022

Use group_by with freq_table #40

Open

11 tasks

mbcann01 removed this from To do in Bug fixes and enhancements Jul 31, 2022

mbcann01 changed the title ~~Add ability to make 3-way tables~~ Add ability to make n-way tables Mar 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ability to make n-way tables #22

Add ability to make n-way tables #22

mbcann01 commented Feb 15, 2020 •

edited

mbcann01 commented Jun 14, 2020

mbcann01 commented Jun 14, 2020

mbcann01 commented Jun 23, 2020

mbcann01 commented Jun 24, 2020

mbcann01 commented Jul 21, 2020

mbcann01 commented Aug 19, 2020 •

edited

Add ability to make n-way tables #22

Add ability to make n-way tables #22

Comments

mbcann01 commented Feb 15, 2020 • edited

Overview

Left off at

Tasks

mbcann01 commented Jun 14, 2020

mbcann01 commented Jun 14, 2020

mbcann01 commented Jun 23, 2020

mbcann01 commented Jun 24, 2020

mbcann01 commented Jul 21, 2020

One var

Two+ vars

mbcann01 commented Aug 19, 2020 • edited

mbcann01 commented Feb 15, 2020 •

edited

mbcann01 commented Aug 19, 2020 •

edited