RKWard
General info
The RKWard project is currently in the process of moving to KDE.org. See Incubator/Projects/Rkward. Current web site is still at Sourceforge.net: http://rkward.sf.net .
Feedback
This page is going to be a hub for RKWard related feedback and work-in-progress tracking. Might need subdividing over time.
Note that this is intended for things that may need discussing. Not for fully-specified feature requests. The latter should go to the tracker.
Installation issues.
Mac OS X
Abatty: I was surprised to see that there is a problem in OSX Yosemite, as most of my students use Macs (as do I), and I haven't heard any caterwauling from those who have updated. I know that the one student who actually seems capable of following screenshot-laden instructions for things in RKWard is running it under Yosemite seemingly without issue...
Abatty: The struggle I've run into with many Mac users, though, is just that they have to install R and then RKWard. This is very unintuitive for people who aren't used to installing anything more involved than the Facebook app on their cellphones. I.e., "digital native" undergrads. It would be great if those could be bundled together again.
Windows
Abatty: With the Windows package, the issue is that it's just a .zip and you can run it from anywhere. People really aren't used to that, and they don't know how to add a shortcut to the Start menu/giant-useless-screen-with-live-advertisements. Also, many people are confused/freaked-out when a scary-looking black terminal window comes up upon starting it. The Windows package also seems pretty fragile. Most of the Windows students have tossed it and reinstalled it at least once. It's intermittent, and I haven't been able to reproduce it, but it just stops working at some point. The terminal window comes up, and that's all. I'd give you guys more information if I had it. Just generally speaking, the Windows version is not as reliable as the Mac, I'm finding.
Data import
Abatty: This is just rather confusing. You go to File or the Open... menu in the toolbar, and go to Import, and then there are a bunch of options. You have "Import Data," then a line, then "Import format," which is a submenu about importing data. If you use the first one, it just opens up the SPSS data importer, and there's no way to tell it otherwise. I have started providing my classes with .sav files for everything to just spare myself the headache of pointing out again and again, that they actually need to go to "Import → Import format → Import text / CSV data." It doesn't make sense because the way that menu is organized, it looks like that is some sort of separate function from importing data, because it's under the line separating it from "Import Data."
Abatty: Once they navigate to "Import text / CSV data," however, the problems continue, because despite the fact that we've already ostensibly told the software that we're working with .csv or whatever, there are a bunch of options where we have to tell it again. The default format is "None," and the options for the quick formats (which are likely all anyone will need in most cases, especially in a class where a teacher is providing the data set) are on the bottom left, looking not-very-important. This could maybe be fixed just by moving elements around, like putting the format selection settings in a dropdown menu at the top, labeled with "Please select the format of your data" or something. Then rename "None" to "Custom" and put it at the bottom of the list.
Abatty: Every time I try to use a CSV in class, I import it in front of god and everyone on the projector, then demonstrate running the analysis, and say, "Okay, is everyone with me?" and half of the students' hands shoot up and I go look at their screens and they have the entire table in one column and the analysis won't run, of course.
Abatty: The quickest/easiest fix right away for this would be to just kill the "Import Data" entry, rename "Import format" to "Import data format" and move it above the line in the menu, separating it from the script options. Just doing that would at least avoid the problem of people finding themselves stuck in the SPSS format importer, trying to feed it CSV data or whatever.
Distributions dialogs
Abatty: I only started using these this semester, so I hadn't noticed, but these are unintuitive enough that I decided to just get the students to do them in the R Console.
Abatty: In the "r" dialogs (i.e., the ones that call rbinom and rgeom, etc.), the values that will be fed into R are filled in by default. Why? That seems to communicate, for example, that the "normal" thing to do when looking at a binomial probability is to set the number of "successes" to 0.95, and only 1 trial, and with a 50% probability, when actually, these are likely to all be different. I think these should just be blank.
Abatty: "Vector of quantiles" really ought to be renamed "Number of successes." I'll be honest here: I couldn't figure out what I was supposed to do here, so the only way I figured out how to use these dialogs was by clicking the "Code" button to see what commands it would be calling, and then reading the CRAN documentation on those functions. The CRAN documentation was much clearer.
Abatty: It would be nice if there was another calculator that called the "d" functions as well... Not that I've really ever used either of these, but they are pretty important for teaching undergraduate stats.
R Console
Abatty: It would be great if there were a way to permanently set text size there, or at least have a keyboard shortcut to change it. It's tiny by default, and the only way to get it bigger is to keep going up to "View" and clicking "Enlarge font" over and over. Then upon restart of RKWard, it's back to tiny again. This is another teaching issue more than anything. If I need the students to do something in R, it's very hard to demonstrate without bumping that way up, but it takes forever as I keep going up to that menu over and over until I see the students stop squinting at the projector screen!
Plotting issues of various kinds
Abatty: I can't seem to figure out how to make a grouped barplot out of the Plots → Barplot dialog. I've been telling my students to do it via the "N to 1 Crosstabulation" dialog, where it works fine.
Abatty: A Q-Q/Normality plot in the Scatterplot dialog would be awesome. The only place I can find that in RKWard seems to be one of the options in the Scatterplot Matrix dialog.
Abatty: It would be very helpful if the Scatterplot pulled axis labels from the Label header, or, if empty, the variable name itself, for the variable by default. Right now you have to enter labels manually. This results in me getting a lot of test answers labeled "Xrange" and "Yrange," which I still give full marks for if they are obviously the right variables.
Outputs
Abatty: Exporting outputs is the area of the most confusion for my students. The only option for getting things out of the output is to export the entire giant HTML file, which still just pulls graphics from the invisible .rkward directory. To get a histogram out to submit for my test (I give tests with Moodle), the process is:
1. Export the output to HTML. 2. Open the output in your browser. 3. Locate the desired graphic on the giant page of everything you've ever done with the software ever, unless you've flushed the output, but you were afraid to do that so it's still there. 4. Right-click the graphic and save it to a file. 5. Upload it to Moodle.
Abatty: A big part of this, of course, is that RKWard is running KDE, not the host OS, really. But this is a lot of steps for an undergrad or not-terribly-savvy user to complete correctly. Every time I've taught with RKWard, whether it be to undergrads or colleagues, everyone gets lost here. They can see the graphic right there in front of them. They don't understand why they can't just export it directly from the output (they'd love to just drag it out, but I point out that even SPSS doesn't do this, and JMP is a pain, too).
Abatty: Even I, as a pretty confident user of RKWard, find this to be a massive PITA, and what I do is just navigate to the .rkward directory and keep it open as I work, sorted by date, and just copy stuff out of there as needed. But even when I show people this workaround, they invariably MOVE things out of that directory and then cry, "Where did it gooooooo???" the next time they open RKWard.
Abatty: The whole system is extremely unwieldy and unintuitive, and what it means is that I get a lot of screenshots of RKWard on the tests. Even of the tabular outputs!
Abatty: This, I suspect, is a much bigger fix. What I would love to see, though, is that output HTML file and the associated images be in a folder with the workspace if saved, similar to how it works in SPSS, but less locked down and headache-inducing. So perhaps you open RKWard up to just play with some ideas, as I often do, and then opt to discard the workspace on close. While you're working on an unsaved workspace, the behavior is the same as now, but when you close it, the output files are also discarded. If you save the workspace, however, it creates a folder containing the R and RKWard workspace files, the same as now, but also a folder containing the outputs. Loading that workspace later will use that output folder instead. This solves the problem of an endless scroll of everything you've ever done with the software because if you even need one thing in there, you can't flush it, and makes it much easier to find things later. The current HTML-based output system is wonderful in that it can be opened in your browser and all the graphics files are already "exported" to PNG, but since they are locked away in an invisible directory, and necessarily chained to everything else you've ever done, it makes it very hard to get information out of it.
Abatty: Last comment about the outputs is one that has already come up here before: The tables are, by and large, very ugly. There needs to be more of a margin inside cells, and it would be nice if there were borders by default. Also, many of them have far, far too many decimal places. The contingency table/crosstabs are the worst for this. If any of the values need decimal places, they all get them. So even though most of the cells are actually just counts/sums, they all have approximately one million zeros behind them. It's just very hard to read.