You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

318 lines
17 KiB

4 years ago
\section{Discussion}
\label{sec:discussion}
In the following sections, we reiterate on our findings presented in the last
section and try to derive answers regarding our seven hypotheses and research
question \textit{``Does an adjusted actuation force per key have a positive
impact on efficiency and overall satisfaction while using a mechanical
keyboard?''}.
\subsection{Impact of Actuation Force on Typing Speed}
\label{sec:dis_speed}
Our main experiment yielded that there are differences in typing speed for both
metrics related to transcribed text we measured―namely \glsfirst{WPM} and
\glsfirst{AdjWPM}. Especially the keyboard with the lowest uniform actuation
force of 35\,g―\textit{Nyx}―performed worse than all other keyboards. In terms
of \gls{WPM}, \textit{Nyx (35\,g)} was on average 4.1\,\% slower than
\textit{Athena (80\,g)} and \textit{Aphrodite (50\,g)} and 4.8\,\% slower than
the adjusted keyboard \textit{Hera (35 - 60\,g)}. Similarly, for \gls{AdjWPM},
\textit{Nyx} was 4.3\,\% slower than \textit{Athena} and \textit{Aphrodite} and
4.9\,\% slower than \textit{Hera}. The 4\,\% to 5\,\% difference in \gls{WPM}
and \gls{AdjWPM} in our sample account for approximately two words per
minute. When extrapolated with the mean daily keyboard usage of 6.69 hours
reported by our participants, this difference would be as big as 803 words,
which when put into perspective, is equivalent to roughly two full pages of only
written content (11pt font size). Although, this specific example would assume
constant typing for 6.69 hours, it is still a useful estimate of the loss in
productivity under normal working conditions over the course of several
days. These differences in \gls{WPM} and \gls{AdjWPM} could be explained by the
higher error rates and thereby the loss of ``typing flow'' we discuss in the
next section. \gls{KSPS} reflects the raw input speed by including backspaces
and previously deleted characters. The reason we included \gls{KSPS} in our
analysis was to reveal possible differences in the physical speed participants
type on a keyboard and not to further asses speed in the sense of
productivity. We could not find any statistically significant differences in
\gls{KSPS} but saw a trend, indicating that subjects typed a bit slower (<
3\,\%) on \textit{Athena (80\,g)} compared to \textit{Aphrodite (50\,g)} and
\textit{Hera (35 - 60\,g)}. With the differences in metrics that are commonly
used to measure typing speed more closely related to productivity (\gls{WPM},
\gls{AdjWPM}) and the trends that indicate a slight difference in operating
speed we could have accepted our hypothesis. However, with the relation between
error rate and typing speed described in the next section and the thereby rather
indirect effect of the actuation force, we can only partially accept our
hypothesis that a difference solely in actuation force, has an impact on typing
speed.
\begin{phga_hyp*}[1 $\rightarrow$ \cmark\kern-1.1ex\raisebox{.7ex}{\rotatebox[origin=c]{125}{--}}]
Actuation force has an impact on typing speed (efficiency - speed).
\end{phga_hyp*}
% During our telephone interviews 76\,\% of respondents would have preferred a
% keyboard with lighter actuation force.
% Our study tried to present the participant with a typing scenario that is as
% close to a typical text input situation as possible, by allowing but not
% enforcing the correction of erroneous input.
\subsection{Impact of Actuation Force on Error Rate}
\label{sec:dis_error}
As already briefly mentioned in Section \ref{sec:dis_speed}, measured error
rates like \glsfirst{UER}, \glsfirst{CER} and \glsfirst{TER} differed especially
between \textit{Nyx (35\,g)} and the other test keyboards. The statistical
analyses further revealed that \textit{Athena}, the keyboard with the highest
actuation force of 80\,g, produced on average 1\,\% less \gls{TER} than
\textit{Hera (35 - 60\,g)} and \textit{Aphrodite (50\,g)} and 3\,\% less than
\textit{Nyx (35\,g)}. Furthermore, \textit{Hera} and \textit{Aphrodite} both had
a 2\,\% lower \gls{TER} than \textit{Nyx}. Additionally to the quantitative
results, fourteen of the twenty-four participants also reported, that
\textit{Nyx's} light actuation force was the reason for many accidental key
presses. It further stood out that, as shown in Figure \ref{fig:max_opc_ter},
\textit{Athena} was the most accurate keyboard for 58\,\% of participants and
also more accurate than keyboard \textit{Own} for eleven of the
subjects. Overall, this concludes that a higher actuation force has a positive
impact on error rate.
\begin{phga_hyp*}[2 $\rightarrow$ \cmark]
Higher key actuation force decreases typing errors compared to lower key
actuation force (efficiency - error rate).
\end{phga_hyp*}
\textbf{Impact of \gls{TER} on \gls{WPM}}
The higher error rates and the possibility to correct erroneous input could have
also been a factor that led to lower \textit{WPM}. To evaluate the likelihood of
this additional relation we conducted a \gls{LRT} of fixed effects for our
linear mixed-effects model with two random effects (participant and first/second
typing test), fixed effect \gls{TER} and response variable \gls{WPM}. The
results of the \gls{LRT} ($\chi^2(1)$ = 110.44, p = 0.00000000000000022)
together with the trends of lower \gls{WPM} with increasing \gls{TER}, visible
in Figure \ref{fig:reg_ter_wpm}, suggest that the \gls{TER} indeed had an
impact on \gls{WPM}. This could have been because every time an error was made,
almost all participants decided to correct it right away. With a higher error
rate, this obviously leads to many short interruptions and an increased number
of characters that are not taken into account when computing the \gls{WPM}
metric.
\begin{figure}[H]
\centering
\includegraphics[width=1.0\textwidth]{images/reg_ter_wpm}
\caption{Regression lines for the relation between \gls{TER} and \gls{WPM}.
The trends indicate a decrease in \gls{WPM} with rising \gls{TER} and
therefore the existence of a relation between the two metrics}
\label{fig:reg_ter_wpm}
\end{figure}
\subsection{Impact of Actuation Force on Satisfaction}
\label{sec:dis_sati}
We tried to narrow down the rather broad term ``satisfaction'' to individual
categories that we, with the information gathered through our literature review
and telephone interviews, defined as necessary for a positive user experience
while using a keyboard \cite{giese_sati}. We decided for the following metrics
to evaluate whether or not a user experience with a keyboard that features
lighter actuation forces was more satisfactory:
\begin{table}[H]
\centering
\ra{1.0}
\small
\begin{tabular}{l}
$\rightarrow$ Pragmatic scale from the \glsfirst{UEQ-S} \\
$\rightarrow$ Score of the additional question \textit{``How satisfied have you been with this keyboard?''}\\
$\rightarrow$ Results of the \glsfirst{KCQ}\\
$\rightarrow$ Ranking of the keyboards during semi-structured interview\\
$\rightarrow$ Ratio of positive and negative feedback for each keyboard during semi-structured interview\\
\end{tabular}
\end{table}
\textbf{[\xmark] Pragmatic Scale (\gls{UEQ-S})}
As described in Section \ref{sec:res_ueqs}, we could not find statistically
significant differences for any of the test keyboards regarding the pragmatic
scale of the \gls{UEQ-S}. From visual assessment of the graph shown in Figure
\ref{fig:ueq_tkbs_res} we could conclude that there is a slight trend towards a
more positive rating for keyboards that utilized keyswitches with higher
actuation forces than \textit{Nyx (35\,g)}. This trend in the opposite direction
of our hypothesized outcome that lighter actuation force leads to more user
satisfaction, could be due to the longer familiarization time required for
keyboards with very light actuation force \cite{gerard_keyswitch}.
\textbf{[\xmark] Additional Question of Satisfaction with Keyboard}
The results deduced from the additional question \textit{``How satisfied have
you been with this keyboard?''}, which were answered on a \glsfirst{VAS}
from 0 to 100 after both tying tests with a keyboard, suggested that \textit{Nyx
(35\,g)}, the keyboard with the lightest actuation force and also
\textit{Athena (80\,g)} the keyboard with the highest actuation force, were rated
significantly worse than \textit{Aphrodite (50\,g)}. Additionally, \textit{Hera
(35 - 60\,g)}, the adjusted keyboard, showed a trend towards a significantly
better rating than \textit{Nyx}. These results indicate that neither of the
keyboards with extreme actuation forces were perceived as a overwhelmingly
pleasant keyboard to use during our typing tests. This is further supported by
the visualization of the mean ratings in Figure \ref{fig:res_tkbs_sati} where
the average ratings for \textit{Aphrodite} and \textit{Hera} were approximately
10 points higher than those for \textit{Nyx} and \textit{Athena}.
\textbf{[\xmark] Keyboard Comfort Questionnaire (\gls{KCQ})}
For the \gls{KCQ} we found several statistically significant differences. For
questions related to effort or fatigue while operating a keyboard,
\textit{Athena (80\,g)} received significantly lower ratings than the other test
keyboards. Additionally to the measured differences in error rates discussed in
Section \ref{sec:dis_error}, we discovered that participants also perceived the
accuracy of \textit{Athena (80\,g)} and \textit{Aphrodite (50\,g)} higher
compared to \textit{Nyx (35\,g)}. Similarly to the results discussed in the last
paragraph, the scores of the two keyboards with extreme actuation forces,
\textit{Nyx (35\,g)} and \textit{Athena (80\,g)}, fluctuated quite a bit and, on
average both keyboards scored lower than \textit{Aphrodite (50\,g)} or
\textit{Hera (35 - 60\,g)} (Figure \ref{fig:kcq_tkbs_res}). Thereby, these
results do not indicate a clear trend towards enhanced user experience when
using keyboards with lower actuation forces.
\textbf{[\xmark] Post Experiment Ranking of All Keyboards}
The ranks in terms of favored test keyboard, provided by all twenty-four
participants during the post-experiment semi-structured interview, can be
observed in Figure \ref{fig:tkbs_ranking}. The results further support the
tendencies towards keyboards with medium actuation forces, that we already
observed in the last couple paragraphs.
\begin{figure}[H]
\centering
\includegraphics[width=0.8\textwidth]{images/tkbs_ranking}
\caption{Rankings for only the test keyboards, gathered during the
post-experiment interview. It was possible to rank two or more keyboards the
same}
\label{fig:tkbs_ranking}
\end{figure}
\textbf{[\xmark] Ratio of Positive and Negative Feedback}
Lastly, we analysed all recordings of the post-experiment interviews and
categorized the feedback given for each keyboard into positive and negative
responses. We then calculated a ratio of these responses, which can be seen in
Figure \ref{fig:ratio_interview}, to evaluate preferences towards specific
keyboards that could not be expressed by our participants through any other
supplied method during the experiment. Like all other factors we identified as
reasonable indicators for satisfaction, these ratios yielded, that neither
\textit{Athena (80\,g)} nor \textit{Nyx (35\,g)} received more positive than
negative feedback. It should be noted, that previous research has shown that
people tend to remember and process bad experiences more thorough than good
ones, which could be a reason for the increased number of negative feedback for
\textit{Nyx} and \textit{Athena}, but would also indicate a worse experience
with those two keyboards \cite{baumeister_bad}.
\begin{figure}[H]
\centering
\includegraphics[width=0.80\textwidth]{images/ratio_interview}
\caption{The ration of $\frac{Positive Responses}{Negative Responses}$ during
the semi-structured interview for all test keyboards}
\label{fig:ratio_interview}
\end{figure}
\textbf{Conclusion}
Contrary to the responses of our preliminary telephone interview, where 76\,\%
of attendees preferred a keyboard with light actuation force, none of the
factors we defined as relevant for user satisfaction suggests that keyboards
with lower actuation force are more satisfactory to use than keyboards with
higher actuation force. Therefore, we have to fully reject our hypothesis. We
can conclude thought that keyboards with actuation forces in between those two
extremes (35\,g / 80\,g), are persistently perceived as more pleasant to use and
that ratings of keyboards with extreme actuation forces are highly influenced by
personal preference. The latter is indicated by the high fluctuation of almost
all responses regarding our evaluated factors.
\begin{phga_hyp*}[3 $\rightarrow$ \xmark]
Keys with lower actuation force are perceived as more satisfactory to type
with than keys with higher actuation force.
\end{phga_hyp*}
\subsection{Impact of Actuation Force on Muscle Activity}
\label{sec:dis_emg}
In contrast to other studies that suggested that actuation force has an impact
on muscle activity, we could not identify significant differences in terms of \%
of \glsfirst{MVC} for any of our \gls{EMG} measurements. Only a slight trend
that \textit{Nyx (35\,g)} produced the highest flexor \%\gls{MVC} for only
14\,\% of participants, could be interpreted as anecdotal evidence towards our
hypothesis that actuation force has an impact on muscle activity. Therefore, we
have to reject our hypothesis.
\begin{phga_hyp*}[4 $\rightarrow$ \xmark]
Differences in actuation force influence muscle activity while typing.
\end{phga_hyp*}
%\subsection{Impact of an Adjusted Keyboard on Typing Speed, Error Rate and
% Satisfaction}
\subsection{Implications for the Adjusted Keyboard}
\label{sec:dis_hera}
As discussed in the previous sections, there were no statistically significant
differences in terms of satisfaction for any of the test keyboards, including
our adjusted keyboard \textit{Hera}. Still, the rather unconventional design
choice of non-uniform actuation forces across a keyboard did not negatively
influence the satisfaction compared to \textit{Aphrodite}, which was often
perceived as equivalent to the participant's own keyboard. In fact,
\textit{Hera} was the keyboard with the most occurrences in the top three, tied
first place with \textit{Aphrodite} and was never ranked 4th place during the
post-experiment interview (Figure \ref{fig:tkbs_ranking}). Since \textit{Hera},
among others, utilized keyswitches with light actuation force (35\,g), the
satisfaction could improve during prolonged usage, because of the longer
familiarization period required by keyboards with lighter actuation forces
\cite{gerard_keyswitch}. Interestingly, participant \textit{I3Z4XI7H} who
reported a currently present disease of the left arm and wrist (Syndrome Sudeck,
complex regional pain syndrome (CRPS)), ranked Hera 30 points higher than all
other keyboards. \textit{I3Z4XI7H} also reported in the post-experiment
interview that \textit{Hera} was surprisingly pleasant to use and that pain was
significantly lower than with all other keyboards including
\textit{Own}. However, because of the nearly identical scores to
\textit{Aphrodite} in almost all categories, we have to reject our hypothesis
that an adjusted keyboard is more satisfactory to use than standard keyboards.
\begin{phga_hyp*}[7 $\rightarrow$ \xmark]
An adjusted keyboard is perceived as more satisfactory to type with compared
to standard keyboards.
\end{phga_hyp*}
Similarly, the resulting error rates measured for \textit{Hera (35 - 60\,g)}
were close to equal to the results of \textit{Aphrodite (50\,g)} and for speed
related metrics between those two keyboards only slight improvements while using
\textit{Hera} in \gls{WPM} (0.8\,\%), \gls{AdjWPM} (0.6\,\%) and \gls{KSPS}
(1\,\%)― that were not statistically significant―were recorded during our
experiment. It was still interesting to see, that \textit{Hera}, out of all four
test keyboards, was the fastest for eleven (45\,\%) out of the twenty-four
subjects and that albeit the usage of 30\,\% keyswitches\footnote{That were
actually pressed during our typing tests} that required 35 - 40\,g actuation
force, which is similar to the actuation force of \textit{Nyx (35\,g)}, we did
not see comparably high error rates. Because of the lacking evidence, that an
adjusted keyboard produces less errors or supports the typist in achieving
higher typing speeds, we have to reject our two hypotheses regarding those
improvements.
\begin{phga_hyp*}[5 $\rightarrow$ \xmark]
An adjusted keyboard improves typing speed compared to standard keyboards
(efficiency - speed).
\end{phga_hyp*}
\begin{phga_hyp*}[6 $\rightarrow$ \xmark]
An adjusted keyboard is perceived as more satisfactory to type with compared
to standard keyboards.
\end{phga_hyp*}
Our experiment basically revealed that keyboards which utilized keyswitches
with actuation forces that were neither too light (35\,g) nor too heavy (80\,g),
generally outperformed keyboards which featured those extreme actuation
forces. In the following section, we elaborate on possible limitations of our
experimental design and future research that could be reasonable to further
investigate advantages and disadvantages of adjusted keyboard designs.
% ---