Abstract
This paper describes how cloud computing tools widely used in the instruction of data scientists can be introduced and taught to economics students as part of their curriculum. The demonstration centers around a workflow where the instructor creates a virtual server and the students only need Internet access and a web browser to complete in-class tutorials, assignments, or exams. Given how prevalent cloud computing platforms are becoming for data science, introducing these techniques into students’ econometrics training would prepare them to be more competitive when job hunting, while making instructors and administrators re-think what a computer laboratory means on campus.
Funding source: Emory University
Acknowledgments
We thank the editor and an anonymous referee for helpful comments that improve the readability and exposition of the paper. We also thank Amazon Web Services (AWS) Educate and Stata Corporation for providing us with a cloud classroom with credits and temporary Stata lab licenses for testing, respectively. AWS is Emory University’s preferred and recommended cloud service for faculty-led computational needs. Handel, Jacho-Chávez, and Rea acknowledge financial support from the Department of Economics at Emory University. The views expressed in this article are those of the authors. No responsibility for them should be attributed to the Bank of Canada. All remaining errors are the responsibility of the authors.
References
Athey, S., and L. Michael. 2019. “Economists (and Economics) in Tech Companies.” Journal of Economic Perspectives 33(1): 209–30. URL https://www.aeaweb.org/articles?id=10.1257/jep.33.1.209.10.3386/w25064Search in Google Scholar
Bryan, J. 2018. “Excuse Me, Do You Have a Moment to Talk About Version Control?.” American Statistician 72(1): 20–7, https://doi.org/10.1080/00031305.2017.1399928.Search in Google Scholar
Cicero, M. T. 44 BCE. “De Divinatione, II.(2).4.” In Loeb Classical Library (1923), English translation by W. A. Falconer, Cicero, Vol. 20, 375, as transcribed by Bill Thayer, https://penelope.uchicago.edu/Thayer/E/Roman/Texts/Cicero/de_Divinatione/2*.html#R2 (retrieved on September 23, 2020).Search in Google Scholar
Fiksel, J., L. R. Jager, J. S. Hardin, and M. A. Taub. 2019. “Using GitHub Classroom to Teach Statistics.” Journal of Statistics Education 27(2): 110–19, https://doi.org/10.1080/10691898.2019.1617089.Search in Google Scholar
Hansen, B. 2020. Econometrics. Retrieve from https://www.ssc.wisc.edu/∼bhansen/econometrics/.Search in Google Scholar
Ho, A. T. Y., K. P. Huynh, D. T. Jacho-Chavez, and D. Rojas. forthcoming. “Data Science in Stata 16: Frames, Lasso, and Python Integration.” Journal of Statistical Software.10.18637/jss.v098.s01Search in Google Scholar
Jupyter Project, B. Douglas, D. Bourgin, A. Brown, M. Bussonnier, J. Frederic, B. Granger, T. Griffiths, J. Hamrick, K. Kelley, M. Pacer, and L. Page. 2019. “nbgrader: A Tool for Creating and Grading Assignments in the Jupyter Notebook.” Journal of Open Source Education 2(11): 32, https://doi.org/10.21105/jose.00032.Search in Google Scholar
Kaplan, D. 2018. “Teaching Stats for Data Science.” American Statistician 72: 89–96, https://doi.org/10.1080/00031305.2017.1398107.Search in Google Scholar
Koehler, F. J., and S. Kim. 2020. “Interactive Classrooms with Jupyter and Python.” The Mathematics Teacher 111: 304, https://doi.org/10.5951/mathteacher.111.4.0304.Search in Google Scholar
Perkel, J. M. 2018. “Why Jupyter is Data Scientists’ Computational notebook of Choice.” Nature 563(7729): 145–6, https://doi.org/10.1038/d41586-018-07196-1.Search in Google Scholar
Popescu, D. A., N. Zilberman, and A. W. Moore. 2017. Characterizing the Impact of Network Latency on Cloud-Based Applications’ performance. Tech. Rep. UCAM-CL-TR-914. University of Cambridge, Computer Laboratory. URL https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-914.pdf.Search in Google Scholar
Stackoverflow. 2019. Stack Overflow’s Annual Developer Survey. Online. URL https://insights.stackoverflow.com/survey/2019.Search in Google Scholar
Wikipedia. 2020a. Distributed Version Control. Wikipedia. URL https://en.wikipedia.org/wiki/Distributed_version_control.Search in Google Scholar
Wikipedia. 2020b. Graphical User Interface. Wikipedia. URL https://en.wikipedia.org/wiki/Graphical_user_interface.Search in Google Scholar
Wikipedia. 2020c. HTML. Wikipedia. URL https://en.wikipedia.org/wiki/HTML.Search in Google Scholar
Wikipedia. 2020d. Integrated Development Environment. Wikipedia. URL https://en.wikipedia.org/wiki/Integrated_development_environment.Search in Google Scholar
Wikipedia. 2020e. Jupyter Kernels. Wikipedia. URL https://en.wikipedia.org/wiki/Project_Jupyter#Jupyter_kernels.Search in Google Scholar
Wikipedia. 2020f. Live Coding. Wikipedia. URL https://en.wikipedia.org/wiki/Live_coding.Search in Google Scholar
Wikipedia. 2020g. Machine Learning. Wikipedia. URL https://en.wikipedia.org/wiki/Machine_learning.Search in Google Scholar
Wikipedia. 2020h. Markdown. Wikipedia. URL https://en.wikipedia.org/wiki/Markdown.Search in Google Scholar
Wikipedia. 2020i. Open Source. Wikipedia. URL https://en.wikipedia.org/wiki/Open_source.Search in Google Scholar
Wikipedia. 2020j. Programming Language. Wikipedia. URL https://en.wikipedia.org/wiki/Programming_language.Search in Google Scholar
Wikipedia. 2020k. Software Repository. Wikipedia. URL https://en.wikipedia.org/wiki/Software_repository.Search in Google Scholar
Wikipedia. 2020l. Virtual Machine. Wikipedia. URL https://en.wikipedia.org/wiki/Virtual_machine.Search in Google Scholar
© 2020 Walter de Gruyter GmbH, Berlin/Boston