ISYE-6414 - Regression Analysis

Search for a Course:

    Toggle to Select Spcific Semesters

    Reviews


    Semester:

    Easy, enjoyable class. Much of the content was familiar if you’ve already taken ISYE 6501, but this class does go far deeper into the math and assumptions behind the models. I almost never watched the lecture videos, instead I would download the slides and transcript and read along + take notes that way. Very easy to do well on the coding sections of HWs and exams, multiple choice is harder but weighted lower. For exams, I was generally able to ace the coding sections and get ~80 to 85% on the MC (with pretty minimal effort into studying) and secured an A in the class.


    Semester:

    This course has a bad rap, undeservedly in my opinion.

    1. People complain about the peer grading, but this class has a specific rubric to follow, unlike prior courses such as ISYE 6501 where the difference between a 100 and a 90 is a subjective “did this student appear to go above and beyond”. This is an improvement.

    2. Another complaint is the unfair MC. If English is not your first language, I can take these complaints legitimately. Otherwise, MC is 40% of the tests, meaning you can get a 75% on your MC section and still get an A on the test. In practice this is missing 6-7 questions out of 25-30. This is DOABLE. Those who complain about MC tanking their grades probably failed easy parts of the coding section and are looking for excuses.

    3. The Coding Sections. First MT, I finished with 2 hours to spare, Second MT w an hour, Final w an hour to spare. There are no gotchas. If you download the coding examples freely available in the documents section of Canvas and are comfortable with using ?function in R/thinking on your feet, you should be able to get close to a 95-100 on these sections.

    To conclude, don’t be scared by the reviews you may see here. The course is pretty good, not perfect, but certainly LIGHT YEARS better than the disaster that is 6501.


    Semester:

    A course where you will feel lost from the beginning, the lectures are not good and the exams do not show your knowledge of the subject, do not waste your money with this course.


    Semester:

    To those that hated the exams because it seems to be poorly written…

    Congratulations, you’ve been “Serbanated”.

    This exam teaches us how to be street-smart, and to be able to think critically, not just book-smart - something that you can memorize and regurgitate - that’s (ironically) true for B-track classes.

    Welcome to the art of the Analytics.

    Agree with the below reviewers that this class should be replaced in lieu of MGT 8803.


    Semester:

    tl:dr course is NOT THAT BAD, folks just need to step up their game. Im sorry but it is the truth.

    Before I took this course, I read the reviews here and I was mortified by what I read, so going into the course, I was worried.

    Now, it is a shame that this course gets as much hate as it does because it is not that bad. As an analyst at a big bank, I’ve got to be quick on my feet and new learn things as the need arises so when I think about the pacing of this course I found the material pacing to be pretty doable (compared to a course like ISYE-6501 where its just material in and material out week after week). This gave me enough time to actually go in-depth a little bit and understand more about the different topics like SLR, MLR, Stepwise, and the various Var Selection techniques. Seriously, they should do away with MGT-8803 as a mandatory course and swap it out with this one.

    Pros:

    1. Good pacing (sometimes too good, could be slightly sped up)
    2. Exam questions were fair (seriously, compared to Sokol’s questions, Serban’s were pretty tame and not that tricky, just have to read carefully and think critically, after all this is a graduate level course in Analytics)
    3. Coding section (not too difficult if you have experience in programming especially in R)
    4. FANTASTIC TAs. I must’ve lucked out and had a great group of TAs because they were awesome, and knowledgeable. (Seriously, one guy asked a lot of repetitive questions that could’ve been searched back in the notes, but the TAs were still patient and answered him. This isn’t a bad thing as it provides an opportunity for other learners to review the content so shoutout to him )
    5. Good balance of homeworks and knowledge checks

    Cons:

    1. Coding sections of the exam were too long. Not difficult, just long and tedious.
    2. I will side with everyone here and say that the videos were indeed very dull. Read the transcripts side by side with the powerpoints and review the powerpoints as you prep for exams.

    Disclaimer: I come from an Econ/stats background where two of my undergraduate courses were heavily focused on Statistics and Econometrics


    Semester:

    A lot of strong criticism of this course, and my thought the entire time was “this should be mandatory for this program.” I think this provides a very strong foundation of concepts that are quickly reviewed or glossed over in other courses. I understand the frustration on wording things, but in business settings, you need to be able to adapt and answer questions no matter how they are presented to you.

    I do think the material was dry at times, but the homeworks pulled no punches, they were straightforward and if you understand the material, and can write a little bit of R, I think this course is very fair and should be taken by everyone.

    My biggest recommendation is that people save this for the Summer term as taking it in the spring actually felt like there were “dead” zones of material. I think it would be perfectly paced to take in the summer and I wish I would have rearranged my courses to accomodate for that.


    Semester:

    Like what others said, the material is easy, I have learnt all the knowledges before. The key for success (A) is to be very good in English, so when there are tricky wordings, one has higher chance to figure it out. The wording game has been played everywhere in this course, I felt that the material is too easy that everybody could get an A, so there has to be a way to distinguish students, and testing proficiency of English become the solution.

    The way of wording games are for example, use fancy word for basic yes/no ideas, so if you don’t know the fancy word (most none native speaker), too bad. T/F question of if XYZ should be blue not red, where in fact XYZ does not exist in the first place, so answer is False… Multiple choice questions asking to select the True ones following another question asking to select the False ones and the very next question will switch to select True ones. Well, it is the students’ responsibility to read the questions carefully, but, to me, I felt this should not be played with Master’s students… I’d rather have some essay questions, or have some projects to work on.

    I’m expecting an A for this course, it does systematically brushed through the regression models, especially the assumptions and Gof parts of them. But I felt it could be a much better course than what it is now.


    Semester:

    Exams are truly poorly written. You can understand the material and regression but still miss questions because of the way the test is written. I found myself always having to think of cases in which certain questions were/weren’t true. They are asking true or false questions for things that should really be essay questions.

    The test don’t really test on knowledge or whether you actually learned anything, they are more so testing on whether you memorized the exact wording in the video. Video quality are really bad as well. I think the class needs an entire reboot.


    Semester:

    Really bad class. I learned way more about regression in DMSL and 6501 than I did in this class. The actual content isn’t hard, but the lectures are pretty much useless and the tests are unfair. Past students have compiled all the lectures into a single Google Doc and use that instead of watching the lectures.

    With regard to the tests, a lot of the questions are “gotcha” questions. As a simple example, a T/F question might say something like “Logistic regression is an unsupervised algorithm for classification problems.” The answer being false, since it’s supervised. This is not an actual or hard example, but the point being is that the questions will often hinge on a single word, so you must read very carefully - and if English is not your first language, good luck.


    Semester:

    I find the previous reviews of this course to be a little unfair. The lectures can be dry, and the audio quality is low, but people need to get out of here with saying stuff like “unwatchable” or “unlistenable”. The hyperbole says more about the reviewer than the class itself.

    The content of this class is exactly what you need for a regression class. They explore the more frequently used GLMs, explore the assumptions & the statistical underpinnings, and you apply these concepts in R. Pretty straightforward. If you are looking to become, or are working as a corporate data scientist, I consider these topics a must. My biggest problem with the class is that it doesn’t get into more advanced topics in regression.

    The course is not particularly difficult: if you complete the homeworks the open-book examinations should be straightforward. If you take your time with the self-assessments the closed-book exams should be manageable. I think the biggest complaint people have with the closed book is that the questions attempt to “trick” you by leading to a false solution. If English isn’t your first language I sympathize, otherwise I think this is par for the course with a university-level exam. Read each question carefully.


    Semester:

    Overall, I thought this course was just okay. I did my undergrad at Georgia Tech in Industrial Engineering and took a regression & forecasting class during that time and I think I learned more in that class.

    I definitely recommend just reading the lecture transcripts and following those along with the powerpoint slides over listening to the lectures. It is much faster this way and I took notes in a notebook from that exercise. It still does take quite some time to go through it all, though.

    The homeworks have two parts: a multiple-choice & true/false quiz and an R coding portion. For the MC/TF section, it’s super helpful to have the powerpoint slides/lecture transcripts in front of you to ctrl + f to find the relevant section. For the R coding section, for the most part, you can find similar code in the lectures slides; however, it does take a while to complete that portion of the homework, with both coding and writing out explanations to answer the questions. Definitely learn how to work R Studio (or Jupyter Notebook) in advance with knitting your assignment together so that you are prepared to turn your homework in on time.

    The tests are set up the same way as the homeworks: a MC/TF section (where you can make a crib sheet) and an R coding section (open book, meaning you can look at lecture notes, past homeworks, etc. but not open internet so you cannot google anything or go to stack overflow).

    There are a lot of complaints about the quality of the lecture material and I’d agree (like I said, I think I learned more in the GT undergraduate class in this topic). There’s also a lot of complaints about the unfairness of vague questions that could go either way (especially true/false questions). I think I’d agree here as well (it’s hard to truly gauge a student’s understanding with a true/false question since they have a 50% chance if they just guess).

    I would still recommend this class for a fairly easy statistics elective that is “doable” without too much pre-requisite knowledge.


    Semester:

    I really loved this course. I enjoyed the material a lot. I had to study quite a bit before each exam in order to prepare for her multiple choice and T/F questions, but to be honest, they’re not that difficult. I got an A in the course. I really liked the HW and exam structure because the questions are very very specific. They never leave your head scratching like “what kind of answer am I supposed to write here? What are they looking for?” like some other professors do that love giving open-ended questions and then people are forced to spend hours on piazza or going to office hours to figure out what the heck they want. I also liked that because the questions were so specific, there was not enough room for peers to take off points out of laziness or maliciousness like in some other courses were everything is left to the peers’ subjective opinions. I really hate courses structured that way. Thus, I loved Dr. Serban’s Regression Analysis course very much. She also gives lots of good advice throughout her lectures if you are paying attention: “do not this, avoid that, always do this, etc.” In fact, I might go over all the materials again as a refresher.


    Semester:

    This is a prime example of where the material (on paper) is incredibly important to analytics, but the delivery basically kills the packaging of this material as an actual class.

    Why is this class so horrible? Let me count the ways… It’s purely a money grab by the University. The teacher is non existent. The material is HORRIBLE!! The lectures are impossible to watch, and they are riddled with errors. I kid you not, there was an ever growing pinned post thread on Piazza for lecture (and other class material) errors. The lecture material was also BLOATED to no end.. Hundreds of pages of slides that had little to no sequential flow. The HWs were OK, but you had to constantly search the internet to find answers since the lectures were so poorly constructed.

    As for the exams… wow, what an archaic and outdated philosophy. The exams were divided into 2 parts. FIRST PART: Coding. Imagine having lecture material, HWs, practice exams and knowledge checks reinforce a certain subset of topics. Then, imagine an exam that tests on a completely different subset of topics… And then, imagine having to code like a mad person for 2-3 hours straight. Oh, and you can use all the material you need, you just can’t go online to figure out a coding item if you’re stuck… PART 2 is just unnecessary. This is the true/false and multiple choice piece of the puzzle. Again, the HWs, practice exams, etc. reinforce a certain subset of material. Then, the exam comes along, and it’s a bunch of topics (albeit in the vast wasteland of lecture material) that were never focused on. The worst part is that the questions are designed to trick you. You can tell that they spent considerable amount of time designing questions where it could go true or false (depending on the way the question was read), and the MC always had 2-3 correct answers (and the correct answer was vague and had to do with understanding the ‘trick’ in the question as opposed to understanding an actual analytics concept)..

    I know I read these reviews for practical advise, so here’s my 2 cents. If you want to know more about regression, read one of the many wonderful books on the topic. You’ll get more out of 4-5 hours of a good book on regression in analytics then this class will provide to you. If you need to take this class to fulfill an elective (no other choices interest you), then here’s the best way to survive this class. If you are good in R, then make sure you score 100% on the coding HWs. Get a good study partner to compare notes for the TF/MC parts of the HW. For the exams, make sure you study all the coding concepts, not just the ones that are the focus of the class activities. As for the TF/MC part of the Exams, make sure you make a robust cheat sheet (you get to have a double sides cheat sheet), for the questions could come from anywhere within the 100s of lecture slides. Make sure you read the question multiple times and look for the ‘trick’. Don’t get frustrated by the ridiculous nature of the exam. Seriously, the questions are sometimes as simple as “what is 2 plus 2?” But, there’s usually some trick in there..

    If you’re not great at R, make sure you do all the R exercises.. there are some in the lectures, but also lots of example code in the material provided. You’re on you own a bit, and the persons that put together the code leave little or no comments.. However, prepping the coding material prior to the exam will give you a shot at finishing within the time allotted… It is amazing how 30 mins can be eaten up pretty quick if you do not know a concept. Plus, many of the questions build upon previous code, and if you’re stuck on a question, you can’t always just move on.

    Good luck to all that have to take this horrendous class!!


    Semester:

    Pro:
    +Interesting subject matter
    +Coding Homework (can be done in R or python)
    +Good examples in class

    Con:
    -Not the best at explaining the material (ex. there are many similarities between simple linear regression and multiple linear regression, a single slide showing these similarities and few differences would have been great, instead she basically just repeated the lecture on simple linear regression verbatim, I actually thought I was watching the wrong video)
    -MC and TF homework and tests are made to trick you. I think because it is a masters class they wanted to make it hard but they don’t test it in a way that promotes learning.


    Semester:

    Horrible lecture materials. Videos are impossible to watch, had to read transcripts instead, which are based on the lectures so you get the same poorly structured sentences in English. TAs grade exams harshly and it takes them forever (why not do it like in 6203?). Never saw the professor on Piazza. Overall very confusing material. There is plenty of material on this subject online, but you have to read prof. Serban crappy lectures to get through the class. The course could have been better, but instead made so bad… That is not GT quality.


    Semester:

    Very poorly structured class; lectures are all over the place and divided into a bunch of different videos that are very difficult to follow/ very unorganized. Quizzes test very specific parts of the lectures instead of the broad concepts which is very frustrating. Questions on quizzes and exams are MEANT to trick you which never helped me retain a single thing. T/F questions will have the tiniest caveat that makes them False even though 99% of the statement is True. Response time on Piazza was absurdly long/sometimes nonexistent. Overall, I do not recommend this class.


    Semester:

    I rated this class as Medium only because the multiple choice questions on the homeworks, midterm, and final can be tricky. Also, if this is the first exposure to detailed regression analysis, it could be considered difficult. By the time I took this course, I not only had taken several courses involving detailed regression applications, but have also used it in my job.

    The lectures are at times like watching paint dry, but there are answers to the homework questions that may not be in the slides. Each of the homeworks has two parts: True/False/multiple choice and R Coding with explanation write-up. It really isn’t too much work, and I think is the perfect course to take over the summer. I took this with my practicum, and while it could be stressful around the midterm and final, I didn’t find the workload too heavy.

    One thing that can be good or bad is that all of the coding parts of the homework are peer graded. If you get a person with too much time on their hands, you can get shafted on this. The good thing is that homework is only 15% of the total grade.

    The midterm and final count for 40% and 45%, respectively, of the overall grade, so it is important to put in the time studying. We were allowed “cheat” sheets for both, and it helps to put in the effort on creating these study aids.

    Last tidbit I’ll add, is that the Professor does not seem to curve. So as long as you are okay with a 88.9% being a B, then I don’t see a reason not to take.


    Semester:

    Regression is one of the most important concepts in analytics. I’m glad I took this course after I was on the fence, as I learned a fair bit and (slightly) improved my skills with R. The course content is generally well designed and organized, with plenty of real-world examples offered in the lectures/slides.

    On the other hand, as others have mentioned, the lectures are presented very monotonously. A little enthusiasm (or, alternatively, room to go off-script sometimes) would have gone a long way. Some students only read the transcripts, and I understand why. No voice intonations or extra enthusiasm for interesting or important concepts was present. That was disappointing.

    I thought the homeworks and exams were very reasonable. The key is to make good cheat sheets for the closed-book portion, and a good R-code index for the open-book portion. I don’t recall any huge curveballs on the exams either, and you can have homework/practice exam solutions pulled up while you complete the open-book portion. With the lecture transcripts available, you can pretty much cntrl+f the homework questions.

    TLDR: good/important course content, poor lecture videos, reasonable homework and exams.


    Semester:

    Most of the Cons mentioned here are rather true. The wording in the lectures and typos in the transcripts can be frustrating. I stopped watching the lectures because of the sound quality issues. The lectures are delivered by rather bland script reading, which is fine. The issue about the sound quality and frustrating wording can a problem.

    There was some frustrating for me in terms of interpretation. There are things said in the lecture that seem normative but is represented as relative in the homework/exam, for instance. This may be the result of my overthinking it or maybe the wording of the homework/exams/lectures really is less than optimal. But it was a bit frustrating.

    The homework is fairly easy. The exams resemble the homework. The R portions of the exams are much easier than the homework. But there is probably unnecessary frustration and anxiety caused by the lectures issues given that the exams are so heavily weighted.

    Overall, I learned a lot and enjoyed the homework.


    Semester:

    This class is mostly good, I got a decent amount out of it but there are a few caveats that must be made mention of.

    Firstly, the videos. They’re terrible. Not because of Prof. Serban’s accent, but because of the constant halting starts and stops through every single topic, it is so mentally exhausting to follow along with that 20 minutes feels like 3 hours. Fortunately, students from previous semesters have compiled transcriptions of all the videos and as such I didn’t watch any videos beyond the first attempts at the start of the class. The notes had everything I needed to do well in the exams and homeworks.

    Secondly, it is important to have had previous exposure to many basic statistic topics to really be successful. Officially that is true of the entire OMSA program but some classes will hold your hand more on that front, (e.g. Simulation boot camps). I wouldn’t recommend taking this class as your first statistics class in the program if you have no previous experience with things like hypothesis tests, probability distributions and their various properties, this will help ease any confusion when it comes to specific properties of regression statistics that arguably aren’t covered strongly in the class from first principles.

    Taking the above two points into account, I think this is a pretty good, though perhaps too lightweight, class. I learned more about regression than I knew before the class, all those funny numbers in the output of the lm() function in R have meaning to me and I got to reinforce some previously learned statistical concepts, it’s hard to complain too much about that.

    The structure of the class is fairly simple, for Summer there were 4 homeworks each with a multiple choice and coding component completed separately and two exams, also with both a multiple choice and coding component. Exams made up the vast majority of the grade (85% in total) so be prepared for some high stakes testing. As I mentioned above the course felt lightweight, even over a summer semester, it probably would have been possible to cram in the optional extra material as part of the main course which could have been nice.

    If you need a stats elective and aren’t sure what to take, consider Regression after taking into account the negatives I highlighted above.

    EDIT: Updating this review to comment on the fact the TAs were high quality throughout and Aymee, the head TA, was on top of things the whole semester. That’s not always case!


    Semester:

    The negative aspects of the course that are mentioned in other reviews are very fair. For reference I should be getting an A.

    Pros - The material is very interesting and I felt like I learned a lot, just not as much as I was hoping to. This group of TA’s are up there with the SIM TA’s as some of the best in the program for me so far. They’re very responsive on piazza and take the time to really explain things. The course goes at a very relaxing pace so it’s a good option to pair with another harder course. Coding homeworks really helped you learn the material and drive home the concepts. Coding portion of the exams also felt fair despite feeling like they should have added an extra 15-30 minutes to the timer.

    Cons – The course felt too easy at times. I could go weeks without touching the course materials. Lectures could be greatly improved. Homeworks are due every 2-3 weeks depending on the module, and it really felt like they could have added another module or two to the course, especially when they have an optional one at the end. M/C and T/F questions make me feel bleh. Professor very rarely participates in the course/on piazza so really it felt like another course run by TA’s.

    Nonetheless I have to say I enjoyed the course overall despite its flaws. It was low stress and taught me a lot about regression. Hopefully they improve it in the future for everyone else.


    Semester:

    Despite many of the flaws pointed out repeatedly in previous reviews I think this was an interesting class, in which I learned quite a bit. Yes, the lectures are tedious and you are far better served reading transcripts (and the crowdsourced bible - which isn’t always completely up to date with some critically new sections), but combining that approach with office hours was a solid way to learn the material.

    I liked that the homeworks were based on the lecture material (vs. forced googling of functions/code/concepts) and that those HWs translated more or less directly to the tests, and I liked that the coding portions of each were broadly practical in nature without (too many) surprises that we had not been prepared for. It did make me feel like I was learning the concept and then applying it.

    I, like others, did not think the T/F and MC portions of the HWs or test were representative of the knowledge gained, and if you are the type of person who cannot stand getting a B in an overall course mostly because of poor performance on narrowly specific multiple choice questions, despite feeling like you have a solid grounding of the material, then I can see how this class might drive you crazy. Mean grades on T/F, MC portions of tests were routinely in the low 80s and in many ways 10-20 hours extra studying wouldn’t have helped. Grade inflation isn’t always the answer but it might help a bit in this class… If it were me I’d weight the coding sections more heavily (70/30) and give a bit more weight to the HWs and it would probably reverse half the negative opinions on this site.


    Semester:

    Regression is definitely an important topic for anyone interested in analytics. Unfortunately, I don’t think this is a particularly good class when it comes to teaching it.

    The effort you put in and your knowledge of the material don’t feel like they translate as directly to a good grade as they do in some of the other OMSA classes like CDA and HDDA. That said, I don’t think this class requires much effort. I probably spent less than 10 hours per module (there are 5), plus another 10 or so studying for/taking each exam.

    I rated the difficulty as medium not because the topics are hard, but the evaluation is so poor. You grade will be determined by the exams, and in my opinion they aren’t well done. With more straightforward exams, I’d rate this class easy or very easy.

    The True/False and multiple choice questions don’t seem to me like they are designed to test your understanding of core concepts as much as they test specific details of definitions or rules and whether you can read carefully without getting tripped up. Make a very detailed reference sheet for the exam and read very slowly and carefully. Then re-read again just to be safe.

    The coding part for homeworks is very straightforward. For the most part you just take adapt code from lecture examples and apply it to the answer the questions in the hw. If you got through ISYE 6501, this should be more than manageable.

    Unfortunately, the coding part of the exams sometimes ask you to use a function that may not have been presented very thoroughly in lectures (if at all), so be careful. Make sure you’re comfortable with every function presented in lecture, even if the professor glosses over it.

    Make sure you’re comfortable searching and reading the R documentation in Rstudio (“?function” and “??function”). You’re not allowed to google to find functions during the exam, but if you’re stuck you can look them up or search for them in the Rstudio help. It saved me a few points on the final when it asked for something I swear wasn’t covered in lectures (of course maybe I just missed it among the mountain of slides in this course).

    I also found it helped to create a table of contents of sorts with what functions/models/questions are asked in each homework and practice exam. Then when you’re working on the real exam, if you’re stuck on a question, you should be able to quickly track down the needed code from a prior example. Despite having 1.5-2.5 hours for the coding exams, I used all the available time for exams 2 and 3.

    Edit: I’m current taking MGT 6203 and the course begins with an extremely basic module on regression. While I wasn’t a huge fan of the instruction in Reg - when compared to 6203, I think Reg is much, much better, especially when it comes to the homework assignments. I had initially rated the class “Dislike”, but grading within the context of the other classes, I’m bumping this up to “Neutral”.


    Semester:

    I felt compelled to write a review for this course. Regression is in many ways the foundation of data science. I have taken the 3 required courses for OMA and thought that CSE6040, ISYE6501 were very good (especially I enjoyed Dr. Sokol’s class a lot). I though MGT8803 was a bit of a drag (especially the content for accounting and the presentation for finance were boring as hell, however supply chain and marketing presentations were good). Once I finished the 8803 class, I have been able to look at everything around me through that lens and has been rather useful (especially understanding balance sheets, cash flows, present value, future value etc has been useful in understanding the stock markets). I struggled through all 3 classes at various points but managed to score A’s in all 3.

    Coming to Regression Analysis, I was very excited to take this class since it is the first class where we’d deep dive into a key aspect of data science. We are heading into the second midterm next week and I would say this class has been disappointing and underwhelming at best in terms of both content and the presentation. In all my previous classes, after having gone through the videos and gaining a basic understanding of the concepts, I read more on the topic and saw a lot of videos of youtube and took a lot of notes from them as well. While I understand that as grad student we have to do a lot of additional work ourselves to gain a deep understanding of the topic, this is the first class where I felt that the material covered didn’t provide a good foundation of the concepts. Example: the way MLE was covered in logistic regression part is so bad that I didn’t understand it after listening to it more than a handful of times. As important as a formula may be, can we not get a good visual of what it is that we trying to learn before diving head first into it? I would myself have dismissed these things as the student not wanting to learn previously, but given how many videos there are on youtube which provide excellent visuals and explanations of concepts, I can’t imagine why a paid graduate level course doesn’t do the same? I mean, we can do pretty much all the model creation, prediction, estimation etc in R. It would do the student much more good to understand (visually or using words), what the concept is, rather than trying to simply learn the formulae. If you don’t believe this, watch the 3brown1blue channel for the visualization of linear algebra, you would see the subject in a whole new light and wouldn’t forget ever what linear algebra is and how and why it is useful. Another example would be the statquest channel for things like linear regression, logistic regression etc.

    Often times, there is cursory commentary provided by the prof and then concludes with ‘the information on it provided in the slide’. It is ironic that I have to write this as someone who also has an accent but the prof is difficult to understand and I have to always follow along with the transcript. This would hardly be a problem if the material presented is good but alas!

    I hope the class is restructured, more inspiring content is added and more time is spent on explaining the concepts. For now, I just want to finish the class and spend the summer learning more about regression elsewhere since I hope to start using it in my present work.


    Semester:

    I think this course is underrated. As someone who took different variations of regression for 4 times in undergrad, this course is my favorite. The course is highly structured. Unlike many other courses, the codes are provided in lecture slides so you don’t need to Google around (I always struggle with R). The lectures are not bad at all. However, the downside is that this course spent way too much time on univariate regression. I wish it could touch more on advanced topics in regression, panel regression or something. Overall I recommend this course to everyone.


    Semester:

    5-homeworks (T/F and coding portion). Three exams.

    PROS:

    Lots of opportunity for regression analysis in R (i.e., hands-on practice)

    Lots of good study material: knowledge checks, homework’s, practice exams

    Format of practice exams prepares you well for exam

    TA’s were knowledgeable & kind

    I appreciated that the video lectures were split between: (1) introducing and explaining topics and (2) real world examples (implementing the code, reviewing the output, interpretation)

    CONS:

    Hands down THE WORSE PowerPoint slides I’ve ever encountered. I loathe them. On their own (i.e., without listening to videos) they are useless. Additionally, they contain grammatical & spelling errors (unacceptable) and they divided into many subsections (e.g., module 4 has 21 different PPT files). If you need a 500-page transcripts document to accompany your PPT slides, there may be something wrong! The slides actually made me angry! Slides are not of Georgia Tech quality. They are poo and do not aid in the learning experience. In fact, I’d say they de-aid-ed my learning experience.

    Little to no interaction by professor. Almost 100% TA lead (and they do a good job)

    Slow response time on Piazza! Sometimes days…

    Doesn’t always cover the why or reasoning

    Other courses can and do present the material better and more thoroughly

    Coding portion of exam would focus on a topic we barely touched on in lecture (last 5-slides of 160 slide deck) and did not practice coding at all (e.g., random forests). This approach seemed more like a scramble/superficial exposure to a topic.–and counterintuitive to the learning process for me.


    Semester:

    Overall I learned a lot from this course. I have basic regression knowledge coming in and this course expanded my knowledge greatly on advanced regression concepts. I appreciate that this is an applied course, which requires hands-on R experience. The lectures go over many R examples and the R files were provided for you to follow. It is hard to get an A in this course due to the difficulty of the multiple choice portion on the HWs/exams. But it was fair and I balanced it out by doing well on the coding portions.


    Semester:

    Tl;dr: This course is a must-take for aspiring data scientists despite some hiccups here and there.

    Course content

    You will learn regression in-depth with this class. The models covered are linear regression, ANOVA, logistic regression, Poisson regression, and variable selection techniques. You will also understand the mathematics behind the models, the assumptions if a regression model fits your data, and much more (confidence intervals, how to identify outliers…). After taking this course, I can say that I feel much more confident when I talk with my statistician colleagues. The only thing that I wish prof Serban covered is the algorithms to determine the logistic regression models’ coefficients.

    Teaching style

    Prof. Serban can be a little bit dry, and her accent might be hard to get accustomed to, but she is doing a fine job. Yes, she reads from a script, but she generally goes straight to the point. The videos are not too long, nor too short, but consuming an entire module in one sitting can be a little too much.

    Course structure

    The course has 5 modules, and each module spans over two or three weeks. You’ll have 5 homework which consists of a coding part (6501 style but with clearer instructions/expectations) and an MC part. Some say that the multiple-choice questions use tricky wording, but I disagree: the questions require you to think, if you understand the concepts, you’ll get the right answer.

    Exams

    3 exams: 2 midterms and 1 final. Each exam consists of two parts: one coding part and one MC (like the homework). Both are proctored and timed. The coding is open book, but not open internet (you can still go on edX, however). The coding part can be stressful, as you’ll have barely enough time to knit your markdown file before you are out of time. Suggestion, prepare an R file with all the useful formulas before the exam so that you can copy-paste lines of code. Also, you are allowed to use cheat sheets during the MC part. Finally, you have about a week to complete both exams.

    Prerequisite

    An OK knowledge of R is required (basic for loops, how to create a function), but nothing too fancy. If you survived in 6501, you’ll be fine. You also need to have a decent knowledge of stats and probabilities (mostly stats). Some linear algebra background could be handy (matrix multiplication, the inverse of a matrix): if you survived 6040, you’ll be fine. There is a bit of calculus in some derivations, but it is mostly basic.

    TAs

    I did not really attend the weekly office hours, but I watched a couple. They are mostly open discussions about the current module’s topic. If you struggle, I recommend that you attend them. The TAs were also pretty responsive on Piazza. They sometimes went above and beyond with some of their posts.

    Outside documentation?

    Some other reviewers mentioned that they were using resources from other universities. TBH, I don’t see the point since the course is self-contained (no need to buy a textbook, either).

    What could be improved?

    Others mentioned it, but the sound quality of some lectures is beyond bad. Some videos are extremely quiet, while one was recorded with distortion. One TA said that it would be fixed, so who knows?

    Conclusion

    I really enjoyed the class, and I would recommend it to anyone in the program. Note that I took it back in 2018 but had to withdraw for personal reasons. Thus, I can say that the new lectures are much better. Also, the homework was more enjoyable to complete. To sum up, take this class if you want to gain a deeper knowledge of a basic but fundamental data science concept.


    Semester:

    I took the class after the recent restructure. Video lectures need polish. I don’t agree with the exam structure as far as weighting goes, but content I am mostly good with. The final exam this semester was not a good match. The style of question was significantly different with much more emphasis placed on interpretation than previous exams or homework. This was mentioned by a TA, in office hours, that did not get posted in the usual place. Instead it was mentioned in a comment in a forum thread.

    This could, and I think should, change. If interpretation is going to be barely a topic on previous exams then it should remain so on the final, not increase in both coverage and depth. instead of select the correct option some of those questions became select all right answers. At east a couple of final exam questions were very arguable on interpretation (I am writing this before seeing how they were graded). One question had no right answer, and two that were equally not horrible.

    Homework was very fair in content and depth, peer grading is a quick way to let a class with 200+ students be run by a handful of TAs and one professor. It also means very different grading standards. Using the median mark for a grade is a good solution. I don’t know of one better.

    Being allowed to use the previous homework on the coding portion of the exams means you were graded on ability to problem solve than memorize the R library. I think they have found a good balance of open note but not open internet. Although reading CRAN descriptions on library functions leaves much to be desired.

    Solutions are provided, but if you don’t understand them work on seeing how that code does what your own code does.

    Advice: From the start of the course create a list of which topic was covered in which homework. This will save you a lot of time on exams.

    The class has a bad reputation for both quality of instruction and difficulty. I relation to the other four classes I’ve taken so far I understand why. The instruction and question phrasing combined make the class more difficult than it needs to be.


    Semester:

    Overall I think this is an interesting class. It provides a deeper knowledge of 6501. You would get more out of this class if you study, do the homework, attend office hours and try to understand the material.

    Video lectures/materials - I would skip the lectures since the audio is bad and it’s very hard to understand. She just read it out of the slides and transcripts.

    Homework - The TAs are extremely helpful and there is office hour every day. They will help you learn and get the most of homework. HW is 2 parts (MC and Peer Reviews). Peer Reviews can be hit or miss. Some people grade more lenient than others. The point is to learn, not to punish your classmates. You can ask for re-grade from TA if you find it’s unfair.

    Exams - I doesn’t like how 85% of your grade is allocated into 3 exams (2 midterms & final). It has 2 parts (closed book MC and opened book coding). The MC is designed to trick you so make sure to read every question carefully. The coding part can be long, so make sure you have a reference code. You can’t use internet to trouble shooting but you can use help in Rstudio. Most of the coding part is similar to the homework with some nuances.

    I would recommend this class if you want to gain more knowledge of regression. I did learnt a lot from this class.


    Semester:

    Cleared due to OMSCentral Owner being greedy.


    Semester:

    I mostly agree with the other Summer 2020 reviews. Here is my personal addition:

    I do not do well with standardized testing. I would much rather be graded on projects and essays than true/false or multiple choice questions. I end up second guessing my knowledge and over analyzing my way into the wrong choice. Half of this course is project based and half is standardized. My project grades were always above average with the class and my tests were always below average.

    Here is what I appreciated about the course: The TAs and Dr. Serban take time to respond to you on Piazza. After tests I would go through and analyze my answers. The ones I got wrong I would PM the instructors with my reasoning for each one, and every time they responded with detail. In some cases, they understood my logic and reevaluated the grade. They genuinely care about learning and if you take the time to interact with the instructors they will show you that they want to help you succeed.


    Semester:

    For summer, this course pace is managable. I only used powerpoint slides and those pdfs to study. While the workload and pace is pretty okay, the midterm and final exam takes out a huge percentage of the course. The midterms were manageble, but the finals were pretty tough.

    There is not much or close to no curve for this course as compared to other statistics/ math courses in ISYE. I was a bit disappointed because I thought I understood the material as I did well for homeworks, but the finals part 1 was tricky and I didn’t have enough time for part 2. (and the finals took up 45 %!)

    If doing for summer, I would recommend just doing this course only. I had a B up until midterm, but ended up with a C because of the finals.


    Semester:

    Reading the previous class reviews made me nervous to take this class. According to the Professor and TAs Summer 2020 used new lectures, videos and homework assignments. I feel some of the previous comments are still very relevant and there are still ways to improve the class.

    I cannot comment on previous semesters but the lecture videos this summer were a challenge. Audio was extremely quiet for a lot of the videos. I ended up using the transcripts and powerpoints without watching most of the videos. If you take this class I strongly recommend combining the unit powerpoints. I will come in handy for Part 2 of the exams and studying in general.

    Homework & Exams all the two parts. Part 1 is T/F and Multiple Choice. Part 2 is R based code questions.

    There were 4 homework assignments (each two parts) and 2 exams (each two parts).

    Part 1 (T/F & MC) was a challenge. For exams and homework you’ll want to take you time here and be sure to really read the questions. There were a lot of little words to make this portion very tricky.

    Part 2 (Coding in R) I didn’t feel was overly challenging. Most of what was being tested was covered in lectures, however there were a few homework problems that were not explicitly in lectures.

    Exams were similar, both parts were proctored. Part two was timed and proctored. This made doing the coding a bit of a challenge. While it was open notes there was not internet. I was very close on time for part 2 for both the midterm and final.

    Although the class was a challenge I do feel I learned a lot. If you aren’t familiar with Regression I would recommend this class. Just be sure to devote time to the Part 1 as the wording can be challenging.


    Semester:

    Well, after hearing mostly negative reviews of this course for 3 years I’ve finally taken it. Although we cover regression in other courses (6501, 6203, etc.) this course goes a bit further into things like poisson regression, logistic regression etc. All in R, which I quite enjoyed.

    Bottom line is it’s a solid intro to regression pitched to the graduate student. Taking it will not make you a regression expert, but it will serve as a jumping off point if you want to learn more. I, for example, would like to learn a little more about the econometrics side of things, and think this course will be good to have taken perhaps before diving into that subject.

    The text is more or less unnecessary. I actually bought it and wish I hadn’t as all the examples are in SAS(!?!). Just skip it. The course transcripts and google will do the rest. The text doesn’t even cover the lasso, ridge, elastic net portion of the course.

    Supposedly, the videos have been “improved” but if that’s the case, I’d hate to see the old ones. I think I could do a better job with my iPhone and a decent mic and, maybe, some white boarding software. I mean, where’s the creativity? Where’s the character? I’d be thinking of ways to maybe make the lectures fun and entertaining. Like the Isbel ML lectures on Udacity. Is that so hard? But then I’m not a stem prof with a 147 IQ. You’ve heard about the bias-variance trade off? I propose there’s a personality trade off involving mathematical ability and ? It’s telling that aside from Goldsman, the other most engaging professor (Lee Campe) was from the College of Business. Heck, even the financial modeling (also college of business) prof showed a little personality.

    I am nearly done with this degree so can now say that, truly, Ga Tech ISYE and CSE professors aren’t hired based on being engaging lecturers (except for Goldsman!) No, they are hired for their research, or publications, or whatever, but not because they’re good teachers.

    The lecture style was basically the prof clearly reading a script. I mean, just watch one Khan academy video and you’ll see a much more engaging teaching style than anything in virtually any class here. Sigh…Bottom line, lectures are boring, but “those who’ve came before” us have created transcripts of the all the videos so you can just read them (and use them for reference on test/homework day).

    Okay, so what’s good here? I did note that I “liked” the class, after all. The topic selection is solid, the homeworks were very educational and took a bit of work. And the tests were mostly reasonable (past years people had complained that they were “tricky”). I didn’t feel that way myself.

    I should also note, that, unlike other courses I’ve taken here (coughbayescough) there ware more than adequate TA coverage and the TA’s were responsive and helpful. The class was well run. Although I didn’t avail myself of those resources, I could clearly see as much on piazza, etc.

    How hard is the class, in summer? Frankly, I put relatively little time into the front half, got an “A” on the mid term, then for whatever reason, did poorly on the final. Ended up with a B. The fact that two tests make up 85% of the grade is a bit weird (and my case shows why) but in spite of my apparent poor final prep skills, I don’t think the class is overly hard. Nothing is particularly hard to understand. The homeworks can take several hours. Usually I’d just block out every other Sunday or whatever for “homework day” and the rest of the time, things were, frankly, pretty relaxed. I may have averaged 10 hours a week and that probably an overestimate.

    You have to take two stats classes. If you’re AT track, you’re almost certainly taking this. If you’re comp track, you might want to take HDDA, CDA, etc. If you’re biz track, I think you should definitely take this. It’s an important foundational subject in analysis that still drives a lot business models out in the real world.

    TL;DR, good subject, topic selection, boring lectures, high stakes timed, proctored exams that can be a little stressy.

    Good luck!


    Semester:

    This class was apparently redone for this semester - I’m not sure if the redo is finished or what actually changed, though. So I have no idea how similar the class I took is to the one that got mostly negative reviews, or the one you might be considering taking if the class is still under construction.

    Overall I liked it quite a bit. Learned a lot of very useful knowledge - regression is such an important part of analytics, I now know so much more about variable selection, proper use of glm vs. lm, etc. than I did before. There are still a couple of grades pending and I’m unsure if there’ll be a curve, but odds are I’ll wind up with a B.

    My biggest gripes are the quality of lectures and exams. With lectures, Dr. Serban oftentimes gets too into the weeds. And, she’ll reuse the same example over and over - which is fine, but therefore assumes you remember something very specific from five lectures ago. For instance, let’s say in a given model the R**2 value is 0.7. She’ll then refer to 0.7 in a later slide without context, leaving you to scratch your head, like where did that number come from?

    The exams have two parts - a proctored coding section where you essentially have a couple of hours to do a homework and can’t go to the bathroom, and a proctored multiple-choice section. My complaint is the multiple-choice: many of the questions seem designed to trick you. Heavy use of words like “never” and “always” where I made myself crazy trying to think of a single exception. Occasional questions phrased very confusingly with double/triple negatives. And she asks a few obscure questions that were briefly mentioned once and never again - though we’re allowed cheat sheets there’s just too much information to write down/memorize for every tiny detail. So the reason I graded the course as hard isn’t because it’s extremely time-consuming, but there are a lot of pitfalls to earn as high of a grade as you might like.


    Semester:

    Overall, I found the course to be valuable. It was very organized and well run by the professor and TAs. If I had one criticism of the course, it’s of the video lecture quality. Not only is the audio bad, but the professor is not a great presenter and essentially reads from a teleprompter which makes it difficult to stay engaged. However, they have a well organized transcript and lecture slides to go along with the videos which is what I used primarily. The HWs and exams were decent and organized. There are a number of data sets used in the HW and exams that force you to apply the concepts (using R). I’d say I came away from this course knowing a lot more about regression than I did coming in.


    Semester:

    Positives:

    • The topics covered are exactly the right ones for a regression class and I feel that I learned relevant material
    • The TAs and student community was great and I felt I could easily reach out on the slack channel or piazza for help
    • The assignments and exams were well designed, and the lecture knowledge checks and coding examples were ample and really help you get a feel for how the concepts link up to practice
    • The topic is taught in a standard way, so other resources like Penn State’s STAT 501 class, regression textbooks and wikipedia line up pretty much exactly
    • Transcripts/notes/powerpoints are provided for all the lectures and are well-organized

    Negatives:

    • Lecture videos are reading off a script, often with lots of formulas rather than emphasis on visuals and concepts

    Generally, I really liked the class - the assignments were great learning exercises for me and the best part of the class. I think the lecture videos are not good and for that reason some reviews will criticize Dr. Serban, but I think the other aspects of the class more than make up for it - most professors, even those with good lectures, will not provide as many resources, examples and helpful assignments as this one did.


    Semester:

    What saved me from giving this class a “Strongly Disliked” rating were the TAs. The TAs in this class definitely made it doable and help you feel like you’re learning something. They were super responsive, and explained topics much better than the professor.

    Dr. Serban’s teaching style is definitely not for everyone. You’re honestly probably better off not watching the videos and just downloading the slides/transcripts and walking through it yourself. It really just felt like she was just reading from a script and didn’t provide any additional insight to various topics taught in this course. Not to mention, the quality/sound in some of the videos made it sound like she recorded the videos in a dungeon. The thing is, you take these classes hoping you learn something in addition to your own studies. Without the TA’s, I’d have felt more lost after taking this course. The professor provided absolutely no value to me.

    Homework was pretty straightforward. Exams were fair. I never thought I was asked anything that I wasn’t exposed to in class. You don’t need to have R experience to take this class, but it definitely helps.

    Did I walk away from this class feeling like I learned something? Yes. Was it because of the professor? Hell to the no.


    Semester:

    Overall this was a good course and I learned a decent amount. This was my 9th course in OMSA.

    The class had 5 real units

    • Simple linear regression
    • Anova
    • Multiple Linear Regression
    • Logistic/Poisson Regression
    • Variable selection (forward/backward, ridge, lasso)

    Pros:

    • Homework structure (mixture of multiple choice + T/F and applied R coding) was conducive to learning.
    • Workload was not at all demanding, even during the condensed summer schedule (although I was only in one class this summer). There were four homework’s and two exams all of which had a multiple choice and coding portion.
    • Most importantly, the TAs were highly engaged and most of them were very good. I thought Michael Tritchler was especially good and really took a lot of time to make great content and explain concepts.

    Cons:

    • The lecture quality is poor and nobody taking the course will get much out of them.

    Overall: The past negative reviews make me see this course as a bit of a gamble. If your TAs aren’t good, you’ll end up with questions that are senseless, confusion on homeworks, and less learning over all. If they are, you can expect to become better at regression through this course


    Semester:

    This is definitely a must-take course for all 3 OMSA tracks, and I believe it should replace MGT6203 as a core curriculum class. The class focuses on how to interpret the results and judge on the output and you will be given many detailed examples with various types of data sets. However, you will not dive deep into the computation behind the algorithms, and will mainly perform regression using R as a black box. Still, there are other courses that focus on the computational aspect, and I believe that the information and skills you will get in this class are invaluable. I also had a lot of fun going through the exercises and interpreting the results.

    It seems that most of the criticism this class gets comes from the TF/MC quizzes and exams and that some questions are tricky and misleading. Although it is surely frustrating to lose grades based on semantics while fully understanding the material, I found only 2-3 (out of 20 to 40 questions) of these in each quiz/exam, and I found the rest of the questions logical and test your understanding of the material. On the other hand, the other half of the exams were R coding questions that were definitely doable (and even easy) if you watch the lectures and do the homework, and the grading was lenient. That being said, if you overlook a couple of points you might loose on tricky questions that will probably not affect your overall grade that much, you will find this to be a light and a not-difficult course.

    It should also be mentioned that, although the TA’s were great and responsive, the office hours were mess. I believe that instead of the dozen weekly office hours that were poorly organized and resulted in almost no one attending, a 1 or 2 well structured office hour will be much more helpful!


    Semester:

    This courses is an example of how not to teach a course.

    1. Poor lecture quality.
    2. Awefully worded english transcript, defintely a different era of english.
    3. Unstructured office hours which were practically unattended.
    4. Horrible multiple choice test which does not care of your understanding of the course but more of trying to trick you. Given point #1 and #2 you are definitely going to fall in the trap.

    Having taken the course i wish i knew what i know now.. and wont have taken the course even when gatech paid me for it.

    Glad TAs were there to support otherwise would have lost my sleep over it.


    Semester:

    I’m writing this review in Spring 2020 to offset all the super negative reviews about this course. This course is not THAT bad. There are definitely issues with it, but after taking it I can definitely say I learned new things about Regression (even though this is one of the final classes I’m taking in the program). Within the program, ISYE 6501 and MGT 6203 also covered Regression. This course helped me learn new bits about regression that I found useful to my overall understanding. With Regression as a highly used analytics model, I’m glad that I took this class.

    The good:

    • In depth focus on simple & multiple linear regression, ANOVA, Logistic & Poisson models, and Variable Selection.
    • I think the course does a good job in forcing you to understand how to assess the model fit and understanding what the outputs of the model are. This includes variable interpretation and overall model significance. It sounds basic, but I think people who think they understand don’t actually get it at a detailed level.
    • Pacing of the class was fine and the concepts built upon each other
    • The exams were fair and had plenty of time to write your R code and answer the questions
    • The TAs for this semester were very good and helpful

    What could be improved:

    • As stated in other reviews, there are lots of little errors in the videos and I think the professor said herself that they are redoing them shortly. Some people find her hard to understand, but I didn’t have that problem.
    • The lectures can be a bit dense, but I found it most helpful in the “data examples” in applying the concepts to a real problem.
    • The reliance T/F questions in HWs and exams is frankly annoying. The questions are written with some grammatical errors and you can tell where they are trying to “get” you. It’s an annoyance that you’ll have to get over.

    For people considering this class, please don’t avoid it with all the previous reviews. There are good things to learn here even with various annoyances.


    Semester:

    This course gets some pretty poor reviews. I think some of the reviews are justified. In the end, I think this is a course worth taking. I think this course could be improved, however. IMO, this is probably the most fundamental aspects too analytics. Here is my take on the good and the bad: (For reference, I expect an A in the course.)

    The good: The TAs for this course did a pretty good job. They were not the most responsive, but no questions went unanswered for more than say 48 hours. Even the really silly open-ended ones (typically asked by yours truly.). In the end, I left this course knowing so much more about regression that I did before, and much more detail that is covered in 6501. In the end, I am glad I have this knowledge.

    The TAs shared a transcription document at the beginning of the course that had pretty much everything the professor said. That was tremendously valuable. I was able to search for terms and look things up quickly without puttering around in video files.

    In general, the TAs were tremendous. If it weren’t for the high quality TAs, this course would have been a disaster.

    The bad: Heavy sigh. These comments are meant to help improve the course. It is a graduate level course, and I don’t think that it needs to spoon-feed students. If you want to be spoon fed, I don’t think GT is really an appropriate program, IMO. That said, there are some major areas for improvement…

    Syllabus: The syllabus is just plain wrong in many aspects. The described final described in the Syllabus was completely wrong. One of the midterms was scheduled over spring break. The instructor did respond to a couple questions over the semester, but was largely absent.

    Course pacing: The course seemed to be paced oddly. Some of the weeks had multiple things that were due, and at other times, things seemed pretty spread out.

    Errors in materials: The slides and the transcription were incorrect, and not just in a few places. I found at least 10 errors in the slides and the transcription document… and they were not just typos, but the kind of errors that could really put a person on the wrong track.

    Textbooks: The first textbook was valuable, but the second one on generalized linear models was not useful at all.

    Extermal Resources: I probably spent as much time on Penn State’s website and looking at Introduction to Statistical Learning in R than I did with the course materials. That doesn’t speak well for a class that I spent over $1k to take. It is really frustrating, and makes me angry that Georgia Tech didn’t do a better job offering the course than Penn State. Seriously makes me mad. Come on, GT, we are better than that!

    Exams: The midterm exams had questions that were worded pretty poorly, and really split hairs. I didn’t get the impression that the intent was to trick people, just that the questions needed to be more direct and worded better. For non-native English speakers, I think that the was significantly more energy required to parse some of the questions than the underlying concepts.

    Topics covered. This course doesn’t cover PCA or CART, which is really unfortunate. It goes into a lot of statistical details, but doesn’t spend much time on the linear algebra perspective linear regression. Also, I felt that the explanation of regularized regression techniques (Lasso, Ridge and Elastic Net) were actually covered better in ISYE 6501.

    Office hours: I would recommend fewer office hours, but with much more structure. The office hours were mostly good, but they were completely open ended. Having some structure would have really helped, and made the much more worthwhile. TAs gave A TON of these sessions, it is too bad that they could not have been a better use of time.

    I do hope this course gets updated in the future. It is worth taking, but can be frustrating.


    Semester:

    I’m very glad to have taken this course, though my relatively weak stats background (I’m more of a probability gal) made it a challenge, especially the first 1/3-1/2. You might think an entire course about regression analysis would be a bit light, particularly if you’re already experienced in machine learning; you’re fitting a line, right? Haha, no. When you’re done with this class (or it’s done with you), you WILL understand regression - and in particular how to evaluate regression models, choose among models and figure out how reliable their estimates (or predictions) are.

    Downsides: the wording of exam and homework questions is sometimes a little off, and it requires a certain amount of teasing out what they really meant to ask. (The instructor is reasonable about these, though, and if the wording is outright incorrect she will make sure points aren’t deducted.) It’s easier at the end of the course than at the beginning.


    Semester:

    The course by itself is very important from a Data scientist perspective . It took me some time to get acclimated to Prof Nicoleta Serban’s accent in the lectures. Course content is good with the right balance between underlying math,stats and application. Lot of examples covered in the lectures, Assignments and exams. The combination of open and closed book assignments helped rehash the concepts and application of the concepts clear. The time for the open book section in exam is not sufficient though. Easy to score an A in this course. Mike, the TA was exceptional.


    Semester:

    Overall, the course is not very enjoyable. The content itself is important and crucial to know as a data scientist, but the content provided by the course does not do a good job of teaching it. There are numerous mistakes in the lectures / slides and they are hard to get anything out of. Expect to refer to ISLR or other reference textbooks to get a grasp on the material. There are unclear questions in the homework and exams, including a decent amount of “gotcha” style questions that don’t test your true knowledge of the content.

    The workload is manageable, so the course can be pretty easily paired with another course. Hopefully the course is re-done soon, otherwise it may be worth avoiding unless you accept that you will be learning finding your own resources and dealing with some mistakes and wording issues.


    Semester:

    I strongly recommend taking another course. Please read the below reviews. You’re welcome.


    Semester:

    This is the worst class I’ve ever taken. Bad lectures, bad assessments, bad TA’s, bad everything. I have an A and expect to finish with one – I’m only bitter, because I want to learn, and if that’s your goal, this class is the literal worst and a total waste of time**. If all you want is a 4.0 GPA for a useless degree and no hireable skills, then go for it.

    ** instead I’d recommend going through the first 6 chapters of Intro to Statistical Learning (free book + MOOC) on your own time. It doesn’t cover everything, namely ANOVA and Poisson Regression, but it covers all of the useful stuff and it’s so much better. At least 10x better.

    Note: it’s possible the class will be improved in the future as I’ve heard from Dr. Sokol himself that it’s being worked on… until then, STAY AWAY


    Semester:

    I really disliked this class. Many other posters have touched on my points, but to recap:

    • Low quality production: There are multiple lecture videos where the professor pauses, says “wait wait we need to re-take this”, and then starts from the top. Apparently nobody did a review of the videos before posting, because the final edit includes all of the outtakes. It feels cheap, considering the lectures are many semesters old.
    • The lectures are dry
    • Half of the homework, and about half of the midterm/final, is in a T/F or multiple choice format. It seems like very little thought was put into writing these sections of the homeworks and tests. This really isn’t a program/topic that should be taught through rote memorization (when in the working world is your boss going to do a pop quiz on the number of degrees of freedom for an ANOVA residual??? how is that useful???), but combine that with abundant grammatical issues and various trick (“gotcha”) questions, and these portions of the course became nightmares. I’m a native English speaker and can only imagine how hard the class was for those who are not.
    • There were lots of TAs and most were super helpful. It was still a mixed bag when asking for input, though. I saw many incorrect responses from TAs that led students down the wrong path, and the lack of organization on Piazza meant that significant guidance/direction on a homework was buried in a thread and not pushed to the full class.
    • This seems like a small thing, but the entire term started off so stressful because the syllabus was so hard to understand. The content of the first few units was easy - basic regression, ANOVA, etc. But there is a language/grammatical barrier in the explanation that creates stress when nobody understands what is expected of them.

    It’s really a shame because the topic is so important, but I finished the summer term with more resentment than appreciation for the course. Hopefully the professor can put the required time and attention into improving the quality and teaching methods (no more T/F - they have no place in this program).

    Quick note on the 7hr/week estimate during the summer term. Homeworks were due every two to three weeks in the shorter summer term, so there was a good amount of content to cover in each. I typically spent 14 hours per homework, but that included 3.5 hours to watch the videos, 3.5 hours to read the lecture transcript (honestly, I had a hard time understanding good portions of the videos, so I read the transcript slowly and in detail), 1.5 hours to do the T/F portion of the homework, 4.5 hours to do the coding/open response portion of the homework, and 1 hour to peer review.


    Semester:

    This class was very straightforward. It basically covered basic concepts over again but in more detail. The homework weren’t too difficult but definitely took time. They are also peer graded- which meant little feedback and a wide variety of scores. Dr. Serban is also very hard to understand, so I didnt watch a single lecture- instead I read all of the transcripts.

    Her tests were also included some very detailed information- that one sentence from that one slide kind of question. She was also very hands off and let the TAs handle everything. So when you needed clarification on something you sometimes got great help and othertimes you did not. Really depended on the TA.

    Overall this course wasnt great, but wasnt too difficult either. I ended up with an A in the course.


    Semester:


    Semester:

    The course content alone should make this one of the more useful classes in the OMSA curriculum. I was really excited about gaining a deeper understanding of regression analytics after having a very good experience with 6501. However I was disappointed by the high number of mistakes in the materials and confusion among TAs on quizes and exams. The power point presentations seemed rushed and very low quality compared to Dr. Sokol’s lectures. I think there is quality assurance issue across the OMSA program and this could make the case for more oversight given that the delivery mechanism is conserved in this learning system.

    Good: Starter code and examples Topics were useful and insightful TAs were willing to help

    Bad: Errors in materials on exams T/F, MC questions (not all but too many) were unnecessarily designed to confuse/trick. Final was too long and many didn’t finish in the allotted time.

    Time: Compared to the battery of homework in 6501 this was reasonable load (4 HWs due bi-weekly, 2 exams).

    All in all I recommend this class but caution about reading the content carefully and be cognizant of the assessments and intention to trick you into the wrong answers.


    Semester:

    This is one of the worst classes I have taken. Poor lectures and bad exams with T/F questions that were pure memorization. The TAs were helpful which was the strong point as the instructor was virtually non existant. Overall the workload is not bad and I did like some of the homework exercises but would not take it again.


    Semester:

    This course was definitely a mixed bag. The biggest complaint I have is that the grading scheme doesn’t measure understanding of the material very well. I feel like I learned a lot from the peer-reviewed portion of the homeworks and had to learn markdown, which seems worthwhile. For the closed book portions, T/F and MC questions were mostly just worded poorly in an attempt to be tricky. the tests were pointless. “open book”, but timed. proctortrack is absolute garbage. I feel like i lost a lot of points because i couldn’t use my normal computer setup. The TAs actually were pretty good in this class. they seemed to care. Videos were not that great and could really use some editing. You get a little more detail on regression than you do in 6501 and 6203, but not sure it is worth all the hassle.


    Semester:

    Pros: Regression Analysis is a very important statistical analysis to know and it is very applicable to real life.

    Cons: Lectures are very long and it was hard for me to digest. I consider the best teachers to be the ones who can take a complex concept and be able to explain it in a way that a smart 12 year old can understand it. Professor Serban was not able to do this. I suggest you rely on external resources if you do not understand a concept.

    HW and Exams: Many people complained that the wording for the T/F and MC questions were tricky and they definitely are. But if you go in with the mentality that she is trying to trick you and read every SINGLE word in the question, you should be fine.


    Semester:

    I was very excited to take this class, as I really enjoy Regression Analysis. The content is interesting, and I do feel that I have a better understanding of regression analysis after finishing the course. However, the current design of the course is frustrating and led to a very difficult experience. The True / False component of the homework has questions written in a manner that was very difficult to understand, and it could take a very long time to unwind what the question was asking. I did think that they TAs and Dr. Serban were fair in regrading any questions that were too confusing. The R portion of the assignments were not unreasonable, and I appreciated the sample code provided by Dr. Serban. I think my largest difficulty in this class was trying to reconcile the lectures and how the content was being assessed in the homework and the assignments. There seemed to be a large degree of confusion between the TAs at the beginning of the course, and it was difficult to get a good answer. I did enjoy the numerous opportunities for office hours and the TAs were usually very responsive on Piazza.

    Overall, I learned quite a bit, but did not fully enjoy the experience, as I felt the assessments of learning were not designed in the best way to measure true understanding of the content.


    Semester:

    Overall, not a bad class. I learned material in addition to mgt 6203 and ISYE 6501.

    There are 4 HWs and 2 exams. Each of the HWs has 2 parts - a quiz of approx. 25 questions worth 50 pts and an R portion with some interpretation also worth 50 pts. Though I think peer reviews for the second HW part were useless in terms of feedback, I’m glad that we were graded that way because most students were fairly lenient and I mostly got 1 person who was harsh. At the end, since the median grade was the final one, one needed 2 decent graders to score well.

    Lectures: This is my biggest piece of feedback: fix the lectures. The delivery is dry and hard to follow. I stopped listening to them after week 2, instead pausing the videos and fast forwarding through the script. The lectures could be simplified - 1-2 videos on theory, 2-3 videos on practical examples. That being said, I found the examples provided to be really good, real life ones. My experience was to read Penn State’s online lectures after the videos so I can extract the highlights. I also made detailed notes after each topic in an attempt to summarize the most important points.Another suggestion is: do not overlay multiple screens in the lecture slides, it makes it hard to print them out and take notes without having to re-work them manually. Overall, small stuff but combined it makes a difference.

    Exams: largely ok, though there were times when the questions were tricky, with multiple possible answers and overpenalization if you don’t select the exact combination + added pressure from the time limit and Proctortrack. I prefer non-proctored exams, even if that means more analytical questions and more comprehensive coverage, given enough time to sit and think through them.

    The class is doable in the summer because the professor makes the material available ahead of time + you have 2 weeks for each HW, so even if you combine it with an easier 2nd class you can go through it. With a little more work in strengthening the lectures, this class could be a strong one since it covers great topics and the examples provided are really good. I don’t know how MGT 6203 will be reworked starting Fall 2019 but as of now I think if you take ISYE 6501 first and ISYE 6414 second, you’ll be good with basic stat concepts. MGT 6203 is not necessary.


    Semester:

    I was hesitant to take this course in the context of the reviews I’ve seen previously. That said, while I believe that much of the criticism is justified, it was still a worthwhile experience and not a particularly hard graduate course. Commentary about the errors in the lectures is quite true, and the method for assessing understanding (True/False questions that are sometimes worded problematically worded) is not a good choice. Some had issues with peer grading of the coding portions of the course (in terms of peer graders being inappropriately harsh or even incorrect). I think my biggest critique of the course is that the material is not taught as clearly as it really could be - I began rewriting information for my own understanding and I did not find it difficult to make clear. On the positive side, this has been an immediately valuable class, and I never would have sought out an understanding of the math and statistics around regression without it. Additionally, the TAs were extremely responsive and very strong on the topic - probably the best I’ve seen so far.


    Semester:

    I was really looking forward to this class as I feel regression analysis is a very important skill to have.

    I think the topics covered were relevant the the examples including the R code was very helpful. I do believe I learned in this class and it is really worth taking, but it had some challenges as discussed below.

    I was disappointed in the videos which has inaccuracies that were constantly being discussed on Piazza and corrected. If you only watched the videos you would be very misled. It was critical to keep up with the corrections from the TAs. In addition some of the ways the professor explained how to interpret results were very hard to follow. I could see by the Piazza posts and office hours discussion I was not the only one struggling with this. I supplemented this class with other classes I found online in order to get a good intuition on some of the topics. I used the videos to review for tests but this may have hurt me since they were not accurate. Thank God for the TAs in this class who gave us an accurate version of the transcript to use.

    Since I essentially had to take 2 classes this one and the one I found online, the amount of effort I put into this class was a lot more than expected.

    As for the tests I found the wording of the True/False questions to be an intention to trick instead of teach. It is difficult enough to have a time limit on the testing which really stresses me out anyway, but when the questions are worded in this way you need to take extra time dissecting each word it is even more frustrating. An example, you would have 2 sentences one which is accurate and one which is a judgement call. I decided in the second test if I would just default all of these to false. This strategy worked pretty well but how disappointing that my answers came down to this type of judgement.

    The office hours I attended some live in the beginning but decided they were not helpful. Later in the semester I started watching the recordings and realized the office hours did improve as the TAs attempted to supplement our learning by teaching topics along the way. These were very helpful in the second 1/2 of the semester along with the links in Piazza that told us which topics were covered in which office hours.

    My suggestion for future classes are to please update the videos so they can be depended on since this is the only interaction we ever had with the professor.


    Semester:

    I overall really liked this course. I enjoyed the schedule (4 sections, 1 homework each, ~3 weeks to watch lectures and complete homework in the summer term) much more than classes like the intros (ISYE 6501, CSE 6040) where there is homework due every single week. It lets you live your life a little instead of constantly churning out assignments.

    I thought Dr. Serban’s lectures were very helpful. They were clear, logical, and followed a common format that made it easy to understand each unit as the semester went on. The lecture transcripts were super helpful and are definitely something that I will hold onto as a tool for reference in the future. The TAs for this particular semester were really great too - super responsive on Piazza and really helpful explanations. I never attended office hours.

    The homeworks (T/F quiz and a peer reviewed assignment) aren’t too hard to do well on. I did good on the first test and bombed the final, but I also didn’t really put much time into either.

    Most importantly, I’ve used some of the stuff I learned in real value-added projects at work. That’s why I’m in the program.


    Semester:

    I am just going to echo my frustration wind this class another review which perfectly summarizes this class for me:

    This was the worst course I’ve ever taken in a collegiate setting.

    The worst thing about this class has been unclear instructions and expectations, combined with a lack of timely responses from the TAs and no responses (ever) from the instructor.

    Questions on homeworks / exams were frequently confusing / poorly-worded The last homework assignment was so poorly designed that it was impossible to know if your result would match the “solution” because the instructors didn’t understand what effect setting a seed has Major issues and questions raised on Piazza went unanswered for days or over a week in some cases Questions that were answered on Piazza were often answered incorrectly or incompletely by the TAs Rules and expectations for exams were changed mid-exam window The practice final solutions weren’t unlocked until a day or two into the final window The closed book section of the final wouldn’t open on day 1 of the finals window This class pretty much requires that you go learn regression on your own somewhere else I’m sure there is more I’m forgetting… Lecture Videos:

    Long, boring, and difficult to follow. There are frequent mistakes in both the words the professor is saying as well as the transcript. It’s so bad that the students have crowd-sourced a more-accurate transcript so people can have a chance at figuring out what’s happening. The professor would throw up a static slide of formulas (e.g. MLE or confidence interval derivations) and talk for several minutes with no real explanations to what was happening on the slide. It was just expected that you would look at it and understand it immediately. Homeworks are broken into two parts:

    T/F & Multiple Choice section - These usually aren’t terrible but the questions are often worded poorly. There were also issues here of how these were submitted in Canvas (in free-form) and then graded by a TA. Towards the end of the semester they realized they should just make all the questions T/F or Multiple Choice in Canvas. A coding section that is peer-reviewed - These have been pretty painful. They are often long and extremely tedious. (e.g. Do X, now do that 17 more times). There are 5 of these in total and at least 2 or 3 ended up being more than a 45 page pdf. How the question is phrased and how you are supposed to setup our problems are often ambiguous, which leads to a lot of problems with peer grading. Many peer graders are very strict when it comes to deviating from the “solutions” Midterm - wasn’t terrible. It’s in two parts like the homeworks:

    Closed book T/F & Multiple Choice questions. The biggest issue here was confusion with the rules. They had posted that people could use a singled-sided formula sheet. However, there was confusion as to what could be on that sheet, which they clarified (and made more strict) halfway through the exam window. So people who took the exam earlier benefited from a more ambiguous rule. Open book (no internet) coding section - This was mostly ok. Final - really bad

    Closed book T/F & Multiple Choice questions - very poorly worded questions and the style is very different than the Midterm. There are also a few topics not mentioned at all in the course up until now. Open book (no internet) coding section - this was basically impossible to finish in time. We’re still in the exam window and a lot of people are reporting scores < 70%


    Semester:

    I liked this class. The student, TA, and professor discussion was useful and much more pertinent than I ever got in college or medical school. I made a ton of progress in my analytical and coding toolboxes. But many of the criticisms were accurate about the course mechanics and especially the final exam. I will probably get B’s in both classes because I got nailed on the part II of the final exam for each class.

    How to beat the exams, this applies to 6402 and 6414!

    (1) get the course transcripts available the first week. Take notes and make corrections, add slides or equations to them, manipulate formulas to make sure you can apply them. 70% of the MC/TF questions come EXACT from the notes so the EXACT wording actually makes a difference. I used EXACT three times now…I think you get the point. Understanding a concept in general is not enough. The other 30% comes from applying the formulas, graphics, and concepts to solve multiple choice problems (usually worth multiple points and may have multiple correct answers). Many of those questions were “plug and chug” IF you have the correct formula.

    (2) The midterm was fine. Final was massive for the amount of time. People that use R in their jobs said they could not finish. The thing I noticed was that all the exam part II (Regression and TSA) questions were combinations of the homeworks or sample codes given in the classes. The questions were fair. The problem was adapting them fast enough to use on the test.

    Get together in groups to save time. Go through all the R code examples in the class and homeworks. Turn them into “GENERIC” functions….SHARE WITH YOUR CLASS AND CHECK THAT THEY WORK. If you can just transform your data and feed it back and forth into your canned functions (you can copy and paste them from R or Rstudio into your test document) it would allow you to get done with time to spare. you can even go so far as to put in some canned explanations with fill in the blanks (rememeber MADLIBS as a kid) accompanying your code and all you have to write is (IS/IS NOT CORRELATED) (IS/IS NOT MULTICOLLINEAR ….etc) (HYPOTHESIS TEST < > p.value indicates …)

    If you built the functions well you will have to alter less code to adapt it for the exam. you can lose 30-60 minutes if you mismatch your data types or need to debug subscripts to ensure alignment. If you have to do that AND adapt your code you will have a rough day.

    None of these are hard, you will learn to do them, but you need to move fast. This is what I remember off the top of my head. Disclaimer…I do not have perfect recall so forgive me if I make a mistake here.

    1 Splitting a dataset into training/test

    2 performing data analysis and graphics including boxplot, scatterplots, correlation, correlation matrices

    3 Models for linear regression, multiple linear regression with and without log transformed predictors and responses

    4 Models for logistic regression with repetitions for error analysis and without repetitions for classification errror.

    5 Poisson model

    6 Subset model (forward, backward, both)

    7 all subsets models(LEAPS library)

    8 hypothesis test (normality, correlation)

    9 residual plots and analysis

    10 plotting the data and a fitted model on the same plot

    11 using your models to generate predictions

    12 calculate your prediction errors (MAE,MPSE, MAPE…) and RMSE

    13 perform elastic net, ridgem and LASSO regression along with accompanying plots and prediction errors

    Time series

    1 Take a series of dates from factor or string to a DATE

    2a Fit a time series data set with a linear, polynomial, quadratic, harmonic, and splines regression, and be able to predict a few points ahead.

    2 plotting ACF, PACF, for original data, differenced data, residual data, and squared residual data

    3 arrange time series forward or backward in time

    3a plot multiple time series plots aligned on the same graphic

    4 be able to use forecast to predict values and plot them on top of a time series (using base package or ggplot2)

    5ARIMA - including submodels AR and MA and ARMA and EACF

    6GARCH

    7 ARMA-GARCH iteratively

    8 plotting squared residual plots to check for autocorrelation

    9 Calculating and interpreting hypothesis testing for significance, normality, correlation

    10 apply a vector autoregression model

    I recommend both classes


    Semester:

    Material is presented clearly and in a logical order. Sometimes the lectures can be a bit lacking in coherence for the highly theoretical material, but the data/coding examples are extremely well done and relevant to assignments and real world application. Peer grading is always a little bit of a crapshoot, but for the most part people seemed fair and even erring on the side of generosity. The final included a sizeable amount of material that was not at all covered in the material, though I suspect there must have been a generous curve or very lax grading because grades seemed reasonable. A large portion of the exam is coding. Material isn’t hard and it is open notes so you can copy from homework and example code pretty effectively, but it is proctored with that crummy ProctorTrack software that doesn’t allow for any breaks whatsoever, and the test itself is 3 hours long. I contacted the professor and TAs after the the exam because this requirement is unreasonable and hopefully it is somehow altered, but in the meantime you will definitely need to plan accordingly… I kept an empty gallon jug under my desk…


    Semester:

    I had taken ISYE6501 and MGT6203 before this class, both of which touched on regression. Also in both, classes, instructors and TA’s stressed the importance of regression analysis, even going as far as saying that regression is used in 90% of analytics work you will be doing in the real world. So I was very interested in taking this class.

    Content: This class goes deep into the Mathematical basis of regression. Not much new will be learned in terms of practical application and R that was not already taught in 6501 and 6203. In fact, even on the theory side, I found myself at times re-reading my notes from 6501/6203 to refresh my memory on concepts being taught in 6414. The way regression was taught in those 2 classes gave me a good intuition of what is happening in regression, something I would not have got from the way this class was taught. Unfortunately in my opinion, this is a case of “everything you really need to know, you learned in kindergarten (6501/6203)”.

    Instructor/TA’s: TA’s did a good job and were fairly responsive. Not much interaction with the instructor on Piazza, but I believe she did hold weekly live discussions on Bluejeans. Unfortunately due to time conflict, I could not attend those.

    Exams/Homework: All homeworks (4) and midterms were very manageable. Things changed drastically with the final exams, which caught many in the class off-guard. The final had 2 parts - 1st closed book test on theory; 2nd open book practical. The mean grade on part 1 was 62%. No grades yet on part 2, but judging from discussions on Piazza, grades will be worse than part 1. It was unreasonably long - the equivalent of 2 homeworks (I spent an average of 4 hours on each of previous homeworks), expected to be done in 1.5 hours.


    Semester:

    • Easy course to get an A in, although I would rate assignments to be overly repetitive. I did ISYE 6402 in this semester as well, and found both courses run by NS to have long assignments.
    • Had homework quizzes that we too easy.
    • Mid-semester exam was too easy.
    • To compensate for easy quizzes and mid-semester exam, they set a tough final to create cross-sectional spread in marks and differentiate students. Was good for me, because my daily work requires use of R and I’d consider myself proficient in this language. Bad for new users of the statistical language.
    • A key drawback was the lack of explanation in some concepts. To prepare for exams, I went through lecture videos/lecture notes/my notes 2-3 times. However, concepts were tested that were beyond the scope of the materials.
    • Easy course, but required hours on assignments.


    Semester:

    Overall, I had a decent experience in this course (except for the final exam, which I will discuss shortly). Dr. Serban’s lectures were fairly informative, if perhaps, a bit long-winded. The assignments were reasonably challenging from an intellectual standpoint (not overly so, however), but they did a solid job of mirroring/testing what we learned in the lectures and were graded via peer-review (not a good thing due to the subjectivity inherent in the assignment answers and the different attitudes/outlooks of the student graders). Some of the assignments were quite lengthy (one of our assignments had an average submission file length of 35 pages and the solution file was 51 pages!), but we usually had 2+ weeks to complete them. The midterm was reasonable and had both a multiple choice section and a moderate coding section (both were completed while being proctored under timed conditions).

    However, I do have one severe criticism pertaining to the final exam. Again, it was a 2-part exam with one part for multiple choice and one part for programming in R. I mentioned in the above paragraph that our assignments, while of reasonable difficulty, could be rather long. The coding portion of the final exam had 2 questions, each of which probably required at least 10 pages worth of R-code/plots/analysis exported to a pdf file in order to complete with a decent level of correctness/soundness. All of this was expected to be completed within 2.5 hours while being proctored. This generated a significant outcry from students on Piazza. To my knowledge, few (if any), actually finished the coding portion in its entirety; even the quick/talented/experienced individuals who use R regularly at their jobs were vocal about their displeasure with the amount of work required for that small time window. If we had an extra hour or so to do those questions, things would have been manageable, but as it stands, the instructors essentially tried to cram one of our 2 week 25 page homework projects into a 2.5 hour proctored time slot. That’s just not doable for probably 90+ percent of the class. It remains to be seen if there will be a curve for it, but there will hopefully be something to help the students out, since the majority of people probably only finished the first question with the requisite answer depth/quality (myself included).

    Overall, I would have been pleased with the course, but the highly unreasonable nature of the coding portion for the final exam really reduced my view of it. I hope that future classes either a) get more time to complete that much proctored coding/analysis or b) not have to try and do as much within a 2.5 hour window. If they can improve that, this course will be pretty solid in my view.


    Semester:

    This course needs a lot of work, and in it’s present incarnation, is a waste of your time. You will have to learn regression yourself, from other sources. In the meantime, you will be under pressure to do repetitive, mind numbing homework, take multiple choice exams that don’t test knowledge, but rather how to study the instructors badly worded asking style. I’m not sure i learned anything much in this class.


    Semester:

    This class was remarkably underwhelming. It’s too bad, because Advanced Regression could be one of the best courses in the program. The class isn’t really hard at all, except for the closed book final exam. My workload was around 6-7 hours a week max, and some weeks it was 2-3 hours.

    The lecture quality and poorly worded quiz questions (that often get regraded because the Professors/TAs realize they’re wrong) are the mot frustrating parts. There are multiple videos that the professor says “Wait I need to redo that, can we start over?”. I don’t really care that it’s in the video, but it just shows the lack of effort that is put into the videos and course structure. I took this class in Spring 2019 and former students said they remember the same thing from 2+ years ago.

    A couple of the homeworks were very long, but not challenging at all. You get some copy and paste practice, but not much else. The last homework assignment is probably the most beneficial, but again, there was a lack of guidance on some of the questions, and TA response time was very slow at times (might not be their fault).

    I think a restructuring in the class is needed. This was my third OMSA class, and the other two I thought were outstanding (6040 and 6501). I hope future students get a redesigned class, because Regression is something that can generally be applied right away, even if you’re in a lower level Data Analyst role.


    Semester:

    I will break by feedback into three parts: 1. the material , 2. the assessments, 3. the running of the class.

    1. The class generally covers some interesting and useful topics, however as we have to take ISYE6501 and MGT6203 we have already come across everything in this class at least once. What we get here is some additional statistical depth, and some extra practise with different datasets. I don’t think there is enough here to justify taking the class, even if it was taught well. As for the delivery, the lectures are extremely dry and often confusing. In addition there are a lot of mistakes in lectures and slides which have not all been corrected yet after multiple semesters.

    2. The HWs are split into TF/MC questions and submitted coding reports. The TF/MC questions generally feel like they are either ridiculously straight forward or like they are written to deliberately mislead. They are overall not too bad, but when you get a question wrong it’s usually because of the way the question should be interpreted rather than the difficulty of the question. The coding reports, which are peer reviewed, are usually excessively long and repetitive - they could assess the same material in a couple of questions but instead choose to repeat the same instructions sometimes 10+ times with small variations. The midterm was very fair, even perhaps too easy. The final however was awful - testing areas of the class which were never really covered, potentially incorrect auto-grading, badly worded questions, coding section was too long to be completed reasonably in the time limit…

    3. The class has been really badly run. Aside from the mistakes in the lectures and notes, and the frequent terribly worded questions in HWs and tests, TAs are MIA (or very selective in which questions they answer) for large portions of time, even within the final exam windows. There is often a lack of clarity around things which should be very simple, like test instructions. It just feels in general like minimal care is put into the running of the class, which is really frustrating for students.

    In summary, avoid this class unless you enjoy being frustrated and wasting your time. I really hope for the sake of the program they revisit this class and improve it, as it is quite a foundational topic and will be a first pick for many new students.


    Semester:

    This was the worst course I’ve ever taken in a collegiate setting.

    The worst thing about this class has been unclear instructions and expectations, combined with a lack of timely responses from the TAs and no responses (ever) from the instructor.

    • Questions on homeworks / exams were frequently confusing / poorly-worded
    • The last homework assignment was so poorly designed that it was impossible to know if your result would match the “solution” because the instructors didn’t understand what effect setting a seed has
    • Major issues and questions raised on Piazza went unanswered for days or over a week in some cases
    • Questions that were answered on Piazza were often answered incorrectly or incompletely by the TAs
    • Rules and expectations for exams were changed mid-exam window
    • The practice final solutions weren’t unlocked until a day or two into the final window
    • The closed book section of the final wouldn’t open on day 1 of the finals window
    • This class pretty much requires that you go learn regression on your own somewhere else
    • I’m sure there is more I’m forgetting…

    Lecture Videos:

    • Long, boring, and difficult to follow. There are frequent mistakes in both the words the professor is saying as well as the transcript. It’s so bad that the students have crowd-sourced a more-accurate transcript so people can have a chance at figuring out what’s happening.
    • The professor would throw up a static slide of formulas (e.g. MLE or confidence interval derivations) and talk for several minutes with no real explanations to what was happening on the slide. It was just expected that you would look at it and understand it immediately.

    Homeworks are broken into two parts:

    • T/F & Multiple Choice section - These usually aren’t terrible but the questions are often worded poorly. There were also issues here of how these were submitted in Canvas (in free-form) and then graded by a TA. Towards the end of the semester they realized they should just make all the questions T/F or Multiple Choice in Canvas.
    • A coding section that is peer-reviewed - These have been pretty painful. They are often long and extremely tedious. (e.g. Do X, now do that 17 more times). There are 5 of these in total and at least 2 or 3 ended up being more than a 45 page pdf. How the question is phrased and how you are supposed to setup our problems are often ambiguous, which leads to a lot of problems with peer grading. Many peer graders are very strict when it comes to deviating from the “solutions”

    Midterm - wasn’t terrible. It’s in two parts like the homeworks:

    • Closed book T/F & Multiple Choice questions. The biggest issue here was confusion with the rules. They had posted that people could use a singled-sided formula sheet. However, there was confusion as to what could be on that sheet, which they clarified (and made more strict) halfway through the exam window. So people who took the exam earlier benefited from a more ambiguous rule.
    • Open book (no internet) coding section - This was mostly ok.

    Final - really bad

    • Closed book T/F & Multiple Choice questions - very poorly worded questions and the style is very different than the Midterm. There are also a few topics not mentioned at all in the course up until now.
    • Open book (no internet) coding section - this was basically impossible to finish in time.
    • We’re still in the exam window and a lot of people are reporting scores < 70%


    Semester:

    Spring of 2019 was the first time the whole course was on edX and then the exams through Canvas. It’s only half way through the course, but I think this is a good course. Yes, it is tough, especially if you’ve never really had a statistics course before. (My background is a math degree.) The class uses R and a lot of the code is directly given for both examples to follow along in lectures and in homeworks. Then you write the rest of the analysis. So if you don’t know R well, you’ll still be ok. The hardest part of the class are the T/F assessments that are part of the homework quizzes and the exams. There is some tricky wordings and double negatives, etc. The lectures are challenging to follow but full of information. Dr. Serban offers weekly office hours directly with her, and if you can manage to log in, that is great. The TAs were available via 4 different office hours also and pretty active on Piazza. I found the best resource for the material was the Penn State online notes for Regression (Google: Penn State 501 Regression Analysis). I learned a ton in this class, and I liked that the lectures included several different data sets and that you could follow along with R code and practice. It was a tough class but manageable, especially if you’re already familiar with statistics and regression at an undergrad level.


    Semester:

    Pros:

    1. Real-world, memorable data sets that I think are more interesting than data sets included in R, and will help us remember the material better/longer
    2. Dr. Serban gave weekly office hours - her help in these office hours was tremendous, but also made me feel like she really cared about the course and the students
    3. I appreciated Dr. Serban’s respectful and encouraging attitude towards students, even with some complaining on Piazza and Slack (if she ever checked it).
    4. I felt like the TA’s did a solid job managing with Dr. Serban.
    5. Lots of great examples of the material.


    Semester:

    I love Dr. Serban, she highlights important material, and draws students’ attention to subtleties in material, contrasts concepts that are easy to confuse. She exhibits concern for students, and seems to be a genuinely good person.

    On the flip side, the amount of typos in both lecture and R code, and occasionally office hours is upsetting: it takes a lot of time (sometimes 2-3 hours) to realize that it’s not a disconnect in understanding on the student’s side. Hopefully, these will all be corrected for future semesters. Bonus for those of us with jobs - the ability to study ahead, the last half of the units was released mid-semester.

    The midterm was reasonable (not all 4 hours were needed), the final was awfully long and difficult (to make up for the midterm I guess?), I’m talking 4 hours, all proctortracked, going as fast as humanly possible, and still not having enough time to finish everything. Fortunately, the weight was shifted to favor the midterm when computing the final grade.

    The TA’s were helpful and nice, but they infrequently provided incorrect answers to academic/coding questions, so pay attention.


    Semester:

    I have mixed feelings about this course. Overall, what good you got out of it was marred by the constant mistakes, errors, poor responses on Piazza, and general feeling of disorganization in this course. The videos were fine, kind of long, and really dwelt too heavily on theory and not enough on examples and interpretation. It covered a lot of stuff I knew already (and likely most everyone with an undergrad degree) in too much detail and not enough on the stuff that was net new (basically, anything beyond Multiple Regression). A ton of the interesting newer techniques were in an optional 6th module that we were never tested on or applied in any way.

    The prof was MIA for most of the course on piazza. And this will sound very non-PC, but the TAs also were very poor in their english skills. This made for frustrating exchanges, and VERY POOR QUESTION DESIGNS. This point cannot be said enough – with true/false or multiple choice, it felt like 30% of questions had some deficiency that came down to language interpretation. On essentially every HW or exam, they made changes to question wording because of feedback from students.

    This also speaks a bit to how arbitrary some of this course felt. Especially when it came to interpreting residual plots – it felt like the prof or TAs could essentially claim either way (“this assumption HOLDS” or “this assumption does NOT hold”) while I was scratching my head, thinking there has to be a more scientific way to assess the model assumptions. There probably is, but I won’t leave this course knowing them.

    The final exam was also a complete gong show. Restricted use of the internet was NOT stated in the exam guidelines, which themselves were put an unstickied post. Then halfway through the exam window, the prof posts that we couldn’t use the internet for research during the exam. I and many others were completely confused and her response boiled down to “well whatever, it’s fine either way”. The exam itself was INSANELY hard compared to what was seen during the course, and a 4 hour window was not sufficient to finish. I’ve never had an exam feel so disconnected from the course material. To me, it was a further reflection of the lack of care put into this class: they probably didn’t even stop to realize how hard that exam was going to be. They just threw some questions together and thought it was good enough.

    So, after this course, I feel like I could have received what I’ll leave this class with in about 2 condensed hours of instruction. A couple of interesting use cases, but really not much beyond what we already learned in 6501 or in previous undergrad. A bit of a disappointment – I was hoping to get so much better armed leaving this class with interesting new techniques and tools. At the very least, the lecture notes did include code snippets that I’ll be able to use in the future.


    Semester:

    Course was taken in Fall 2018.

    I know not everyone wants to work through derivations and proofs, but this is a course where a little more depth might actually make the material feel easier and feel a little less rote/ad hoc. Overall, I did learn useful things but still feel like I need to pick up a book that can complete my understanding of regression analysis. The professor is nice and very knowledgeable, so I wonder if this is just a course that hasn’t adapted as well to an online format.


    Semester:

    You will learn regression, but that is mostly as a result of confusing homework and exams that force you to find outside sources for the majority of the content covered in class. Although you will mostly be relying on outside material for the conceptual component, the professor does a good job walking through the regression using R. This class is not very time consuming, but it is very confusing. If this class was cleaned up it would be a really good course, but I can’t recommend it in its current state.


    Semester:

    Overall I enjoyed this course – it provided a lot of the depth I needed to take the Time Series Course, I just wish I had taken this one first (not that this is expressly a prereq for Time Series, but it’ll make your life easier).

    In true Prof. Serban form, there are a few questions that could be worded better (or…more accurately to what she’s asking?) and the logistics of the course could use some work. But I learned a lot and am applying it in my day-to-day now (looking at you, logistic and poisson regression models!)

    Also, not a bad class in terms of workload. If you take MGT 6203 first, the first module will be fairly easy. The homework’s aren’t terrible if you follow along with the data example lectures.


    Semester:

    If you already took ISYE-6501 with Prof. Sokol, you are ready for this course, as this course use a lot R programming. But in 6501, we only taught to use R commands without knowing in details the underlying theory, math and model assumptions. In this course, Prof. Serban teach us in details the theory, assumption and in practice in R how to tell if the the assumptions holds or not, interpreting the model, and also what things we can do to make the model fit the data better. And of course you will know a lot more than just linear regression, like logistic, poisson regression etc.

    In short, this course will teach you a lot of useful stuff. Be brave about the math details, its there for a good reason. Highly recommended course for anyone poised to be data scientist.

    One thing that needs improvement is the homework, some of them has questions/problems thats confusing, misleading and unclear.


    Semester:

    Overall good course. Pace was sufficient especially for summer. A decent course to balance with a harder one.


    Semester:

    This is a really good course to take over the summer when you have to plan around vacations, moving, and other summer fun. Overall, I would say it is easy to get a B in this class, though you do have to work to get an A. The lecture material is not enough to do the HWs if you don’t already have a good base in probability and stats (which I didn’t). In summer, we had 4 HWs, 1 midterm, and 1 final.


    Semester:

    Overall course contents are good. Quality of course material needs attention in terms of errors correction before the course starts. This includes the Homework assignment questions correction as well. Because of summer (could be) there were only 2 TAs, Piazza questions resolutions were delayed most of the times. Many questions were unresolved even after the course ended. More frustrating was answers provided by one of TA used to be incorrect. Professor used to have a session per week but due to mid-morning timings, it was not possible for many students to attend that session. Personally, I could attend only 1 such session. To succeed in this course you need to spend upto 5 extra hrs on your research and study beyond the material provided.
    Case studies and programs were provided in the course material which helped a lot for homeworks but again due to material quality and incorrectness, you need to make sure interpretations for those are valid. ( Mainly logistic and Poisson regressions chapters.) Exams are okay and it is not very hard to get grade A in this course. It would have been great if course provided more opportunities to work on more real life use cases than homework with 100 variables which was of no use. Hopefully material quality gets attention before next semester.


    Semester:

    As others have mentioned, the videos and assignments need some editing. I personally did not find the videos very engaging compared to other courses. They are essentially Prof Serban standing in front of a camera reading slides for an entire semester. Many of the questions on the quizzes/midterms/etc seemed to be open for interpretation and were debated extensively on Piazza. Many of the multiple choice & T/F questions were also very focused on technical theory (i.e “what is the sampling distribution of the prediction of future responses under the Poisson regression model”) , which is fine for Stats majors, but for OMSA, I will not be retaining this level of detail. The examples were great and the R code provided with the examples gave a great starting point for HWs/Exams. For me, the workload was pretty light. It was essentially 1 week watching videos, 1 week HW, 1 week peer review… repeat 5x & mix in midterms/final. I averaged maybe 4 hours a week and finished with B+. Also- office hours w/ Prof Serban were @ 4 ET- they were recorded, but with only 1-2 students showing up, they were not very useful. I did learn a great deal about regression, but just did not feel engaged by this format.


    Semester:

    There are a couple of things going into this class that made it easy for myself personally, namely some stats background and some coding background. Overall, the course was very descriptive when it came to forcing the understanding of the few types of regression models discussed in the class. After taking this class, the takeaways should be the model code and the interpretation of the models both in a statistical way and in layman’s terms. The course material itself is quite straightforward, plug-and-play in just about every way and requires more regurgitation skill rather than critical thinking (making the exam open book made the course disgustingly easy).

    There are no surprises in this class - everything on the exam will have been discussed in enough detail in either the lecture videos or homework. If the question was ambiguous or flat-out wrong, the professor would personally review and clarify, even refunding lost points.

    5 homework assignments, each having a multiple choice section for a small portion of the grade and then a couple of self/peer reviewed sections having to do with the data analysis. Each one had a few weeks’ worth of time to complete, so the timing was quite reasonable unless you procrastinated. 3 exams with the same format as the homework, just spanning additional subjects and 2.5 hours to complete. All reasonable, unless you had a bug in your code.

    Only two drawbacks in this class - the material is incredibly dry and repetitive. There are 15 minute lectures for what could be explained at a high level in 3, but get drawn out and watched regardless (easily half of the week’s workload would be taking notes). The key takeaways will be identified in the lecture through boxes (hard copies of the powerpoints screw with physical note-takers like myself, though).

    The second drawback is the lack of statistical discipline by the professor at times. There has been some confusion when the professor uses terms that are not correct in a statistical sense (goodness-of-fit vs. predicting power) early in the course that screwed with lectures later in the course. During this semester, a couple of the videos could have been re-recorded due to some mistakes (I expect this to improve over the course of multiple iterations).

    Again, overall a good course to learn a few powerful, easy-to-understand models and a fairly reasonable class in terms of workload to pair with a perhaps more workload intensive class.


    Semester:

    The course was structured in a good way - videos on theory followed by a practical example. After taking the course, I’d feel comfortable building a production regression model.

    The course videos require serious editing/redoing. It’s obvious from some of them that they weren’t edited at all.


    Semester:

    I spent 120 hours on this course, or 8 hours on average over the 15 week period. Some weeks were heavier than others. I enjoyed the course! The course material is very interesting if you want a solid grounding in regression (linear, logistic, ANOVA, Poisson) and in Variable Selection; I enjoyed the part on Variable Selection most. The course assignments are in R, but it’s not too difficult because R code is always provided and well explained with the lectures; you then adapt this code for the 5 homework assignments, the 2 midterms and the final.

    The problem with this course is that sometimes communication is not clear, multiple choice questions are not well formulated, and there are errors in homework (exercises and solutions). Course staff made an effort to accommodate complaints. But I hope that the course material will be improved for future courses. I took this course in Spring 2018, but think it’s doable in the shorter Summer term if you count approximately 12 hours per week .


    Semester:

    The content of this course is good. If you come into this course as I did with very little previous education on linear regression, its a great class to start with. If you don’t understand regression and don’t know R, I would actually suggest this course as your first course in the program instead of ISYE 6501. Prof Serban does a nice job taking you through examples in R line by line to help you understand what you are doing. The biggest weakness of this course is the general organizational stuff. There were tons of typos in the homeworks and due dates seemed to shift. The course could also do with a pass through the video transcription to better match the video. Prof Serban, in the couple of instances I interacted with her, was generally flexible and accommodating of issues and feedback from students. My only other complaint is that I would have liked to spend more time working with the models talked about in unit six (non linear regression, non parametric regression, etc) The class focuses on Linear Regresssion, ANOVA, Logistic Regression, Poisson Regression and Variable Selection.


    Semester:

    There are 5 assignments, two midterms (2.5 hours), and a final (4 hours). The second midterm and the final were open notes. I would say workload averaged 7 hours a week, but generally I worked in chunks and would do one 15-20 hour week per homework assignment. The assignments were released early enough that we generally had 2-3 weeks to work on them. Lectures were longer than other OMSA classes, but the assignments weren’t extremely time intensive if you paid attention to the data examples in the lectures.

    By far the biggest challenge in this course was miscommunication and a lack of clarity around terms. The course videos frequently had mistakes, and the course staff did a bad job of communicating (both notifying about and correcting the errors and also about other course logistics). The true/false questions on the assignments and midterms were extremely frustrating because the language often was not clear.

    All that said, I really enjoyed the course material and felt the class was a very reasonable workload. The material can be a little repetitive, so some people may find the class a little boring. This is a course that could easily be taken in a summer semester, especially if you’ve taken a statistics course before.


    Semester:

    I have some experience in regression and found this class to be hard. Not because of the content but due to the style of the videos and logistical issues. This was the first semester the course was offered online and they were working out a ton of bugs. Deadlines and test styles changed throughout the semester. We were not told we could work together until it came out just a few days before the final. I am very glad I took the course, and the topics are really solid to know. The videos are very flat, with lots of room for multimedia improvement there. The concepts start high level, creating a learning cliff. All that aside, I would recommend this course to anyone in OMSA. I agree with the other reviews here: Solid topics, evolving materials.


    Semester: