Science Services

Python is a high-level and general-purpose programming language. As this definition implies, you can use Python for several purposes, from web development to data science, machine learning, and robotic. Real-world use cases are expansive. At Brandeis Data Services, we focus on the use of python programming for data analytics to support research, but the Brandeis Library has resources on many more applications that are available to you!

Installing Python

Python 3 has many popular scientific libraries. There are many ways to install Python, but installing Anaconda ensures the installation of key libraries and environments. Detailed installation instructions for various operating systems can be found on The Carpentries template website for workshops and in Anaconda documentation.

Data Services Workshops

This asynchronous resource starts at Python fundamentals for beginners and covers lists and dictionaries, NumPy, Pandas, and creating figures.

Use statistical models to estimate a simple relationship between two noisy sets of observations through the common framework of linear regression. We will use the statistical graphics library, seaborn, for this integral exercise in statistical analysis.

Practice visualizing how variables in your dataset relate to each other and depend on other variables. We will use the statistical graphics library, seaborn, for this integral exercise in statistical analysis.

Python Resources Online

Python Resources at Brandeis Library

There are many sources of information for getting started in Data Science. If you are interested in particular techniques or materials, see the Brandeis Library resources below and consult with Data Services for guidance in pursuing your interests and setting up your programming environment.

Brandeis Library patrons can access many of the latest texts on Data Science methods through OneSearch. Just a few examples are:

Python for the Life Sciences by Alexander Lancaster; Gordon Webster Treat yourself to a lively, intuitive, and easy-to-follow introduction to computer programming in Python. The book was written specifically for biologists with little or no prior experience of writing code - with the goal of giving them not only a foundation in Python programming, but also the confidence and inspiration to start using Python in their own research. Virtually all of the examples in the book are drawn from across a wide spectrum of life science research, from simple biochemical calculations and sequence analysis, to modeling the dynamic interactions of genes and proteins in cells, or the drift of genes in an evolving population. Best of all, Python for the Life Sciences shows you how to implement all of these projects in Python, one of the most popular programming languages for scientific computing. If you are a life scientist interested in learning Python to jump-start your research, this is the book for you. What You'll Learn  Write Python scripts to automate your lab calculations Search for important motifs in genome sequences Use object-oriented programming with Python Study mining interaction network data for patterns Review dynamic modeling of biochemical switches Who This Book Is For Life scientists with little or no programming experience, including undergraduate and graduate students, postdoctoral researchers in academia and industry, medical professionals, and teachers/lecturers. "A comprehensive introduction to using Python for computational biology. A lovely book with humor and perspective"  -- John Novembre, Associate Professor of Human Genetics, University of Chicago and MacArthur Fellow "Fun, entertaining, witty and darn useful. A magical portal to the big data revolution" -- Sandro Santagata, Assistant Professor in Pathology, Harvard Medical School"Alex and Gordon's enthusiasm for Python is contagious" -- Glenys Thomson Professor of Integrative Biology, University of California, Berkeley

ISBN: 1484245229 Publication Date: 2019-09-28

A Python Data Analyst's Toolkit by Gayathri Rajagopalan Explore the fundamentals of data analysis, and statistics with case studies using Python. This book will show you how to confidently write code in Python, and use various Python libraries and functions for analyzing any dataset. The code is presented in Jupyter notebooks that can further be adapted and extended. This book is divided into three parts - programming with Python, data analysis and visualization, and statistics. You'll start with an introduction to Python - the syntax, functions, conditional statements, data types, and different types of containers.  You'll then review more advanced concepts like regular expressions, handling of files, and solving mathematical problems with Python.  The second part of the book, will cover Python libraries used for data analysis. There will be an introductory chapter covering basic concepts and terminology, and one chapter each on NumPy(the scientific computation library), Pandas (the data wrangling library) and visualization libraries like Matplotlib and Seaborn. Case studies will be included as examples to help readers understand some real-world applications of data analysis.  The final chapters of book focus on statistics, elucidating important principles in statistics that are relevant to data science. These topics include probability, Bayes theorem, permutations and combinations, and hypothesis testing (ANOVA, Chi-squared test, z-test, and t-test), and how the Scipy library enables simplification of tedious calculations involved in statistics. What You'll Learn Further your programming and analytical skills with Python Solve mathematical problems in calculus, and set theory and algebra with Python Work with various libraries in Python to structure, analyze, and visualize data Tackle real-life case studies using Python Review essential statistical concepts and use the Scipy library to solve problems in statistics  Who This Book Is For Professionals working in the field of data science interested in enhancing skills in Python, data analysis and statistics.

ISBN: 9781484263983 Publication Date: 2020-12-23

Numerical Python by Robert Johansson Leverage the numerical and mathematical modules in Python and its standard library as well as popular open source numerical Python packages like NumPy, SciPy, FiPy, matplotlib and more. This fully revised edition, updated with the latest details of each package and changes to Jupyter projects, demonstrates how to numerically compute solutions and mathematically model applications in big data, cloud computing, financial engineering, business management and more.  Numerical Python, Second Edition, presents many brand-new case study examples of applications in data science and statistics using Python, along with extensions to many previous examples. Each of these demonstrates the power of Python for rapid development and exploratory computing due to its simple and high-level syntax and multiple options for data analysis.  After reading this book, readers will be familiar with many computing techniques including array-based and symbolic computing, visualization and numerical file I/O, equation solving, optimization, interpolation and integration, and domain-specific computational problems, such as differential equation solving, data analysis, statistical modeling and machine learning. What You'll Learn Work with vectors and matrices using NumPy Plot and visualize data with Matplotlib Perform data analysis tasks with Pandas and SciPy Review statistical modeling and machine learning with statsmodels and scikit-learn Optimize Python code using Numba and Cython Who This Book Is For Developers who want to understand how to use Python and its related ecosystem for numerical computing.