python learning route

reptile

  1. Basic knowledge: basic principles of website, html, python, multi-process/multi-thread/coroutine, etc. (must learn)

  2. HTML foundation, network request module: requests (must learn), urllib (you can understand)

  3. Need to understand some common anti-climbing strategies and corresponding solutions: common ones include IP frequency restrictions, User-Agent, Referer, Origen verification, cookie restrictions, dynamic loading and verification codes, etc.,

Corresponding processing methods include IP proxy pool, forged header, cookie storage and processing (basic advanced)

  1. Web page analysis extraction: Beautifulsoup&Xpath (choose one), regular expression (must learn)

  2. Dynamic execution of JS, js encryption and Selenium, OCR recognition or coding platform (optional)

  3. Data storage (file read and write, database, Excel/CSV module, etc.) (required)

  4. Network packet capture analysis (optional)

  5. Crawler framework: Scrapy (optional), pyspider (optional)

  6. Distributed crawler (optional)

Data analysis and processing

  1. Basic knowledge: python (functions, modules, object-oriented), regular expressions, JSON (must learn)

  2. Related to the above crawler:

·Basic knowledge: basic principles of website, html, python, multi-process/multi-thread/coroutine, etc. (required)

·HTML basis, network request module: requests (must learn), urllib (you can understand)

·Need to understand some common anti-climbing strategies and corresponding solutions: common IP frequency restrictions, User-Agent, Referer, Origen verification, Cookie restrictions, dynamic loading and verification codes, etc.,

Corresponding processing methods include IP proxy pool, forged header, cookie storage and processing (basic advanced)

·Web page analysis and extraction: Beautifulsoup&Xpath (choose one), regular expression (must learn)

·Dynamic execution of JS, js encryption and Selenium, OCR recognition or coding platform (optional)

·Data storage (file read and write, database, Excel/CSV module, etc.) (required)

  1. Data analysis related libraries: Pandas, Numpy, Scipy, stutter analysis, etc. (required)

  2. Chart drawing and visualization: Matplotlip, word cloud (must learn)

Big data (data mining, machine learning)

  1. Basic knowledge: python (basic + advanced) (must learn)

  2. Finance, statistics, econometrics, investment (required)

  3. Data storage (file read and write, database, Excel/CSV module, etc.) (required)

  4. Data analysis related libraries: Pandas, Numpy, Scipy, stammering word segmentation (must learn)

  5. Chart drawing and visualization: Matplotlip, etc. (required)

  6. Machine learning related model knowledge: Naive Bayes, decision tree, Logistic regression, linear regression, KNN algorithm, SVM,

Boosting, clustering, recommendation system, pLSA, LDA, GDBT, Regularization, anomaly detection, EM algorithm, Apriori,

FP Growth, etc. (required)

  1. Machine learning related libraries: sklearn (required), keras, statsmodels, tensorflow (optional)

Recommended Posts

python learning route
python list learning
Python entry learning materials
Python3 entry learning four.md
Python function basic learning
python_ crawler basic learning
python_ regular expression learning
Python3 entry learning three.md
Python3 entry learning one.md
Python3 entry learning two.md
Python regular expression quick learning
Python programming Pycharm fast learning
Getting started python learning steps
Python magic function eval () learning
Python regular expression learning small example
Learning Python third day one-line function
Learning path of python crawler development
Python learning os module and usage
Python multithreading
Python CookBook
Python FAQ
Python3 dictionary
python (you-get)
Python string
Python basics
Python descriptor
Python basics 2
Python exec
Python notes
Python3 tuple
CentOS + Python3.6+
Two days of learning the basics of Python
Python advanced (1)
Python decorator
Python IO
Python multithreading
Python toolchain
Python3 list
Python multitasking-coroutine
Python overview
python introduction
Python analytic
Python basics
07. Python3 functions
Python basics 3
Python multitasking-threads
Python functions
python sys.stdout
python operator
Python entry-3
Centos 7.5 python3.6
Python string
python queue Queue
Python basics 4
Python basics 5
Recommendations of a few websites for learning Python
How about learning python at the age of 27?