Pycharm安装en_core_web_sm失败出现Could not find a version that satisfies the requirement spacy报错

使用Pycharm在复现Understanding Dataset Difficulty with V-Usable Information (ICML 2022, outstanding paper)在GitHub的代码中存在关于en_core_web_sm没有模块无法安装的问题

我在官方网站下载了en_core_web_sm的3.0.0版本tag包,但是使用pip install en_core_web_sm_的地址 出现了无法安装的问题

PS C:\Users\000\Downloads> pip install "C:\Users\000\Downloads\en_core_web_sm-3.0.0.tar.gz"

结果及报错

Processing c:\users\000\downloads\en_core_web_sm-3.0.0.tar.gz
  Preparing metadata (setup.py) ... done
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
ERROR: Could not find a version that satisfies the requirement spacy<=3.0.0, >3.0.0 (from en-core-web-sm) (from versions: none)
ERROR: No matching distribution found for spacy<=3.0.0, >3.0.0
WARNING: There was an error checking the latest version of pip.

我的版本

Flask    2.2.2    
GitPython    3.1.30    
HeapDict    1.0.1    
Jinja2    3.1.2    
Lasagne    0.1    
Mako    1.2.4    
Markdown    3.4.1    
MarkupSafe    2.1.1    
Pillow    9.3.0    
PyJWT    2.6.0    
PyMySQL    1.0.2    
PyWavelets    1.4.1    
PyYAML    6.0    
SQLAlchemy    1.4.46    
Theano    1.0.5    
Tree    0.2.4    
Werkzeug    2.2.2    
XlsxWriter    3.0.3    
absl-py    1.3.0    
accelerate    0.16.0    
aiohttp    3.8.3    
aiosignal    1.3.1    
alembic    1.9.2    
astunparse    1.6.3    
async-timeout    4.0.2    
attrs    22.2.0    
blis    0.7.9    
cachetools    5.2.0    
catalogue    2.0.8    
certifi    2022.9.24    
charset-normalizer    2.1.1    
click    8.1.3    
cloudpickle    2.2.1    
colorama    0.4.6    
confection    0.0.4    
contourpy    1.0.6    
cycler    0.11.0    
cymem    2.0.7    
dask    2023.1.1    
databricks-cli    0.17.4    
datasets    2.9.0    
dill    0.3.6    
distributed    2023.1.1    
docker    6.0.1    
entrypoints    0.4    
filelock    3.9.0    
flatbuffers    22.12.6    
fonttools    4.38.0    
frozenlist    1.3.3    
fsspec    2023.1.0    
gast    0.4.0    
gitdb    4.0.10    
google-auth    2.15.0    
google-auth-oauthlib    0.4.6    
google-pasta    0.2.0    
greenlet    2.0.1    
grpcio    1.51.1    
h5py    3.7.0    
huggingface-hub    0.11.1    
idna    3.4    
imageio    2.22.4    
importlib-metadata    5.2.0    
itsdangerous    2.1.2    
jax    0.4.1    
joblib    1.2.0    
keras    2.11.0    
kiwisolver    1.4.4    
langcodes    3.3.0    
libclang    14.0.6    
lightgbm    3.3.5    
llvmlite    0.39.1    
locket    1.0.0    
matplotlib    3.6.1    
mlflow    2.1.1    
msgpack    1.0.4    
multidict    6.0.4    
multiprocess    0.70.14    
murmurhash    1.0.9    
mysql    0.0.3    
mysql-connector-python    8.0.31    
mysqlclient    2.1.1    
natsort    8.2.0    
necessary    0.3.1    
networkx    2.8.8    
nltk    3.8.1    
numba    0.56.4    
numpy    1.23.4    
oauthlib    3.2.2    
opencv-python    4.6.0.66    
opt-einsum    3.3.0    
packaging    21.3    
pandas    1.5.1    
partd    1.3.0    
pathy    0.10.1    
patsy    0.5.3    
pip    22.3.1    
preshed    3.0.8    
protobuf    3.19.6    
psutil    5.9.4    
py3Dmol    1.8.1    
pyarrow    10.0.1    
pyasn1    0.4.8    
pyasn1-modules    0.2.8    
pydantic    1.10.2    
pyparsing    3.0.9    
pytesseract    0.3.10    
python-dateutil    2.8.2    
pytz    2022.6    
pywin32    305    
pywttr-models    1.0.2    
querystring-parser    1.2.4    
regex    2022.10.31    
requests    2.28.1    
requests-oauthlib    1.3.1    
responses    0.18.0    
rsa    4.9    
scikit-image    0.19.3    
scikit-learn    1.1.3    
scipy    1.9.3    
seaborn    0.12.1    
setuptools    60.2.0    
shap    0.41.0    
six    1.16.0    
slicer    0.0.7    
smart-open    6.3.0    
smmap    5.0.0    
sns    0.1    
sortedcontainers    2.4.0    
spacy    3.5.0    
spacy-legacy    3.0.12    
spacy-loggers    1.0.4    
spacytextblob    4.0.0    
sqlparse    0.4.3    
srsly    2.4.5    
statsmodels    0.13.5    
stumpy    1.11.1    
svgwrite    1.4.3    
svm    0.1.0    
tabulate    0.9.0    
tblib    1.7.0    
tensorboard    2.11.0    
tensorboard-data-server    0.6.1    
tensorboard-plugin-wit    1.8.1    
tensorflow    2.11.0    
tensorflow-estimator    2.11.0    
tensorflow-intel    2.11.0    
tensorflow-io-gcs-filesystem    0.29.0    
termcolor    2.1.1    
textblob    0.15.3    
the    0.1.5    
thinc    8.1.7    
threadpoolctl    3.1.0    
tifffile    2022.10.10    
timm    0.6.12    
tokenizers    0.13.2    
toolz    0.12.0    
torch    1.13.1    
torchvision    0.14.1    
tornado    6.2    
tqdm    4.64.1    
transformers    4.26.0    
tsfresh    0.20.0    
typer    0.7.0    
typing-extensions    4.4.0    
urllib3    1.26.12    
utils    1.0.1    
waitress    2.1.2    
wasabi    1.1.1    
websocket-client    1.4.2    
wheel    0.37.1    
wrapt    1.14.1    
xgboost    1.7.0    
xmltodict    0.13.0    
xxhash    3.2.0    
yarl    1.8.2    
zict    2.2.0    
zipp    3.11.0    

首先,我并没有在pycharm中找到该模块,因此我从网上下载了en_core_web_sm-3.0.0.tar.gz,并且尝试使用pip进行安装,但是并没有成功,这个报错是因为版本吗,应该如何适配?

该回答引用ChatGPT
请参考下面的方法,如果 可行还请 点击 采纳 感谢!

这可能是由于您的代理服务器使用的是HTTP协议,而不是HTTPS协议导致的。 您可以尝试更改代理URL为HTTP。您也可以考虑使用管理员身份运行命令行,或者在命令行中使用以下命令:pip install --trusted-host pypi.org en_core_web_sm。

https://blog.csdn.net/weishuai90/article/details/128750678 可以参考一下。whl文件直接安装

提供参考实例【NLP工具最新版Spacy及语言包en_core_web_sm下载安装指南】,链接:https://blog.csdn.net/henanlion/article/details/117446125