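While reproducing the GitHub code for Understanding Dataset Difficulty with V-Usable Information (ICML 2022, outstanding paper) in PyCharm, I ran into a problem: the en_core_web_sm module is missing and I cannot install it.
I downloaded the en_core_web_sm 3.0.0 release archive (tar.gz) from the official website, but running pip install against the path of the downloaded file fails: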
PS C:\Users\000\Downloads> pip install "C:\Users\000\Downloads\en_core_web_sm-3.0.0.tar.gz"
Output and error:
Processing c:\users\000\downloads\en_core_web_sm-3.0.0.tar.gz
Preparing metadata (setup.py) ... done
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=1, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
WARNING: Retrying (Retry(total=0, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProxyError('Your proxy appears to only use HTTP and not HTTPS, try changing your proxy URL to be HTTP. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#https-proxy-error-http-proxy', SSLError(SSLError(1, '[SSL: WRONG_VERSION_NUMBER] wrong version number (_ssl.c:1123)')))': /simple/spacy/
ERROR: Could not find a version that satisfies the requirement spacy<=3.0.0, >3.0.0 (from en-core-web-sm) (from versions: none)
ERROR: No matching distribution found for spacy<=3.0.0, >3.0.0
WARNING: There was an error checking the latest version of pip.
My installed package versions:
Flask 2.2.2
GitPython 3.1.30
HeapDict 1.0.1
Jinja2 3.1.2
Lasagne 0.1
Mako 1.2.4
Markdown 3.4.1
MarkupSafe 2.1.1
Pillow 9.3.0
PyJWT 2.6.0
PyMySQL 1.0.2
PyWavelets 1.4.1
PyYAML 6.0
SQLAlchemy 1.4.46
Theano 1.0.5
Tree 0.2.4
Werkzeug 2.2.2
XlsxWriter 3.0.3
absl-py 1.3.0
accelerate 0.16.0
aiohttp 3.8.3
aiosignal 1.3.1
alembic 1.9.2
astunparse 1.6.3
async-timeout 4.0.2
attrs 22.2.0
blis 0.7.9
cachetools 5.2.0
catalogue 2.0.8
certifi 2022.9.24
charset-normalizer 2.1.1
click 8.1.3
cloudpickle 2.2.1
colorama 0.4.6
confection 0.0.4
contourpy 1.0.6
cycler 0.11.0
cymem 2.0.7
dask 2023.1.1
databricks-cli 0.17.4
datasets 2.9.0
dill 0.3.6
distributed 2023.1.1
docker 6.0.1
entrypoints 0.4
filelock 3.9.0
flatbuffers 22.12.6
fonttools 4.38.0
frozenlist 1.3.3
fsspec 2023.1.0
gast 0.4.0
gitdb 4.0.10
google-auth 2.15.0
google-auth-oauthlib 0.4.6
google-pasta 0.2.0
greenlet 2.0.1
grpcio 1.51.1
h5py 3.7.0
huggingface-hub 0.11.1
idna 3.4
imageio 2.22.4
importlib-metadata 5.2.0
itsdangerous 2.1.2
jax 0.4.1
joblib 1.2.0
keras 2.11.0
kiwisolver 1.4.4
langcodes 3.3.0
libclang 14.0.6
lightgbm 3.3.5
llvmlite 0.39.1
locket 1.0.0
matplotlib 3.6.1
mlflow 2.1.1
msgpack 1.0.4
multidict 6.0.4
multiprocess 0.70.14
murmurhash 1.0.9
mysql 0.0.3
mysql-connector-python 8.0.31
mysqlclient 2.1.1
natsort 8.2.0
necessary 0.3.1
networkx 2.8.8
nltk 3.8.1
numba 0.56.4
numpy 1.23.4
oauthlib 3.2.2
opencv-python 4.6.0.66
opt-einsum 3.3.0
packaging 21.3
pandas 1.5.1
partd 1.3.0
pathy 0.10.1
patsy 0.5.3
pip 22.3.1
preshed 3.0.8
protobuf 3.19.6
psutil 5.9.4
py3Dmol 1.8.1
pyarrow 10.0.1
pyasn1 0.4.8
pyasn1-modules 0.2.8
pydantic 1.10.2
pyparsing 3.0.9
pytesseract 0.3.10
python-dateutil 2.8.2
pytz 2022.6
pywin32 305
pywttr-models 1.0.2
querystring-parser 1.2.4
regex 2022.10.31
requests 2.28.1
requests-oauthlib 1.3.1
responses 0.18.0
rsa 4.9
scikit-image 0.19.3
scikit-learn 1.1.3
scipy 1.9.3
seaborn 0.12.1
setuptools 60.2.0
shap 0.41.0
six 1.16.0
slicer 0.0.7
smart-open 6.3.0
smmap 5.0.0
sns 0.1
sortedcontainers 2.4.0
spacy 3.5.0
spacy-legacy 3.0.12
spacy-loggers 1.0.4
spacytextblob 4.0.0
sqlparse 0.4.3
srsly 2.4.5
statsmodels 0.13.5
stumpy 1.11.1
svgwrite 1.4.3
svm 0.1.0
tabulate 0.9.0
tblib 1.7.0
tensorboard 2.11.0
tensorboard-data-server 0.6.1
tensorboard-plugin-wit 1.8.1
tensorflow 2.11.0
tensorflow-estimator 2.11.0
tensorflow-intel 2.11.0
tensorflow-io-gcs-filesystem 0.29.0
termcolor 2.1.1
textblob 0.15.3
the 0.1.5
thinc 8.1.7
threadpoolctl 3.1.0
tifffile 2022.10.10
timm 0.6.12
tokenizers 0.13.2
toolz 0.12.0
torch 1.13.1
torchvision 0.14.1
tornado 6.2
tqdm 4.64.1
transformers 4.26.0
tsfresh 0.20.0
typer 0.7.0
typing-extensions 4.4.0
urllib3 1.26.12
utils 1.0.1
waitress 2.1.2
wasabi 1.1.1
websocket-client 1.4.2
wheel 0.37.1
wrapt 1.14.1
xgboost 1.7.0
xmltodict 0.13.0
xxhash 3.2.0
yarl 1.8.2
zict 2.2.0
zipp 3.11.0
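First, I could not find this module in PyCharm, so I downloaded en_core_web_sm-3.0.0.tar.gz from the web and tried to install it with pip, but that did not succeed. Is this error caused by a version mismatch, and how should I make the versions compatible?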
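This answer quotes ChatGPT.
Please try the approach below; if it works, please accept the answer. Thanks!
This is probably because your proxy server speaks HTTP rather than HTTPS. Try changing the proxy URL so that it uses HTTP. You could also try running the command line as administrator, or use the following command: pip install --trusted-host pypi.org en_core_web_sm.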
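The proxy suggestion above can be checked with a short script. This is only a sketch: it assumes the proxy is configured through the usual HTTPS_PROXY / HTTP_PROXY / ALL_PROXY environment variables (a proxy set elsewhere, for example in the Windows system settings or in pip's config file, will not show up here), and the address 127.0.0.1:7890 in the comments is a hypothetical example, not a value taken from the question.

```python
from urllib.parse import urlsplit, urlunsplit


def to_http_proxy(proxy_url: str) -> str:
    """Return proxy_url with an https:// scheme replaced by http://.

    URLs that already use another scheme are returned unchanged.
    """
    parts = urlsplit(proxy_url)
    if parts.scheme.lower() == "https":
        parts = parts._replace(scheme="http")
    return urlunsplit(parts)


def suspicious_proxy_settings(env) -> dict:
    """Map each proxy variable whose value starts with https:// to an http:// suggestion.

    `env` is any mapping of variable names to values, e.g. os.environ.
    """
    suggestions = {}
    for name in ("HTTPS_PROXY", "HTTP_PROXY", "ALL_PROXY"):
        # Proxy variables are read in upper or lower case depending on the tool.
        value = env.get(name) or env.get(name.lower()) or ""
        if value.lower().startswith("https://"):
            suggestions[name] = to_http_proxy(value)
    return suggestions


if __name__ == "__main__":
    import os

    found = suspicious_proxy_settings(os.environ)
    if not found:
        print("No proxy variable with an https:// URL was found.")
    for name, suggestion in found.items():
        # e.g. HTTPS_PROXY: https://127.0.0.1:7890 -> http://127.0.0.1:7890 (hypothetical address)
        print(f"{name}: consider {suggestion}")
```

If the script reports a variable, the suggested http:// URL can be set in that variable or passed to pip through its --proxy option before retrying the install.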
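You can refer to https://blog.csdn.net/weishuai90/article/details/128750678, which installs the whl file directly.
Here is a reference example, "Download and Installation Guide for the Latest spaCy NLP Tool and the en_core_web_sm Language Package": https://blog.csdn.net/henanlion/article/details/117446125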
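On the version question itself: the package list shows spacy 3.5.0, while the downloaded archive is en_core_web_sm 3.0.0. Assuming spaCy's usual convention that a model package targets the spaCy release with the same major.minor number, pip would treat the installed spaCy as unsuitable and try to fetch another one from PyPI, which is where the proxy error appears. The sketch below checks that assumption locally; the helper names (`release_series`, `model_fits_spacy`, `installed_version`) are made up for illustration.

```python
import re
from importlib import metadata
from typing import Optional


def release_series(version: str) -> str:
    """Return the 'major.minor' part of a version string such as '3.5.0'."""
    match = re.match(r"(\d+)\.(\d+)", version.strip())
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return f"{match.group(1)}.{match.group(2)}"


def model_fits_spacy(model_version: str, spacy_version: str) -> bool:
    """True when the model and spaCy share the same major.minor series.

    Assumption: model packages are built for one spaCy minor release.
    """
    return release_series(model_version) == release_series(spacy_version)


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    spacy_version = installed_version("spacy")
    if spacy_version is None:
        print("spaCy is not installed in this interpreter")
    else:
        print(f"spaCy {spacy_version} is installed; look for an en_core_web_sm "
              f"{release_series(spacy_version)}.x release")
        print("the 3.0.0 archive fits:", model_fits_spacy("3.0.0", spacy_version))
```

With the versions from the question, the check reports a mismatch, so downloading the model release that matches the installed spaCy (or installing a spaCy from the 3.0 series) is likely the simpler fix. When installing a local archive or whl file, pip's --no-deps flag skips dependency resolution so that no network access is needed, and `python -m spacy validate` can then confirm whether the installed model and spaCy are compatible.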