29 Commits

Author SHA1 Message Date
yuxin-pc
df4c8cceac 新的proto定义 2026-02-12 08:57:47 +08:00
DELL
d023703622 [linkedin] 用户基本信息采集 2026-01-28 11:00:03 +08:00
DELL
959ffe6b2e [facebook]driver添加cookie,优化页面采集流程 2026-01-26 16:54:28 +08:00
DELL
93a8ff5ef4 [twitter]es_isrepost状态修改 2026-01-26 16:53:51 +08:00
DELL
4d2035b9fa [twitter]推文翻译:若无需翻译则返回空 2026-01-23 14:42:09 +08:00
DELL
d5be45ec95 [twitter]新增推特用户信息采集字段 2026-01-22 13:51:16 +08:00
DELL
b827e33dbd [twitter]新增推特推文翻译功能 2026-01-22 13:50:52 +08:00
DELL
7b3a83a1ab [twitter]用户名称对换 2026-01-21 17:53:48 +08:00
DELL
8631b0febf [twitter]新增用户信息采集功能 2026-01-21 17:52:17 +08:00
DELL
bf91c06801 翻译标题与内容字段替换-回滚 2026-01-21 15:39:07 +08:00
DELL
4d3cb2381a 翻译标题与内容字段替换 2026-01-21 11:01:27 +08:00
DELL
073f4325d0 es_isrepost 赋值修改为 1 2026-01-21 10:04:57 +08:00
DELL
8c84df0fdc [通用翻译] 翻译后标题修改 2026-01-20 17:23:31 +08:00
yuxin-pc
f7a210473a Merge branch 'main' of ssh://144.34.185.108:5282/osc-group/osc 2026-01-20 16:43:17 +08:00
yuxin-pc
ce478f495c 更新定义 2026-01-20 16:42:24 +08:00
DELL
92c8cdf9b2 [微博]redis 添加 cookie 成功请求获取返参 2026-01-20 16:36:44 +08:00
DELL
0008e619d1 [Twitter]删除多余注释 2026-01-20 16:35:35 +08:00
DELL
399165404e [通用翻译] 功能提交 2026-01-20 16:13:05 +08:00
DELL
9a36e9c5b5 [20260119]1、微信公众号扫码的脚本,改成调用Selenium Chrome,2、将TW、FB、微信公众号扫描调用Selenium的部分,抽象成一个方法;3、scrapy 框架 命令行启动注释 2026-01-19 17:18:53 +08:00
DELL
488bc2fdca Merge remote-tracking branch 'origin/main'
# Conflicts:
#	spiders/MediaSpiders/MediaSpiders/scrapy_selenium/middlewares.py
#	spiders/MediaSpiders/MediaSpiders/settings.py
#	spiders/MediaSpiders/MediaSpiders/spiders/TwitterUserSpider.py
#	spiders/MediaSpiders/run.py
2026-01-19 14:09:58 +08:00
yuxin-pc
89df3771e7 同步近期采集更改 2026-01-19 09:17:26 +08:00
DELL
a69ff25ce4 [twitterSpider]1、新增仿生物操作配置,2、修改业务逻辑,若自动化获取Cookie失败,则直接从redis中获取cookie;3、修改采集信息json层级 2026-01-16 16:30:41 +08:00
yuxin-pc
afe6c34db7 Update settings.py
调用服务地址改变
2025-12-23 13:46:47 +08:00
yuxin-pc
95ee8f5f59 Update WeiboUserSpider.py
批次大小和间隔时间修改
2025-07-23 15:33:43 +08:00
yuxin-pc
45110c22d3 Update WeiboUserSpider.py
适配新的ID
2025-07-22 15:11:06 +08:00
yuxin-pc
62fa085ec7 Update wechat_links_fetcher.py
更新UA
2025-06-24 09:47:08 +08:00
yuxin-pc
58bdf5cc0c Update middlewares.py
采集开始时显示并发线程
2025-06-13 09:41:04 +08:00
yuxin-pc
8f1999376f Update settings.py
配置项改回ZQ
2025-06-13 09:40:52 +08:00
yuxin-pc
cf4a6e2854 init 2025-05-28 19:16:17 +08:00