GPT爬虫:一键采集网站数据、无缝构建GPTs知识库,免编程 | GPT-Crawler,网站内容转GPTs知识库的神器!

  Рет қаралды 45,785

AI学长小林

AI学长小林

Күн бұрын

【GPTs秘密武器】一键抓取网页内容,自动转为GPTs的知识库。无需编程、轻松构建强大的GPTs!
GPT-Crawler:github.com/BuilderIO/gpt-crawler
时间轴:
00:00 GPT爬虫工具介绍
00:31 GPT爬虫效果演示
03:55 如何安装和使用GPT爬虫
15:59 如何用GPT Actions获取实时数据
我的AI课程《ChatGPT实战指南》已开放,报名地址 🔽
【国际版】ailinplus-s-school.teachable....
【国内版】m.qlchat.com/wechat/page/chan...
【承接GPTs定制】linbintalk.com/GPTs-4ab54f0ac...
【ChatGPT成品号】nf.video/8PO57 (折扣码:xiaolin 享受9.3折)
【我的博客】linbintalk.com(每天更新ChatGPT&AI最新消息)
【订阅频道链接】 / @linbintalk
【问题咨询&免费领取AI资料】ailinplus
AI资料含:200+AI行业文档(每月更新)、吴恩达全套AI课程中文翻译版、100+AI视频剪辑素材
视频创作不易,如果你喜欢我的视频,请点赞+订阅;你的支持,是我前进的不竭动力!
这是一个人工智能、前沿科技的学习频道,分享AI、ChatGPT、元宇宙,以及前沿科技的干货内容。
频道精品视频:
1、最新数字人克隆教程,内含独家省钱技巧: • 最新数字人克隆教程:全流程手把手,轻松创建你...
2、一站式解锁:ChatGPT注册、ChatGPT Plus、OpenAI API充值: • 【实用教程】一站式解锁:ChatGPT注册、...
3、破解ChatGPT Token限制,最大化利用代码解释器,从此无惧长对话: • 破解ChatGPT Token限制,最大化利...
4、用ChatGPT打造自动化工作流程:写日报、资讯筛选、评论回复等: • 解放双手,让ChatGPT为你工作,实现自动...
5、用ChatGPT打造专属AI助理,实现智能问答、AI客服、内容创作: • 【解放自己】用ChatGPT打造专属AI助理...
6、一招让ChatGPT秒变AutoGPT,自动拆解、完成复杂任务,完美释放ChatGPT潜力: • 一招让ChatGPT秒变AutoGPT,自动...
7、将ChatGPT变成AI导师,只需要一个Prompt: • 将ChatGPT变成AI导师,只需要一个Pr...
8、ChatGPT批量创作小红书文案、爆款文章,打造全自动文案工厂: • ChatGPT批量创作小红书文案、爆款文章,...
9、ChatGPT多模态玩法汇总,这些应用场景你绝对想不到,快速精通GPT-4V: • ChatGPT多模态玩法汇总,这些应用场景你...
10、【AI革命】视频翻译隆重登场,音色、嘴型自动同步,让你的视频全球通用: • 【AI革命】视频翻译隆重登场,音色、嘴型自动...
-----------------------------
👉 微信:ailinplus
👉 Telegram粉丝群:t.me/linbintalk01
👉 我的推特: / linbintalk
#GPTs技巧 #AI自动化工具 #GPT爬虫 #GPTCrawler #GPTs知识库 #网站数据采集 #GPTs工具应用

Пікірлер: 112
@user-nkbzabh
@user-nkbzabh 6 ай бұрын
非常感谢博主,成功了
@linbintalk
@linbintalk 6 ай бұрын
🤝多多支持
@user-kt2yb3ux5u
@user-kt2yb3ux5u 7 ай бұрын
很赞,迫不及待实操
@linbintalk
@linbintalk 7 ай бұрын
这个真的好用
@ambitionaura_lucky
@ambitionaura_lucky 5 ай бұрын
看到一半实在忍不住了,不行,一定要点个赞!
@linbintalk
@linbintalk 5 ай бұрын
哈哈,感谢感谢
@user-uf8qe3ib1r
@user-uf8qe3ib1r 4 ай бұрын
讲得真好啊,林兄真的是想把我们教会啊🤣
@linbintalk
@linbintalk 4 ай бұрын
那是必须的、主打一个真教
@SJT-jb9gz
@SJT-jb9gz 5 ай бұрын
Great video. Wanna to learn how to actions to connect to other websites via API
@linbintalk
@linbintalk 5 ай бұрын
Welcome to subscribe
@willsun5943
@willsun5943 7 ай бұрын
我感觉这个也适合做数据分析,针对数字类或者文字类都行
@linbintalk
@linbintalk 7 ай бұрын
可以试试
@yuancao7536
@yuancao7536 4 ай бұрын
巨赞术
@linbintalk
@linbintalk 4 ай бұрын
😄,感谢支持
@sorter1024
@sorter1024 Ай бұрын
我又來學習了
@linbintalk
@linbintalk Ай бұрын
🙏🏻如果内容对你有帮助,拜托给我的视频点个赞
@sorter1024
@sorter1024 Ай бұрын
@@linbintalk 必須點贊,做個標記
@kuisun4622
@kuisun4622 7 ай бұрын
只能抓一般架构的网站,遇到动态页面还有大量表格和图像的网站直接乱成一坨...之前用这个来抓取一个比较复杂的网站,搞了半天,最后还是自己写python
@linbintalk
@linbintalk 7 ай бұрын
这个综合能力我觉得不错,对小白很友好,方便简单。python会的有几个
@Thisnthat979
@Thisnthat979 5 ай бұрын
@@linbintalk 我购买了您的课程,也正在学习python 中,听说python连小学生都要学的?
@cssa2893
@cssa2893 4 ай бұрын
怎么解决IP会被封呢
@sanzhao
@sanzhao 2 ай бұрын
感谢提醒
@yellowbonbon1
@yellowbonbon1 7 ай бұрын
这个方法有可能依赖于FE 的layout 和structure, 和才算是。举一个极端的例子,“飞行最长时间” 与 “46分钟” 这两个dom 看其他是同一行,大多数coding 的写法都会把他们放到同一个div,so 他俩是siblings 关系。假如他俩不是这种关系,例如layout 是两大columns(一个column是label,另一个column 是value),AI 还会找到答案吗?(我可能表达不清,不好意思)
@linbintalk
@linbintalk 7 ай бұрын
它比想象的聪明,会筛选排查
@3170ccp
@3170ccp 7 ай бұрын
FE?
@logicai4928
@logicai4928 6 ай бұрын
@@3170ccp 这个方法可能会受到前端(Front End)的布局(layout)和结构(structure)的影响,以及他们之间的关系。举一个极端的例子,“飞行最长时间”和“46分钟”这两个DOM元素,如果在视觉上他们位于同一行,那么在大多数编程实践中,我们会将他们放入同一个div元素中,这样他们就成了兄弟关系。但如果他们的关系并非如此,比如布局是分为两大列(一列是标签,另一列是值),那么人工智能(AI)是否还能找到答案呢?(我可能没有表达得很清楚,对此表示歉意)。
@regman1100
@regman1100 7 ай бұрын
您好,我是使用win 11,已確認安裝好,因為版本也有顯示,但是執行npm start後,執行也有跑完,但是並沒有出現output.json檔案,不知道是哪出問題了。不知道學長有沒有甚麼解決方法?!
@linbintalk
@linbintalk 7 ай бұрын
这样判断不了
@shenzhouzhao
@shenzhouzhao Ай бұрын
npm start 执行过程中报错信息如下,请问如何解决? (node:85468) [DEP0040] DeprecationWarning: The `punycode` module is deprecated. Please use a userland alternative instead.
@jasonhe9475
@jasonhe9475 6 ай бұрын
这个工具是否适合爬类似Twitter、微博这样的信息?刚才试了一下都有登录限制,有没有什么办法绕过限制的?
@linbintalk
@linbintalk 6 ай бұрын
不能
@user-uf7of1er3h
@user-uf7of1er3h 6 ай бұрын
请问这个对于同一个 url 下多页面内容,有办法实现翻页抓取吗。按视频的方法试了一下,只能抓到第一页的内容
@linbintalk
@linbintalk 6 ай бұрын
翻页可以在后面加page,找到链接规律手动更改
@salesRoger
@salesRoger 4 ай бұрын
请问一下是否可以把最终爬取的数据,导出Excel的文件格式?
@linbintalk
@linbintalk 4 ай бұрын
json变成excel很简单,都是格式化的数据
@musicears66
@musicears66 7 ай бұрын
那如果直接把网站网址给gpt 他是不是直接抓取内容了?
@linbintalk
@linbintalk 7 ай бұрын
@aixizhang
@aixizhang 6 ай бұрын
有些网站是可以的,有些会说不让访问
@user-iv6nn6yj9h
@user-iv6nn6yj9h 7 ай бұрын
您好,我是win11用户,我的config文件里没有selector:‘.docs--builder-container’,这行字。是否可以自己添加进去?
@linbintalk
@linbintalk 7 ай бұрын
可以
@shader406
@shader406 7 ай бұрын
npm 1指令执行以后要下很多东西吗?我这边下不停了
@linbintalk
@linbintalk 7 ай бұрын
不会很久, 是 i
@user-er9xg1fc1i
@user-er9xg1fc1i 6 ай бұрын
假设问题对应的答案中涉及到图片,它也能正常显示吗?
@linbintalk
@linbintalk 6 ай бұрын
图片不能,只会抓地址
@user-bs9xx6sn6m
@user-bs9xx6sn6m 6 ай бұрын
我没有安装 Homebrew,按说明安装的,运行版本git version 2.39.3 (Apple Git-145),npm10.2.3,再下一步打开config.ts文件,我电脑上找不到这个文件,咋么办?
@user-bs9xx6sn6m
@user-bs9xx6sn6m 6 ай бұрын
找到了。
@linbintalk
@linbintalk 6 ай бұрын
🙆‍♂️
@leescott7667
@leescott7667 6 ай бұрын
有可能在不買PLUS的狀況(或先試用)下使用嗎 ?
@linbintalk
@linbintalk 6 ай бұрын
只要能上传附件就可以。
@leescott7667
@leescott7667 6 ай бұрын
@@linbintalk 謝謝 可是不買PLUS好像沒辦法上傳..
@leescott7667
@leescott7667 6 ай бұрын
還是有其他可以分析抓下來Vector JSON的地方?
@makisekurisu_jp
@makisekurisu_jp 6 ай бұрын
@@linbintalk沒有用,即使使用擴展工具上傳json檔案也不能讓chatgpt回答問題。
@user-bj8fk6rl7u
@user-bj8fk6rl7u 6 ай бұрын
如何识别哪些网站反爬?
@linbintalk
@linbintalk 6 ай бұрын
爬一下就知道了
@StreetdanceFung
@StreetdanceFung 5 ай бұрын
出了這一句 > cross-env NODE_ENV=development npm run build && node dist/src/main.js
@linbintalk
@linbintalk 5 ай бұрын
可以用ChatGPT查原因解决
@ningcai4703
@ningcai4703 15 күн бұрын
爬虫生成的是本地json格式的数据,coze只支持本地csv和json格式的在线API,怎么整?
@linbintalk
@linbintalk 15 күн бұрын
转换一下格式试试
@jason9072
@jason9072 7 ай бұрын
完全按照步骤安装了,版本也对了,但是运行后成功0个,失败1个,不知道哪里出问题了
@linbintalk
@linbintalk 7 ай бұрын
换个网址试试,可能配置不对
@zhezhang4394
@zhezhang4394 7 ай бұрын
GPT-Crawler 可以控制爬虫的爬取速度么?如果太快的话,部分网站会被限速
@linbintalk
@linbintalk 7 ай бұрын
目前不能
@user-xk4hu2zb5c
@user-xk4hu2zb5c 4 ай бұрын
可以解析某个网站的视频内容吗
@linbintalk
@linbintalk 4 ай бұрын
这个方法不行,有其他方式
@tanhuaiguo
@tanhuaiguo 6 ай бұрын
請問可以抓取抖音短視頻的字幕文件嗎?
@linbintalk
@linbintalk 6 ай бұрын
这个不能,但是有其他工具
@tanhuaiguo
@tanhuaiguo 6 ай бұрын
@@linbintalk 或者大大也做個教學視頻供菜鳥學習?😁
@user-sj7mf8kw2u
@user-sj7mf8kw2u 2 ай бұрын
谢谢博主,我有2个问题,第一是网站更新了怎么办;第二是我想采集多个网站怎么办呢?
@linbintalk
@linbintalk 2 ай бұрын
一个个操作
@user-sj7mf8kw2u
@user-sj7mf8kw2u 2 ай бұрын
@@linbintalk thx
@user-jc7zo9en2m
@user-jc7zo9en2m 7 ай бұрын
在运行中,发生路径错误,该怎么解决?
@fittzgu3597
@fittzgu3597 5 ай бұрын
同问
@user-tt7xx9ur1s
@user-tt7xx9ur1s 2 ай бұрын
只有gpt4 能这么做 还是3.5也能这么做呢?
@linbintalk
@linbintalk 2 ай бұрын
都可以
@htslong
@htslong 6 ай бұрын
需要登录的网页怎么办?比如语雀
@linbintalk
@linbintalk 6 ай бұрын
不行
@user-pq7cg6uu8y
@user-pq7cg6uu8y 5 ай бұрын
z抓整个京东的网站数据它能行吗😁
@linbintalk
@linbintalk 5 ай бұрын
动态不行
@soapman2533
@soapman2533 4 ай бұрын
我直接用coze 根本就不用本地跑代码 直接添加网站到知识库创建机器人😂
@linbintalk
@linbintalk 4 ай бұрын
那还是有差距的、这是批量整站
@dongliang6663
@dongliang6663 Ай бұрын
请问下能爬取谷歌学术吗
@linbintalk
@linbintalk Ай бұрын
需要登录的网站不行
@user-jc7zo9en2m
@user-jc7zo9en2m 7 ай бұрын
安装Homebrew后,验证,brew -v 显示找不到
@user-jc7zo9en2m
@user-jc7zo9en2m 7 ай бұрын
是不是安装国内镜像的好一些?
@linbintalk
@linbintalk 7 ай бұрын
Mac?找不到就是没装成功
@linbintalk
@linbintalk 7 ай бұрын
@wishrevealingdestiny
@wishrevealingdestiny 3 ай бұрын
can you teach mme how to do with youtube + python to craw all data in order to have the top view on my video ? hah
@ericchan2540
@ericchan2540 5 ай бұрын
在国內ChatGPT 不友好的屏蔽 应如何解决 谢谢
@linbintalk
@linbintalk 5 ай бұрын
和你看油管一个方案
@derikli5727
@derikli5727 5 ай бұрын
自己的模型都下载下来么?
@linbintalk
@linbintalk 5 ай бұрын
??
@aixizhang
@aixizhang 6 ай бұрын
博主能不能讲一期这些AI工具怎么结合电商🥺
@linbintalk
@linbintalk 6 ай бұрын
我关注一下先。
@user-vh6pr1sj4j
@user-vh6pr1sj4j 5 ай бұрын
​@@linbintalk我也需要,买会员学AI就是为了电商
@chacexu8213
@chacexu8213 3 ай бұрын
又没有离线版本的
@linbintalk
@linbintalk 3 ай бұрын
离线怎么访问网站,怎么获取数据?
@uubob7408
@uubob7408 6 ай бұрын
就则???
@linbintalk
@linbintalk 6 ай бұрын
就这
@makisekurisu_jp
@makisekurisu_jp 6 ай бұрын
影片教學不完整,到導出json檔案後沒有後續的教學,還需要升級到GPT PLUS並設定custom gpt,如果使用api則需要去設定custom assistant。
@linbintalk
@linbintalk 6 ай бұрын
可以用playground里面的assistant,用API就能上传知识库,并在线使用
@makisekurisu_jp
@makisekurisu_jp 6 ай бұрын
@@linbintalk 我看了你頻道的其他影片,直接使用lobe chat就可以採集網站資料了,不需要自己去安裝GPT Crawler☺️
@makisekurisu_jp
@makisekurisu_jp 6 ай бұрын
@@linbintalk 我有一個需要請教的問題,我在這部影片的留言看到你說可以不使用gpts和assistant,只要可以上傳檔案就能使用GPT Crawler,我有安裝ChatGPT File Uploader Extended這個擴展,怎樣在沒有gpt plus和api的情況下執行GPT Crawler,因為工作中沒有很需要,只是極少情況會用,不太想花錢。
@user-lb5fu1io4b
@user-lb5fu1io4b 3 ай бұрын
不好用,信息太杂了,GPT还是理解不了 我试了一下,数据需要数据清洗。就是找到content也不行
@linbintalk
@linbintalk 3 ай бұрын
这种一般配合知识库使用
@Douglas-f
@Douglas-f 6 ай бұрын
爬虫什么的python也能搞,没必要搬个项目吧,哈哈哈哈🤣,gpt4一个月20$,你也不说一下,等小白们搞完爬虫才发现gpts要充钱才能用😅
@linbintalk
@linbintalk 6 ай бұрын
python小白更不会,逻辑很返常识
@mazizhang831
@mazizhang831 3 ай бұрын
@@linbintalk你心知肚明ChatGPT付费账户才是关键,并且哪怕你觉得不是问题也应该有提示,你却为了流量只字不提,确实有点不厚道!浪费别人时间等于谋财害命知道吗?
@user-gj2yn5ux3z
@user-gj2yn5ux3z 2 ай бұрын
@@mazizhang831 大佬,所以说免费的chatgpt3.5不可以使用吗?不能用的我就不浪费时间去试了
@user-cq1wc5tz7c
@user-cq1wc5tz7c 6 ай бұрын
°∆ I believe we are meant to be like Jesus in our hearts and not in our flesh. But be careful of AI, for it is just our flesh and that is it. It knows only things of the flesh (our fleshly desires) and cannot comprehend things of the spirit such as peace of heart (which comes from obeying God's Word). Whereas we are a spirit and we have a soul but live in the body (in the flesh). When you go to bed it is your flesh that sleeps but your spirit never sleeps (otherwise you have died physically) that is why you have dreams. More so, true love that endures and last is a thing of the heart (when I say 'heart', I mean 'spirit'). But fake love, pretentious love, love with expectations, love for classic reasons, love for material reasons and love for selfish reasons that is a thing of our flesh. In the beginning God said let us make man in our own image, according to our likeness. Take note, God is Spirit and God is Love. As Love He is the source of it. We also know that God is Omnipotent, for He creates out of nothing and He has no beginning and has no end. That means, our love is but a shadow of God's Love. True love looks around to see who is in need of your help, your smile, your possessions, your money, your strength, your quality time. Love forgives and forgets. Love wants for others what it wants for itself. Take note, true love works in conjunction with other spiritual forces such as patience and faith (in the finished work of our Lord and Savior, Jesus Christ, rather than in what man has done such as science, technology and organizations which won't last forever). To avoid sin and error which leads to the death of our body and also our spirit in hell fire, we should let the Word of God be the standard of our lives not AI. If not, God will let us face AI on our own and it will cast the truth down to the ground, it will be the cause of so much destruction like never seen before, it will deceive many and take many captive in order to enslave them into worshipping it and abiding in lawlessness. We can only destroy ourselves but with God all things are possible. God knows us better because He is our Creater and He knows our beginning and our end. Our prove text is taken from the book of John 5:31-44, 2 Thessalonians 2:1-12, Daniel 2, Daniel 7-9, Revelation 13-15, Matthew 24-25 and Luke 21. Let us watch and pray... God bless you as you share this message to others.
@linbintalk
@linbintalk 6 ай бұрын
What?
Osman Kalyoncu Sonu Üzücü Saddest Videos Dream Engine 170 #shorts
00:27
The child was abused by the clown#Short #Officer Rabbit #angel
00:55
兔子警官
Рет қаралды 15 МЛН
OMG🤪 #tiktok #shorts #potapova_blog
00:50
Potapova_blog
Рет қаралды 17 МЛН
Just try to use a cool gadget 😍
00:33
123 GO! SHORTS
Рет қаралды 85 МЛН
聊天就能编程!我用GPTs做了个自己的数字化身
10:54
【Max_AI】Browse.AI:一键爬取网站所有数据,你值得拥有
6:56
The 2024 AI tool rankings are out! 10 Great AI Tools
13:18
AI 山本 研究所
Рет қаралды 3,6 М.
OpenAI停止对中国服务:背后原因与影响深度解析
23:12
老范讲故事
Рет қаралды 37 М.
Osman Kalyoncu Sonu Üzücü Saddest Videos Dream Engine 170 #shorts
00:27