{ "cells": [ { "cell_type": "markdown", "id": "aa629e55-8f41-41ab-b319-b55dd1cfc76b", "metadata": {}, "source": [ "# Playwright Scraper Showcase (Async in Jupyter)\n", "\n", "This notebook demonstrates how to run async Playwright-based scraping code inside JupyterLab using `nest_asyncio`.\n", "\n", "**Note:** Requires `openai_scraper_playwright.py` to be in the same directory." ] }, { "cell_type": "code", "execution_count": 1, "id": "97469777", "metadata": {}, "outputs": [], "source": [ "import nest_asyncio\n", "import asyncio\n", "nest_asyncio.apply()" ] }, { "cell_type": "code", "execution_count": 2, "id": "6254fa89", "metadata": {}, "outputs": [], "source": [ "from openai_scraper_playwright import EnhancedOpenAIScraper, analyze_content" ] }, { "cell_type": "code", "execution_count": 3, "id": "33d2737b", "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "### 1. Overall Summary of the Website:\n", "The website appears to be a hub for various applications of AI technology, particularly focusing on the capabilities of ChatGPT and other AI models developed by OpenAI. It offers a range of services from answering queries, assisting in planning trips, explaining technical topics, helping with language translation, and providing educational content. The site also features updates on new AI models, research publications, and business solutions integrating AI.\n", "\n", "### 2. Key Individuals or Entities:\n", "- **OpenAI**: Mentioned as the organization behind the development of AI models and technologies such as ChatGPT, GPT-4.1, and image generation models. OpenAI seems to be focused on advancing and applying AI in various fields.\n", "- **Lyndon Barrois & Sora**: Featured in a story, possibly highlighting individual experiences or contributions within the OpenAI ecosystem.\n", "\n", "### 3. Recent Announcements or Updates:\n", "- **Introducing our latest image generation model in the API** (Product, Apr 23, 2025)\n", "- **Thinking with images** (Release, Apr 16, 2025)\n", "- **OpenAI announces nonprofit commission advisors** (Company, Apr 15, 2025)\n", "- **Our updated Preparedness Framework** (Publication, Apr 15, 2025)\n", "- **BrowseComp: a benchmark for browsing agents** (Publication, Apr 10, 2025)\n", "- **OpenAI Pioneers Program** (Company, Apr 9, 2025)\n", "\n", "### 4. Main Topics or Themes:\n", "- **AI Model Development and Application**: Discusses various AI models like ChatGPT, GPT-4.1, and image generation models.\n", "- **Educational and Practical AI Uses**: Offers help in educational topics, practical tasks, and creative endeavors using AI.\n", "- **Business Integration**: Focuses on integrating AI into business processes, automating tasks in finance, legal, and other sectors.\n", "- **Research and Publications**: Shares updates on the latest research and publications related to AI technology.\n", "\n", "### 5. Any Noteworthy Features or Projects:\n", "- **GPT-4.1 and Image Generation Models**: Introduction of new and advanced AI models for text and image processing.\n", "- **OpenAI Pioneers Program**: A significant initiative likely aimed at fostering innovation and practical applications of AI technology.\n", "- **BrowseComp and PaperBench**: Research projects or benchmarks designed to evaluate and improve AI capabilities in specific domains.\n" ] } ], "source": [ "result = asyncio.run(analyze_content())\n", "print(result)" ] }, { "cell_type": "markdown", "id": "d7450ccf", "metadata": {}, "source": [ "โœ… If you see structured analysis above, the async code ran successfully in Jupyter!" ] }, { "cell_type": "markdown", "id": "9a46716c-6f77-4b2b-b423-cc9fe05014da", "metadata": {}, "source": [ "# ๐Ÿงช Playwright Scraper Output (Formatted)\n", "\n", "---\n", "\n", "## ๐Ÿงญ 1. **Overall Summary of the Website**\n", "\n", "*The website appears to be focused on showcasing various applications and updates related to OpenAI's technology, specifically ChatGPT and other AI tools. It provides information on product releases, company updates, and educational content on how to use AI technologies in different scenarios such as planning trips, learning games, coding, and more.*\n", "\n", "---\n", "\n", "## ๐Ÿง‘โ€๐Ÿ’ผ 2. **Key Individuals or Entities**\n", "\n", "- **OpenAI** โ€” Company behind the technologies and updates discussed on the website \n", "- **Lyndon Barrois & Sora** โ€” Featured in a story, possibly highlighting user experiences or contributions\n", "\n", "---\n", "\n", "## ๐Ÿ“ฐ 3. **Recent Announcements or Updates**\n", "\n", "- ๐Ÿ“ข **Introducing GPT-4.1 in the API** โ€” *(no date provided)*\n", "- ๐Ÿ–ผ๏ธ **Introducing 4o Image Generation** โ€” *(no date provided)*\n", "- ๐ŸŸ **Catching halibut with ChatGPT** โ€” *(no date provided)*\n", "- ๐Ÿง  **Thinking with images** โ€” *Apr 16, 2025*\n", "- ๐Ÿง‘โ€โš–๏ธ **Nonprofit commission advisors announced** โ€” *Apr 15, 2025*\n", "- โš™๏ธ **Updated Preparedness Framework** โ€” *Apr 15, 2025*\n", "- ๐ŸŒ **BrowseComp benchmark for browsing agents** โ€” *Apr 10, 2025*\n", "- ๐Ÿš€ **OpenAI Pioneers Program launched** โ€” *Apr 9, 2025*\n", "- ๐Ÿ“Š **PaperBench research benchmark published** โ€” *Apr 2, 2025*\n", "\n", "---\n", "\n", "## ๐Ÿ“š 4. **Main Topics or Themes**\n", "\n", "- ๐Ÿค– **AI Technology Applications** โ€” Using AI for tasks like planning, learning, and troubleshooting \n", "- ๐Ÿงฉ **Product and Feature Releases** โ€” Updates on new capabilities \n", "- ๐Ÿ“˜ **Educational Content** โ€” Guides for using AI effectively \n", "- ๐Ÿงช **Research and Development** โ€” Publications and technical benchmarks\n", "\n", "---\n", "\n", "## โญ 5. **Noteworthy Features or Projects**\n", "\n", "- โœ… **GPT-4.1** โ€” A new API-accessible version of the language model \n", "- ๐Ÿ–ผ๏ธ **4o Image Generation** โ€” Feature focused on AI-generated images \n", "- ๐Ÿš€ **OpenAI Pioneers Program** โ€” Initiative likely fostering innovation in AI \n", "- ๐Ÿ“Š **BrowseComp & PaperBench** โ€” Benchmarks for evaluating AI agents\n", "\n", "---\n", "\n", "โœ… *If you're reading this inside Jupyter and seeing clean structure โ€” your async notebook setup is working beautifully.*\n" ] }, { "cell_type": "markdown", "id": "95c38374-5daa-487c-8bd9-919bb4037ea3", "metadata": {}, "source": [] } ], "metadata": { "kernelspec": { "display_name": "Python 3 (ipykernel)", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.12.7" } }, "nbformat": 4, "nbformat_minor": 5 }