Skip to content

Latest commit

 

History

History
167 lines (121 loc) · 9.25 KB

migrate-bwdn.md

File metadata and controls

167 lines (121 loc) · 9.25 KB
copyright lastupdated
years
2015, 2017
2017-08-18

{:shortdesc: .shortdesc} {:new_window: target="_blank"} {:tip: .tip} {:pre: .pre} {:codeblock: .codeblock} {:screen: .screen} {:javascript: .ph data-hd-programlang='javascript'} {:java: .ph data-hd-programlang='java'} {:python: .ph data-hd-programlang='python'} {:swift: .ph data-hd-programlang='swift'}

Migrating from Watson Discovery News Original

A new version of {{site.data.keyword.discoverynewsshort}} debuted on 31, July 2017. The original version has been renamed {{site.data.keyword.discoverynewsshort}} Original and has been retired with a removal from service date of 15, January 2018.
{: shortdesc}

To migrate from {{site.data.keyword.discoverynewsshort}} Original to the new version, you need to make several changes, including updating any queries created for {{site.data.keyword.discoverynewsshort}} Original.

Note: If you have created a new instance of {{site.data.keyword.discoveryshort}}, you will only have access to the new version of {{site.data.keyword.discoverynewsshort}}. Access to both new {{site.data.keyword.discoverynewsshort}} and {{site.data.keyword.discoverynewsshort}} Original is only available in instances of {{site.data.keyword.discoveryshort}} created before 31, July 2017.

See Watson Discovery News for a description of this collection.

For a description and information about querying {{site.data.keyword.discoverynewsshort}} Original, see Watson Discovery News Original.

Service comparison

{{site.data.keyword.discoverynewsshort}} Original {{site.data.keyword.discoverynewsshort}}
{{site.data.keyword.discoverynewsshort}} Original is pre-enriched with the following Alchemy Language enrichments: Keyword Extraction, Entity Extraction, Concept Tagging, Relation Extraction, Sentiment Analysis, and Taxonomy Classification. The following additional metadata is also added: crawl date, publication date, URL ranking, host rank, and anchor text. {{site.data.keyword.discoverynewsshort}} is pre-enriched with the following {{site.data.keyword.nlushort}} (NLU) enrichments: Keyword Extraction, Entity Extraction, Semantic Role Extraction, Sentiment Analysis, Relations, and Category Classification. The following additional metadata is also added: crawl date and publication date. To learn more about NLU enrichments, see Adding enrichments.
{{site.data.keyword.discoverynewsshort}} Original was accessible through an environment that was unique to your service instance. When using {{site.data.keyword.discoverynewsshort}}, all users query the same environment and collection. This means that all references to your environment and collection need to be changed.
In {{site.data.keyword.discoverynewsshort}} Original, you receive information such as collection size, number of documents, etc. when retrieving the environment via the API. {{site.data.keyword.discoverynewsshort}} API does not return this information.

The following new fields are available in {{site.data.keyword.discoverynewsshort}}:

enriched_text.relations.arguments.text
enriched_text.relations.score
enriched_text.relations.sentence
enriched_text.relations.type
enriched_title.relations
enriched_title.relations.arguments
enriched_title.relations.arguments.entities
enriched_title.relations.arguments.entities.text
enriched_title.relations.arguments.entities.type
enriched_title.relations.arguments.text
enriched_title.relations.score
enriched_title.relations.sentence
enriched_title.relations.type
external_links
extracted_metadata
extracted_metadata.file_type
extracted_metadata.filename
extracted_metadata.sha1
forum_title
main_image_url

Many fields have been removed as well, for example blekko.hostrank, duplicate_url, domain, and more. See HERE External link icon for a complete list.

Moving Queries to the new Watson Discovery News

To move your queries from {{site.data.keyword.discoverynewsshort}} Original to the new {{site.data.keyword.discoverynewsshort}}, you need to modify all existing queries in the following ways:

  • Change the environment ID that the query is calling. The news environment name has been standardized across all {{site.data.keyword.discoveryshort}} service instances to:

    system

  • Change the collection ID that the query is calling. The news collection name has been standardized across all {{site.data.keyword.discoveryshort}} service instances to:

    news

  • Modify the query to use the new JSON path structure for the new {{site.data.keyword.discoverynewsshort}}. Most fields have changed paths, multiple fields have been added, and a selected group of low-value fields have been removed. See the field migration spreadsheet for full details HERE External link icon). For example, the following query:

    discovery/api/v1/environments/ae5790c2-592f-432a-804a-ee16de7154d7/collections/3edcd8f1-e25a-4f44-a069-58332ad17651/query?version=2017-08-01&query=entities.type:"Company"

    Should be changed to:

    discovery/api/v1/environments/system/collections/news/query?version=2017-08-01&query=enriched_text.entities.type:"Company"

Querying Watson Discovery News

You can query {{site.data.keyword.discoverynewsshort}} by using the API or one of the {{site.data.keyword.watson}} SDKs. Additionally, you can use the query building tooling to interactively construct queries.

To launch the {{site.data.keyword.discoveryshort}} tooling and query {{site.data.keyword.discoverynewsshort}}:

  1. Navigate to the Manage tab of your {{site.data.keyword.discoveryshort}} service instance.
  2. Click the Launch Tooling button.
  3. Click on the {{site.data.keyword.discoverynewsshort}} tile to open the {{site.data.keyword.discoverynewsshort}} overview page.
  4. Click Query this collection to open the My Data Insights screen. Click Build your own query to open the query builder.

Queries in {{site.data.keyword.discoverynewsshort}} are structured the same way as queries written for private data collections. See Building Queries and Delivering Content and Query Building Reference. {: tip}

Note: Do not expect identical results to be returned for similar queries in {{site.data.keyword.discoverynewsshort}} Original and {{site.data.keyword.discoverynewsshort}}. Crawl time, sources, and enrichments all combine to return different results.

Adding Watson Discovery News queries to your application

Use one of the following methods to add queries to your application. All of these examples query for enriched_text.entities with a text value of IBM (enriched_text.entities.text:IBM).

In all of the following examples, replace {username} and {password} with the username and password that are listed in the Service Credentials page of your service instance.

Using direct calls to the API

curl -u "{username}":"{password}" 'https://gateway.watsonplatform.net/discovery/api/v1/environments/system/collections/news/query?version=2017-08-01&query=enriched_text.entities.text:IBM'

{: pre}

Using the Watson Java SDK

Discovery discovery = new Discovery("2017-08-01");  
discovery.setEndPoint("https://gateway.watsonplatform.net/discovery/api/v1");
discovery.setUsernameAndPassword("{username}", "{password}");  
String environmentId = "system";
String collectionId = "news";

QueryRequest.Builder queryBuilder = new QueryRequest.Builder(environmentId, collectionId);  
queryBuilder.query("enriched_text.entities.text:IBM");  
QueryResponse queryResponse = discovery.query(queryBuilder.build()).execute();

{: codeblock}

Using the Watson Node.js SDK

var watson = require('watson-developer-cloud');  

var discovery = new DiscoveryV1({  
  username: '{username}',  
  password: '{password}',  
  version_date: '2017-08-01'  
});  

discovery.query(('system', 'news', 'enriched_text.entities.text:IBM'),  
  function(error, data) {  
    console.log(JSON.stringify(data, null, 2));  
  }
);

{: codeblock}

Using the Watson Python SDK

import sys  
import os  
import json  
from watson_developer_cloud import DiscoveryV1  

discovery = DiscoveryV1(  
  username="{username}",  
  password="{password}",  
  version="2017-08-01"  
)  

qopts = {'query': 'enriched_text.entities.text:IBM'}  
my_query = discovery.query('system', 'news', qopts)  
print(json.dumps(my_query, indent=2))  

{: codeblock}