proposition
Country:
- North America > United States > Missouri > Boone County > Columbia (0.13)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.13)
- Europe > Switzerland > Basel-City > Basel (0.04)
- (2 more...)
Genre:
- Research Report (1.00)
- Workflow (0.67)
Technology:
Reward Machines for Deep RL in Noisy and Uncertain Environments
Reward Machines provide an automaton-inspired structure for specifying instructions, safety constraints, and other temporally extended reward-worthy behaviour. By exposing the underlying structure of a reward function, they enable the decomposition of an RL task, leading to impressive gains in sample efficiency.
Country:
- North America > Canada > Ontario > Toronto (0.14)
- South America > Chile (0.04)
- Europe > Italy (0.04)
- (2 more...)
Industry:
- Transportation > Ground > Road (0.47)
- Information Technology (0.46)
- Government (0.46)
- Education (0.46)
Technology:
Country:
- North America > United States > California > Santa Clara County > Stanford (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
Technology:
Country:
- Europe > Switzerland > Vaud > Lausanne (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Denmark (0.04)
Technology:
Country:
- Europe > France > Île-de-France > Paris > Paris (0.04)
- South America > Paraguay > Asunción > Asunción (0.04)
- North America > United States > Washington > King County > Bellevue (0.04)
- (5 more...)
Genre:
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.92)
Industry:
- Marketing (0.34)
- Information Technology > Services (0.34)
Technology:
Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Wisconsin > Dane County > Madison (0.04)
- (6 more...)
Industry:
- Government (0.92)
- Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
- Health & Medicine > Therapeutic Area (0.46)
Technology:
Active Bipartite Ranking
V arious dedicated algorithms have been recently proposed and studied by the machine-learning community. In contrast, active bipartite ranking rule is poorly documented in the literature. Due to its global nature, a strategy for labeling sequentially data points that are difficult to rank w.r.t. to the others is
Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)